
Rewrite Huffman Coding Implementation #196

Draft · wants to merge 58 commits into master from huffman-decoding
Conversation

ben-e-whitney (Collaborator)

These commits

  • refactor the original Huffman coding implementation;
  • add regression tests for huffman_encoding, huffman_decoding, compress_memory_huffman, and decompress_memory_huffman;
  • remove the timing statements for compress_memory_huffman;
  • add new Huffman coding functions, huffman_encode and huffman_decode;
  • add a new Huffman code serialization method, RFMH, and new lossless compression methods, CPU_HUFFMAN_ZLIB and CPU_ZSTD (renaming original CPU_HUFFMAN_ZLIB to CPU_ZLIB);
  • fix an asymmetry between zlib and ZSTD compression (CPU_ZLIB, formerly, CPU_HUFFMAN_ZLIB, didn't use Huffman coding, while CPU_HUFFMAN_ZSTD did);
  • reimplement compress and decompress, adding support for the new lossless compression methods and the new Huffman code serialization method;
  • rename include/compressors.hpp and the zlib and ZSTD compression functions;
  • split the frequency and 'missed' buffers into sequences of subbuffers of limited size so that Protobuf doesn't complain (I was getting errors linked to this; a sketch of the idea follows this list);
  • bump the file format version number to 1.1.0 and the MGARD version number to 1.3.0; and
  • add a lot of tests.
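
For reference, the subbuffer splitting amounts to chunking a byte buffer at a fixed size limit. A minimal sketch (the function and parameter names are illustrative, not the identifiers used in these commits):

#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Split `size` bytes starting at `data` into chunks of at most
// `max_subbuffer_size` bytes each, so that no single serialized Protobuf
// `bytes` field exceeds the chosen limit.
std::vector<std::vector<std::uint8_t>>
split_into_subbuffers(std::uint8_t const *const data, const std::size_t size,
                      const std::size_t max_subbuffer_size) {
  std::vector<std::vector<std::uint8_t>> subbuffers;
  for (std::size_t offset = 0; offset < size; offset += max_subbuffer_size) {
    const std::size_t n = std::min(max_subbuffer_size, size - offset);
    subbuffers.emplace_back(data + offset, data + offset + n);
  }
  return subbuffers;
}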

@ben-e-whitney (Collaborator, Author)

@qliu21, please don't merge this pull request yet. I'll rebase once you merge #194 and #195.

@ben-e-whitney (Collaborator, Author)

@JieyangChen7, I am going to need a little bit of help getting this pull request ready to merge. I just rebased on #197. When I try to build, I get the following error message.

$ cmake -S . -B build -D CMAKE_PREFIX_PATH="$HOME/.local" -D MGARD_ENABLE_SERIAL=ON
$ cmake --build build --parallel 8

lib/libmgard.so.1.3.0: undefined reference to `mgard::huffman_decoding(long*, unsigned long, unsigned char*, unsigned long, unsigned char*, unsigned long, unsigned char*, unsigned long)'
lib/libmgard.so.1.3.0: undefined reference to `mgard::huffman_encoding(long*, unsigned long, unsigned char**, unsigned long*, unsigned char**, unsigned long*, unsigned char**, unsigned long*)'
collect2: error: ld returned 1 exit status
CMakeFiles/mgard-x-autotuner.dir/build.make:107: recipe for target 'bin/mgard-x-autotuner' failed

(This is on 6e281a6.) Here's the issue, as far as I can tell.

  1. In include/mgard-x/CompressionLowLevel/CompressionLowLevel.hpp, compress calls CPUCompress and decompress calls CPUDecompress.
  2. In include/mgard-x/Lossless/CPU.hpp, CPUCompress calls mgard_x::compress_memory_huffman and CPUDecompress calls mgard_x::decompress_memory_huffman.
  3. mgard_x::compress_memory_huffman calls mgard::huffman_encoding and mgard_x::decompress_memory_huffman calls mgard::huffman_decoding.
  4. I've removed those functions. It's possible to achieve the same effect with compress and decompress (declared in include/lossless.hpp; see the sketch below), but that functionality is deprecated because of issues with the original Huffman coding implementation (see issue #190, "Assertion Failure in huffman_decoding").
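
To illustrate what calling the new API would look like there (the CPUCompress signature and the MemoryBuffer members below are assumptions for the sake of the sketch, not the actual declarations):

#include <cstddef>
#include <cstring>
#include "lossless.hpp" // declares the new mgard::compress (this PR)

// Hypothetical sketch: route the CPU lossless path through the new API
// instead of the removed mgard::huffman_encoding. mgard::compress is
// assumed to return an owning mgard::MemoryBuffer with `data` (a smart
// pointer) and `size` members.
void CPUCompress(long const *const quantized, const std::size_t n,
                 unsigned char **const out, std::size_t *const out_size) {
  const mgard::MemoryBuffer<unsigned char> buffer =
      mgard::compress(quantized, n);
  *out_size = buffer.size;
  *out = new unsigned char[buffer.size];
  std::memcpy(*out, buffer.data.get(), buffer.size);
}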

I would just modify CPUCompress and CPUDecompress to use the new Huffman coding implementation myself, but any code using the new implementation will need to interact with some new Protobuf fields I've introduced, and I don't know how to do that with your code. That's why I'm roping you in. Do you have any time before the 19th (the next MGARD software engineering meeting) to meet and figure out what changes are needed?

@JieyangChen7 (Collaborator)

@ben-e-whitney OK, let me take a look at this commit first and figure out possible solutions. I'll let you know.

@JieyangChen7 (Collaborator)

@ben-e-whitney I managed to fix this issue by calling the new compress/decompress API in include/mgard-x/Lossless/CPU.hpp. No major changes are necessary from your side.

There is one minor change needed. Since the new compress API returns mgard::MemoryBuffer, I have to (indirectly) include "utilities.hpp", which triggers the NVCC bug. So I added #ifndef __NVCC__ around the member functions related to mgard::CartesianProduct to make it work. Do you think this is fine?

Those changes are on my local machine. If you are OK with them, I can either push them directly to this huffman-decoding branch or let you know what changed so you can make the commit yourself. Which would you prefer?

@JieyangChen7 (Collaborator)

@ben-e-whitney I noticed that the new lossless compress/decompress implementation has lower throughput than the old implementation. Here are my results with the same data (134 million quantized elements) and settings (Zstd with level=1); I timed just the lossless compress/decompress functions. Is this performance expected? I want to make sure I'm using the functions in the right way.
Old: compression 0.51 s (Huffman 0.47 s + Zstd 0.03 s), decompression 0.54 s (Huffman 0.52 s + Zstd 0.01 s), compression ratio 208x
New: compression 2.11 s (Huffman 2.07 s + Zstd 0.03 s), decompression 3.13 s (Huffman 3.10 s + Zstd 0.01 s), compression ratio 219x
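
The timings are plain wall-clock measurements around just the lossless calls. A generic sketch of that kind of harness (nothing MGARD-specific is assumed; the measured entry point is passed in as a callable):

#include <chrono>
#include <utility>

// Wall-clock a single lossless compress or decompress call, excluding
// quantization and I/O. `F` stands for whatever entry point is under test.
template <typename F> double time_lossless(F &&f) {
  const auto start = std::chrono::steady_clock::now();
  std::forward<F>(f)();
  const auto stop = std::chrono::steady_clock::now();
  return std::chrono::duration<double>(stop - start).count();
}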

@ben-e-whitney (Collaborator, Author)

@JieyangChen7 Thanks for all the help.

There is one minor change needed. Since the new compress API returns mgard::MemoryBuffer, I have to (indirectly) include "utilities.hpp", which triggers the NVCC bug. So I added #ifndef __NVCC__ around the member functions related to mgard::CartesianProduct to make it work. Do you think this is fine?

Yes, I think that's fine. I'd even put all of CartesianProduct and CartesianProduct::iterator in the #ifndef __NVCC__ … #endif block. I think that might make any future compiler errors easier to understand.
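
Concretely, something like this, where the class body and template parameters are placeholders and the point is just that the guard encloses the whole template:

#include <cstddef>

// Keep the entire class template, not just the offending member
// functions, out of NVCC's sight.
#ifndef __NVCC__
template <typename T, std::size_t N> class CartesianProduct {
public:
  class iterator;
  // ... member functions that trigger the NVCC bug ...
};
#endif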

Sorry for all the trouble CartesianProduct/the NVCC bug is causing. If this keeps happening, I can split utilities.hpp into utilities/MemoryBuffer.hpp, utilities/CartesianProduct.hpp, etc. Then we could prevent NVCC from ever encountering CartesianProduct.

Those changes are on my local machine. If you are OK with them, I can either push them directly to this huffman-decoding branch or let you know what changed so you can make the commit yourself. Which would you prefer?

Please push them directly to this branch.

I noticed that the new lossless compress/decompress implementation has a lower throughput compared with the old implementation. … Is this performance expected?

Definitely not expected/intended, but I didn't do any performance regression tests, so I'm not shocked to learn that something is slow. I'll look into this next week and try to fix it before asking for this branch to be merged.

@JieyangChen7 (Collaborator)

@ben-e-whitney Thanks for the reply.

Yes, I think that's fine. I'd even put all of CartesianProduct and CartesianProduct::iterator in the #ifndef __NVCC__ … #endif block. I think that might make any future compiler errors easier to understand.

I have put both CartesianProduct and CartesianProduct::iterator in the #ifndef __NVCC__ block and pushed the changes to this branch. It compiles fine both on my machine and on GitHub, and it passes all MGARD-X tests on my machine using the new CPU lossless compressor. Some of your tests are failing, though. For example: /home/runner/work/MGARD/MGARD/tests/src/test_compress.cpp:248: FAILED. Do you know what might be wrong?

@ben-e-whitney (Collaborator, Author)

@JieyangChen7 I modified include/mgard-x/CompressionHighLevel/Metadata.hpp by replacing std::int64_t with QUANTIZED_INT in a few places in the commit I just pushed (4ef023d). Please give those changes a quick look.

I'm still working on the performance regression and the test failures.

Labels: bug, enhancement