allocation optimization for lz4frame compression #1158

Cyan4973 · 2022-09-08T07:15:19Z

as noted by @yixiutt in #1157, the temporary buffer managed by lz4frame compression context is being invalidated at end of compression, forcing it to be re-allocated at next compression job.

This shouldn't be necessary.
This behavior was introduced in #236, as a way to fix #232, but neither the issue is explained, nor why the patch fixes it.

This patch reverts to previous behavior,
where temporary buffer is reused between compression calls.
This results in a net reduction of allocation workload.

Additionally, the temporary buffer should only need malloc(), not calloc(), thus saving some potential 0-initialization cost.

This diff implements both changes. It's expected to improve compression speed when repetitively compressing small data.

Performance impact on M1 laptop :

filename	size	cSpeed before	cSpeed after	delta
enwik7	1000000	485 MB/s	485 MB/s	+0%
enwik6	100000	505 MB/s	508 MB/s	+1%
enwik5	10000	862 MB/s	1037 MB/s	+20%
enwik4	1000	359 MB/s	940 MB/s	+160%

As expected, performance difference is only perceptible for small data, but it can matter a lot in this case.

Once this diff is merged, long fuzzer tests will be run to ensure that no sanitizer warning gets triggered.

Additionally :

fixed a minor ubsan warning in LZ4F_decompress()
added an LZ4F_compressUpdate() test to fullbench

@yixiutt

as noted by @yixiutt, the temporary buffer inside lz4frame compression is being invalidated at end of compression, forcing it to be re-allocated at next compression job. This shouldn't be necessary. This change was introduced in #236, as a way to fix #232, but neither the issue is explained, nor why the patch fixes it. This patch revert to previous behavior, where temporary buffer is kept between compression calls. This results in a net reduction of allocation workload. Additionally, the temporary buffer should only need malloc(), not calloc(), thus saving initialization. This diff implements both changes. Long fuzzer tests will be run to ensure that no sanitizer warning get triggered. Additionally : fixed a minor ubsan warning in LZ4F_decompress().

yixiutt · 2022-09-08T07:17:38Z

LGTM

Cyan4973 added 2 commits September 7, 2022 22:50

added LZ4F_compressUpdate() in fullbench

2822825

Cyan4973 merged commit 72997c5 into dev Sep 8, 2022

Cyan4973 mentioned this pull request Sep 8, 2022

[performance](lz4f) why LZ4F_compressEnd set maxBufferSize=0 #1157

Closed

Cyan4973 deleted the fix1157 branch September 14, 2022 17:27

moonfruit mentioned this pull request Jul 22, 2024

lz4 1.10.0 Homebrew/homebrew-core#178056

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

allocation optimization for lz4frame compression #1158

allocation optimization for lz4frame compression #1158

Cyan4973 commented Sep 8, 2022 •

edited

Loading

Uh oh!

yixiutt commented Sep 8, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

allocation optimization for lz4frame compression #1158

allocation optimization for lz4frame compression #1158

Conversation

Cyan4973 commented Sep 8, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yixiutt commented Sep 8, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Cyan4973 commented Sep 8, 2022 •

edited

Loading