Bug#1069602: RFP: rapidgzip -- Parallelized decompression of gzip (Python)
Package: wnpp
Severity: wishlist
* Package name : rapidgzip
Version : 0.13.0 (from Git tags)
Upstream Author : Name <somebody@example.org>
* URL : https://github.com/mxmlnkn/rapidgzip
* License : MIT
Programming Lang: Python
Description : Parallelized decompression of gzip
rapidgzip can be used for parallel
decompression of almost any gzip file. Other
tools, such as bgzip, can only parallelize
decompression of gzip files produced by
themselves. rapidgzip works with all files,
especially those produced by the usually
installed GNU gzip.
And example. This benchmark uses the Silesia
corpus compressed as tar.gz
https://sun.aei.polsl.pl//~sdeor/index.php?page=silesia
rapidgzip --help
# Parallel decoding: 1.7 s
time rapidgzip -d -c -P 0 sample.gz | wc -c
# Serial decoding: 22 s
time gzip -d -c sample.gz | wc -c
Reply to: