[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#1069602: RFP: rapidgzip -- Parallelized decompression of gzip (Python)



Package: wnpp
Severity: wishlist

* Package name    : rapidgzip
  Version         : 0.13.0 (from Git tags)
  Upstream Author : Name <somebody@example.org>
* URL             : https://github.com/mxmlnkn/rapidgzip
* License         : MIT
  Programming Lang: Python
  Description     : Parallelized decompression of gzip

rapidgzip can be used for parallel
decompression of almost any gzip file. Other
tools, such as bgzip, can only parallelize
decompression of gzip files produced by
themselves. rapidgzip works with all files,
especially those produced by the usually
installed GNU gzip.

And example. This benchmark uses the Silesia
corpus compressed as tar.gz
https://sun.aei.polsl.pl//~sdeor/index.php?page=silesia

rapidgzip --help

# Parallel decoding: 1.7 s
time rapidgzip -d -c -P 0 sample.gz | wc -c

# Serial decoding: 22 s
time gzip -d -c sample.gz | wc -c


Reply to: