Home / Web Archiving / WARC I/O Libraries > HadoopConcatGz WARC I/O Libraries > HadoopConcatGz A Splitable Hadoop InputFormat for Concatenated GZIP Files (and .warc.gz). (Stable)* Package 9 stars GitHub Back to Web Archiving