Excerpt |
---|
Description of how to use the toolkit and examples |
Since this project is mainly aimed at building a general purpose Web Archiving Toolkit these packages do not contain any applications, instead they are intended to be used as building blocks.
...
Code Block | ||||
---|---|---|---|---|
| ||||
<dependencies> <dependency> <groupId>org.jwat</groupId> <artifactId>jwat-common</artifactId> <version>0.9.0<1</version> </dependency> <dependency> <groupId>org.jwat</groupId> <artifactId>jwat-gzip</artifactId> <version>0.9.0<1</version> </dependency> <dependency> <groupId>org.jwat</groupId> <artifactId>jwat-arc</artifactId> <version>0.9.0<1</version> </dependency> <dependency> <groupId>org.jwat</groupId> <artifactId>jwat-warc</artifactId> <version>0.9.0<1</version> </dependency> </dependencies> |
...
GZip compression is only supported on ARC/WARC files where each record is compressed individually and concatenated into one file and not the case where the whole ARC/WARC file and all it's records are GZip'ed as a whole. The later mostly because this makes random access to individual record highly ineffective.
Child pages (Children Display) | ||||||
---|---|---|---|---|---|---|
|