Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 3 Next »

Welcome to the Java Web Archive Toolkit

This wiki describes the overall packages and also includes some details on how the main classes are implemented.

The JWAT code was originally intended for use only in a number of JHove2 modules, but since the classes can be of use outside the JHove2 project, an independent project was created.

Note: Even though the repository is about 80mb, most of that is test data. The libraries themselves are very small!

Features

  • GZip support.
    • Reader with validation.
    • Writer with validation.
    • (Multi-file) GZip validating decompressor/compressor.
    • GZip Input/Output wrapper streams.
  • ARC support.
    • Reader with validation.
    • Writer with validation.
  • WARC support.
    • Reader with validation.
    • Writer with validation.
  • Encoding supported:
    • Base64, Base32 and Base16.
    • ISO8859-1.
    • UTF-8.
    • QuotedString.
    • EncodedWords.
    • URI including profiles.
  • Advanced header line reader.
  • HttpHeader request/response parser/validation.
  • Content-Type parser/validation.
  • Various special purpose stream implementations.
  • No labels