Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

This class contains the record parser, fields and validation.

Auxiliary classes

  • WarcHeaderLine.java: Reading a WARC header encapsulates each line in instances of this class.
  • WarcDateParser.java: Parses and validated an WARC date.
  • WarcDigest.java: Parses, validates and encapsulates a WARC digest header (algorithm, digest, encoding). The encoding is auto-detected and added later in the reading process.
  • WarcErrorType.java: Defines the different possible error types.
  • WarcValidationError.java: Defines an WARC validation error using a type, key and value.

Writers

  • WarcWriter.java: Abstract writer class which is the base for all the writers.
  • WarcWriterUncompressed.java: A writer implementation prototype.