Consolidate JHove2 ARC og Gzip module

Integration of the BnF developed ARC and Gzip modules into the JHove2 parent project (Planned ready December 2011).

The original modules are in a separate branch of the JHove2-project: https://bitbucket.org/lbihanic/jhove2-bnf.

The modules have been tested and reviewed.

In the processed of reviewing the code a few problems arose:

  • The JHove2 architecture has changed so the Arc/Gzip modules can not compile with the main JHove2 codebase.
  • The modules were interdependent and relied on changes to the core of JHove2 which have also changed.
  • The Arc code could not validate all the Arc code thrown at it.
  • The Arc code could be improved and simplified while retaining all functionality and becoming more robust.

Tasks - completed

  • JHove2: The Arc/GZip modules have been split in two.
  • JWAT: All the Arc/GZip processing code has been moved to a separate "project" also including the Warc processing code.
  • JWAT: The Arc/GZip code has been refactored.
  • JHove2: Simplified JHove2 modules have been run and tested but lack support required for Gzip'ed Arc files.
  • JHove2: GZip and Arc modules recoded and integrated.
  • JHove2: A new integration plan was found and implemented.