This document describes the underlying design of the NetarchiveSuite software. It does not describe how to install, run, or use NetarchiveSuite; for that, see the Installation Manual and the User Manual.


The first section gives an overview, and the remainder of the document gives more details about the design.

The code is available through the download site or from our GitHub repository.

Audience

The reader is expected to be familiar with Java programming and have an understanding of the core issues involved in large-scale web harvesting. Previous use of Heritrix is a definite plus, and an elementary understanding of SQL databases is required for some parts.
