Comment: Migrated to Confluence 5.3
[Figure: Netarkivet Wayback Architecture (Gliffy diagram, version 6)]


The Wayback module reflects how Wayback is currently used in the Danish Webarchive, so it can be read as a description of the Danish Wayback solution.

The Danish Webarchive implementation of Wayback has the following primary components.

ArcRepository

All access to the harvested data and metadata goes through the ArcRepository interface from NetarchiveSuite. This ensures that we follow our own guidelines for bit preservation and restricted access, and it also lets us use the distributed ArcRepository architecture for high-performance indexing.
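The indexing idea can be illustrated with a minimal Python sketch. It assumes that each archive file can be indexed independently (for example on the node that holds it) and that the sorted partial results are merged afterwards. The file names, record layout, and the functions `index_file` and `build_index` are hypothetical and are not part of the NetarchiveSuite API; real CDX lines also use canonicalised URL keys and more fields.

```python
import heapq
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-in for archive files held on separate repository nodes.
# Each file name maps to simplified (url, timestamp, offset) records.
ARCHIVE_FILES = {
    "1-1.arc": [("http://example.org/", "20100101000000", 0),
                ("http://example.org/a", "20100101000500", 512)],
    "2-1.arc": [("http://example.com/", "20100202000000", 0)],
}


def index_file(filename):
    """Produce sorted, CDX-like lines for a single archive file."""
    lines = [f"{url} {ts} {offset} {filename}"
             for url, ts, offset in ARCHIVE_FILES[filename]]
    return sorted(lines)


def build_index(filenames):
    """Index every file in parallel, then merge the sorted partial results."""
    with ThreadPoolExecutor() as pool:
        partial = list(pool.map(index_file, filenames))
    # heapq.merge keeps the combined output sorted without re-sorting it all.
    return list(heapq.merge(*partial))


index = build_index(sorted(ARCHIVE_FILES))
for line in index:
    print(line)
```

Because every partial index is already sorted, the merge step stays cheap even when many files are indexed at once, which is the property a distributed setup would rely on.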

...

The access component consists of a Wayback installation running under Tomcat in Proxy Access Mode, using a composite local CDX index (see the Wayback documentation for details). The installation also includes the NetarchiveSuite Wayback plugin, which enables Wayback to extract harvested data from the archive via NetarchiveResourceStore or NetarchiveCachingResourceStore.
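The interplay between a composite index and a caching store can be sketched as follows. This is an illustrative Python model only, not the Wayback or NetarchiveSuite implementation: the classes `CompositeIndex` and `CachingStore`, the function `fake_archive_fetch`, and the simplified four-field index lines are all assumptions made for the example. It also assumes that the caching variant of the resource store avoids repeated fetches from the archive by keeping local copies.

```python
import bisect


class CompositeIndex:
    """Looks a URL up in several sorted CDX-like indexes and merges the hits."""

    def __init__(self, *indexes):
        self.indexes = [sorted(ix) for ix in indexes]

    def lookup(self, url):
        hits = []
        prefix = url + " "
        for ix in self.indexes:
            # Binary search for the first line that could match the URL.
            start = bisect.bisect_left(ix, prefix)
            for line in ix[start:]:
                if not line.startswith(prefix):
                    break
                hits.append(line)
        return sorted(hits)


class CachingStore:
    """Fetches records through a slow backend and keeps local copies."""

    def __init__(self, fetch):
        self.fetch = fetch
        self.cache = {}
        self.backend_calls = 0

    def retrieve(self, filename, offset):
        key = (filename, offset)
        if key not in self.cache:
            self.backend_calls += 1
            self.cache[key] = self.fetch(filename, offset)
        return self.cache[key]


def fake_archive_fetch(filename, offset):
    """Hypothetical stand-in for a request to the archive."""
    return f"<record {filename}@{offset}>"


index = CompositeIndex(
    ["http://example.org/ 20100101000000 0 1-1.arc"],
    ["http://example.org/ 20110101000000 128 7-3.arc",
     "http://example.org/a 20110101000500 900 7-3.arc"],
)
store = CachingStore(fake_archive_fetch)


def replay(url):
    """Resolve the newest capture of url and return its record, if any."""
    hits = index.lookup(url)
    if not hits:
        return None
    _, _, offset, filename = hits[-1].split(" ")
    return store.retrieve(filename, int(offset))
```

Calling `replay("http://example.org/")` twice reaches the simulated archive only once; the second call is served from the local cache.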

...