Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Contents

Table of Contents

Tools in Wayback Module

In addition to the tools described here, the NetarchiveSuite Java applications for continuous indexing of an arcrepository are described in the Configuration Manual. 

...

Wayback is a tool for browsing in webarchives. It can be downloaded from http://archive-access.sourceforge.net/projects/wayback/. The NetarchiveSuite plugin for wayback is a class NetarchiveResourceStore which implements org.archive.wayback.ResourceStore. NetarchiveResourceStore instantiates a connection to a NetarchiveSuite ArcRepository and retrieves archive data from it via NetarchiveSuite.  
In order to make use of the plugin, it is necessary to. :

  • Copy the required jar files into the lib-directory of your wayback installation.
  • Ensure that wayback has access to a NetarchiveSuite settings file with the necessary connection information.
  • Configure wayback to use NetarchiveResourceStore

The lib directory for wayback will be under

...

Code Block
java -cp dk.netarkivet.wayback.jar dk.netarkivet.wayback.DeduplicateToCDXApplication crawl1.log crawl2.log crawl3.log > out.cdx
Section



Column

Column
width100%
 
Column