Contents
Table of Contents |
---|
Tools in Wayback Module
In addition to the tools described here, the NetarchiveSuite Java applications for continuous indexing of an arcrepository are described in the Configuration Manual.
...
Wayback is a tool for browsing in webarchives. It can be downloaded from http://archive-access.sourceforge.net/projects/wayback/. The NetarchiveSuite plugin for wayback is a class NetarchiveResourceStore
which implements org.archive.wayback.ResourceStore
. NetarchiveResourceStore
instantiates a connection to a NetarchiveSuite ArcRepository and retrieves archive data from it via NetarchiveSuite.
In order to make use of the plugin, it is necessary to. :
- Copy the required jar files into the lib-directory of your wayback installation.
- Ensure that wayback has access to a NetarchiveSuite settings file with the necessary connection information.
- Configure wayback to use NetarchiveResourceStore
The lib directory for wayback will be under
...
Code Block |
---|
java -cp dk.netarkivet.wayback.jar dk.netarkivet.wayback.DeduplicateToCDXApplication crawl1.log crawl2.log crawl3.log > out.cdx |
Section | |||||||||
---|---|---|---|---|---|---|---|---|---|
|