Gliffy | ||||||||
---|---|---|---|---|---|---|---|---|
|
Introduction
The primary function of the NetarchiveSuite is to plan, schedule and archive web harvests of parts of the internet. We use Heritrix as our web-crawler.
NetarchiveSuite was released on July 2007 as Open Source under the LGPL license and is used by the Danish organization Netarkivet.dk http://netarkivet.dk. This organization has since July 2005 been using NetarchiveSuite to harvest Danish websites as authorized by the latest Danish Legal Deposit Act.
...
The NetarchiveSuite is split into four main modules: One module with common functionality and three modules corresponding to processes of harvesting, archiving and accessing, respectively.MainView.jpg
Gliffy | ||||||
---|---|---|---|---|---|---|
|
The Common Module
The framework and utilities used by the whole suite, like exceptions, settings, messaging, file transfer (!RemoteFile), and logging. It also defines the Java interfaces used to communicate between the different modules, to support alternative implementations.
...