Netarkivet-curator-requirements

Pages in H1 GUI used by the curators at netarkivet.dk

The console (index.jsp):
Gives an overview of how the harvest job is proceeding: Job State, active Threads, and the percentage downloaded.

 

Reports.jsp: Gives deeper insight into which domains are stuck.

logs.jsp: Used to do a closer scrutiny on what is actually harvested - often on domain-basis, and also used when defining new filters in our test-environment. Selection by regular expressions is used to only show logs pertaining to a certain domain or part of a domain.

 

Console - View/Edit Frontier: Used for looking into the frontier, deleting URIs from the frontier, usually defined by a regular expression.

Jobs.jsp: The page in the H1 GUI is seldom used, but used nontheless by our curators.

help.jsp: The User Manual, information about regexps, and the URI Fetch Status codes is currently used.

 

The information on this page is mainly due to feedback by Tue Larsen, and Jon Eirikson, both at Netarkivet.dk