...
Seed list 1 (Harvest template "frontpages"/"default_orderxml", maxhops=0, extract_javascript=true, robots.txt=ignore, max objects=500, max bytes=400,000,000):
Code Block |
---|
http://netarkivet.dk/adgang/ |
http://netarkivet.dk/in-english/ |
http://www.raeder.dk/ |
# Remove this line and the line below |
#http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php (is not visible from any of the harvesters in the test system, therefore replaced for now by the link below) |
http://localtimes.info/Europe/Denmark/Copenhagen/ |
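The seed list above mixes live URLs with commented-out lines. As a minimal sketch, assuming that lines starting with `#` are treated as comments and skipped (as the commented-out entries suggest), the snippet below shows which seeds remain. The helper name `parse_seeds` is hypothetical and not part of NetarchiveSuite.

```python
def parse_seeds(text):
    """Return the seed URLs in text, skipping blank lines and '#' comment lines.

    Assumption: a line whose first non-blank character is '#' is a comment.
    """
    seeds = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        seeds.append(line)
    return seeds


# Seed list 1 from this page, one entry per line.
SEED_LIST_1 = """\
http://netarkivet.dk/adgang/
http://netarkivet.dk/in-english/
http://www.raeder.dk/
# Remove this line and the line below
#http://kb-prod-udv-001.kb.dk/netarchivesuite/clock.php (not visible from the harvesters)
http://localtimes.info/Europe/Denmark/Copenhagen/
"""

for seed in parse_seeds(SEED_LIST_1):
    print(seed)
```

Under that assumption, four seeds remain: the two netarkivet.dk pages, raeder.dk, and the localtimes.info replacement link.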
Seed list 2 (Harvest template "frontpages_plus_2levels"/"default_orderxml", maxhops=2, extract_javascript=true, robots.txt=ignore, max objects=300, max bytes=500,000,000):
Code Block |
---|
http://netarkivet.dk/in-english/ |
http://www.kaarefc.dk/ |
http://www.kaarefc.dk/private/ |
http://www.kaarefc.dk/wop/ |
Seed list "<eh. name>_frontpagesdefault_orderxml_400000000Bytes_500Objects" for domain raeder.dk
...