...
- From
kb-prod-udv-001.kb.dk
, log into the harvester where kum.dk was being harvested (with user netarkdv if harvester is in Aarhus, and userdevel
if harvester is in Kbh). - Find the crawldir in TEST6/harvester_low
- Find the metadata warcfile in the metadata subdirectory and copy it to TEST6/
Create a Fake Crawl Dir
From kb-prod-udv-001.kb.dk
do:
Code Block |
---|
ssh netarkdv@sb-test-har-001.statsbiblioteket.dk cd TEST6/harvester_high cp -r ~netarkdv/testdata-h3/TEST6/23-fakejobdir . mkdir 23-fakejobdir/heritrix3/jobs/23-fakejobdir/logs touch 23-fakejobdir/heritrix3/jobs/23-fakejobdir/logs/crawl.log touch 23-fakejobdir/heritrix3/jobs/23-fakejobdir/logs/progress-statistics.log |
...