...
- Log in to the harvester on which kum.dk was being harvested
- Find the crawl directory in TEST6/harvester_low
- Find the metadata WARC file in the metadata subdirectory and copy it to /tmp
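The lookup-and-copy step above can be sketched as a small shell helper. This is only an illustration, not part of the test procedure itself: the function name `copy_metadata_warc` is hypothetical, and it assumes the metadata files sit in a `metadata` subdirectory of the crawl directory and end in `.warc` or `.warc.gz`.

```shell
#!/bin/sh
# Hypothetical helper (not part of NetarchiveSuite): copy every WARC file
# found under <crawldir>/metadata into a destination directory.
# Assumption: metadata files are named *.warc or *.warc.gz.
copy_metadata_warc() {
    crawldir=$1          # path to the crawl directory
    dest=${2:-/tmp}      # destination directory, /tmp by default

    find "$crawldir/metadata" -type f \
        \( -name '*.warc' -o -name '*.warc.gz' \) \
        -exec cp {} "$dest"/ \;
}
```

It would be called with the crawl directory found in the previous step as the first argument; the second argument is optional and defaults to /tmp.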
Create a Fake Crawl Dir
...
Code Block |
---|
ssh netarkdv@sb-test-har-001.statsbiblioteket.dk |
...
cd TEST6/harvester_high |
...
cp -r |
...
~netarkivdv/testdata-h3/TEST6/23-fakejobdir |
...
mkdir 23-fakejobdir/heritrix3/jobs/23-fakejobdir/logs |
...
touch 23-fakejobdir/heritrix3/jobs/23-fakejobdir/logs/crawl.log |
touch 23-fakejobdir/heritrix3/jobs/23-fakejobdir/logs/progress-statistics.log |
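The `mkdir` and `touch` steps above can also be wrapped in one re-runnable helper. This is a minimal sketch under stated assumptions: the function name `make_fake_logs` is hypothetical, it assumes the Heritrix job name equals the base name of the job directory (as it does for 23-fakejobdir above), and it uses `mkdir -p` instead of plain `mkdir` so that a second run does not fail.

```shell
#!/bin/sh
# Hypothetical helper: create the logs directory and the two empty log
# files inside a fake job directory.
# Assumption: the job name under heritrix3/jobs/ equals the base name of
# the job directory, e.g. 23-fakejobdir/heritrix3/jobs/23-fakejobdir.
make_fake_logs() {
    jobdir=$1
    name=$(basename "$jobdir")
    logs="$jobdir/heritrix3/jobs/$name/logs"

    # -p creates missing parents and is a no-op if the directory exists.
    mkdir -p "$logs"
    # touch creates the files if absent and only updates timestamps otherwise.
    touch "$logs/crawl.log" "$logs/progress-statistics.log"
}
```

Run from the harvester directory, `make_fake_logs 23-fakejobdir` would produce the same two files as the commands above.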
Wait 3 Hours then Restart the System
...