Verifies the Batch GUI functionality
Standard functionality
- Go to the 'Quality Assurance' -> 'Batchjob Overview' page.
- Run the FilelistJob for 'JobID = 1' and 'filetype = Both'. Verify that only filenames starting with 1- are included.
- Run the ChecksumJob for 'JobID = .*' and 'filetype = Both'. Verify that only filenames starting with 1- are included.
Adding new BatchJobs
Install new BatchJobs on kb-test-adm-001.kb.dk:
$ ssh kb-test-adm-001.kb.dk $ cd /home/test/${TESTX}/ $ scp kb-prod-udv-001.kb.dk:test-batch/BatchJobs.jar .
Add the following to conf/settings_GUIApplication.xml in the commons section:
<batch> <batchjobs> <batchjob> <class>dk.netarkivet.common.utils.batch.ChecksumJob</class> <jarfile/> </batchjob> <batchjob> <class>dk.netarkivet.common.utils.batch.FileListJob</class> <jarfile/> </batchjob> <batchjob> <class>batchjobs.MimeSearch</class> <jarfile>BatchJobs.jar</jarfile> </batchjob> <batchjob> <class>batchjobs.URLsearch</class> <jarfile>BatchJobs.jar</jarfile> </batchjob> <batchjob> <class>batchjobs.ContentSearch</class> <jarfile>BatchJobs.jar</jarfile> </batchjob> <batchjob> <class>batchjobs.UrlAndMimeSearch</class> <jarfile>BatchJobs.jar</jarfile> </batchjob> </batchjobs> </batch>
Restart the GUI:
conf/restart.sh
Go to the GUI and verify that the new batch jobs are available.
Run all the BatchJobs on a snapshot harvest (settings the Job ID).
Run the MimeSearch BatchJob with argument text/html
and verify that the result is a list of html pages.
Run URLsearch BatchJob with arguments .kn\.dk/.*
. This should geberate a list of the kb harvested domains.
Run UrlAndMimeSearch with argument 'image/.*' for mimetype and '.*kb\.dk/.*' for url. Verify that only images from the kb domain is listed.