Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Excerpt

Verifies the Batch GUI functionality

Standard functionality

  1. Go to the 'Quality Assurance' -> 'Batchjob Overview' page.
  2. Run the FilelistJob for 'JobID = 1' and 'filetype = Both'. Verify that only filenames starting with 1- are included.
  3. Run the  ChecksumJob for 'JobID = .*' and 'filetype = Both'. Verify that only filenames starting with 1- are included.

Adding new BatchJobs

Install new BatchJobs on kb-test-adm-001.kb.dk:

Code Block
$ ssh kb-test-adm-001.kb.dk
$ cd /home/test/${TESTX}/
$ scp kb-prod-udv-001.kb.dk:test-batch/BatchJobs.jar .

Add the following to conf/settings_GUIApplication.xml in the commons section:

Code Block
<batch>
  <batchjobs>
    <batchjob>
      <class>dk.netarkivet.common.utils.batch.ChecksumJob</class>
      <jarfile/>
    </batchjob>
    <batchjob>
      <class>dk.netarkivet.common.utils.batch.FileListJob</class>
      <jarfile/>
    </batchjob>
    <batchjob>
      <class>batchjobs.MimeSearch</class>
      <jarfile>BatchJobs.jar</jarfile>
    </batchjob>
    <batchjob>
      <class>batchjobs.URLsearch</class>
      <jarfile>BatchJobs.jar</jarfile>
    </batchjob>
    <batchjob>
      <class>batchjobs.ContentSearch</class>
      <jarfile>BatchJobs.jar</jarfile>
    </batchjob>
    <batchjob>
      <class>batchjobs.UrlAndMimeSearch</class>
      <jarfile>BatchJobs.jar</jarfile>
    </batchjob>
  </batchjobs>
</batch>

Restart the GUI: 

Code Block
conf/restart.sh

Go to the GUI and verify that the new batch jobs are available.

Run all the BatchJobs on a snapshot harvest (settings the Job ID).

 

Run the MimeSearch BatchJob with argument text/html and verify that the result is a list of html pages.

Run URLsearch BatchJob with arguments .kn\.dk/.*. This should geberate a list of the kb harvested domains. 

Run UrlAndMimeSearch with argument 'image/.*' for mimetype and '.*kb\.dk/.*' for url. Verify that only images from the kb domain is listed.