Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Check filelist is correct and database is loaded with missing files 

  • Click on Bitpreservation
  • Click on "Update" under Filelist status
  • Open a new tab in your browser and go to System status: http://$GUIadminserver:$http-port/Status/Monitor-JMXsummary.jsp
  • Check that you get INFO messages like this INFO: The file 'TEST2_999.arc' was not found in the database. Thus creating entry for the file.

...

Check checksum is correct

  • Click on Bitpreservation
  • Click on "Update" under Checksum status
  • Open a new tab in your browser and go to System status: http://$GUIadminserver:$http-port/Status/Monitor-JMXsummary.jsp
  • Click on Instanse-ID.
  • Click on one of the the first bitarchive instanse-ID's.
  • Click Show all in the Index column and.
  • Verify that you get log messages like "INFO: The batchjob 'class dk.netarkivet.archive.arcrepository.bitpreservation.ChecksumJob' has run for 1938 seconds and has reached file '11297-MB100.arc' which is number 1615 out of 27620" each 30. sec ( be aware of, that the checksum logmessages can be delayed because of very big files > 1 GB)
  • And wait until the the checksum is completed without any errors. The last log message should read something like 'INFO: Finished batch job dk.netarkivet.archive.arcrepository.bitpreservation.ChecksumJob with result: 0 failures in processing 27620 files at 172.17.0.176_BitApp_E'
 

Stress test batch jobs

Setup test

Code Block
export TESTX=TEST11B
cd /home/test/$TESTX/
mkdir batchprogs
scp test@kb-prod-udv-001.kb.dk:/home/test/test-batch/* batchprogs/.

ChecksumJob

Calculating the MD5 checksum on the archive files

Run the following command:

Code Block
java -cp lib/dk.netarkivet.archive.jar -Ddk.netarkivet.settings.file=conf/settings_ArcRepositoryApplication.xml -Dsettings.common.applicationInstanceId=BATCH dk.netarkivet.archive.tools.RunBatch -Cbatchprogs/ChecksumJob.class -Ooutput.checksum

This should write out the following text messages in the console:

Code Block
Running batch job 'batchprogs/ChecksumJob.class' on files matching '.*' on replica 'KBN', output written to file 'output.checksum', errors written to stderr
Processed 11 files with 0 failures
Cleaning up dk.netarkivet.common.distribute.JMSConnectionSunMQ
Cleaned up dk.netarkivet.common.distribute.JMSConnectionSunMQ

The output is put into the file 'output.checksum'. This file should contain the following text:

Code Block
1-1-20090316092641-00003-kb-test-har-002.kb.dk.arc##c68b3e18f7b870b76d86de7970a822c2
2-2-20090316092643-00003-kb-test-har-001.kb.dk.arc##7d723dd4d374437c5e29e995521bf014
.......