...

  1. Click 'Harvest status'->'All Jobs' in the left menu
  2. Select "All" in "Only display job status" to the right of the menu
  3. Click the "Show" button repeatedly until the jobs have stepped through the statuses "NEW", "SUBMITTED", "STARTED", "DONE"
  4. Wait until all jobs have status "DONE"
  5. Check that you can search on Harvest name, start and end date
  6. Check that you can change the number of rows displayed per page, e.g. to 1
  7. Check that you can go to the next and previous page
  8. Check that the reset button resets all changes to their defaults (note that the display value is also blanked, but is 100 by default)
  9. Check the following for the domains '''raeder.dk''' and '''kb.dk''': (Using page Harvest Status -> All jobs per domain)
    1. Check that the domain has been harvested by one job of the name <eh. name>
    2. Check that this job has configuration <eh. name>_frontpages
    3. Check that there is a number for 'Run number' and 'Job ID'
    4. Check that the 'Start time' and 'End time' columns approximately correspond to the time of the test with the <eh. name> harvest
    5. Check that the 'Bytes Harvested' and 'Documents Harvested' columns contain positive numbers
    6. Check that the 'Stopped due to' column contains "Domain Completed"
  10. Check the following job details for the domain '''netarkivet.dk''': (Using page SelectiveHarvests->History->Run Number 0 ->JobID 1)
    1. Check that the 'Submit time', 'Start time' and 'End time' columns approximately correspond to the time of the test with the <eh. name> harvest
    2. Click on "Browse reports for jobs"
    3. Check that you don't get any errors when you click on some of the links
    4. Click on "Browse harvest files for job"
    5. Check that you don't get any errors when you click on some of the links
    6. Click on "Browse only relevant crawl-log lines for domain netarkivet.dk"
    7. Check that you don't get any errors when you click on some of the links
  11. Check the following for the domain '''netarkivet.dk''': (Using page Harvest Status -> All jobs per domain)
    1. Check that the domain has been harvested by 2 jobs of the name <eh. name>
    2. Check that one of the jobs has configuration <eh. name>_frontpages
    3. Check that the 'Start time' and 'End time' columns approximately correspond to the time of the test with the <eh. name> harvest
    4. Check that one of the jobs has configuration <eh. name>_frontpages_plus_2levels
    5. Check that the 'Start time' and 'End time' columns approximately correspond to the time of the test with the <eh. name> harvest
    6. Check that the 'Run number' and 'Job ID' columns contain positive numbers
    7. Check that the 'Bytes Harvested' and 'Documents Harvested' columns contain positive numbers
    8. Check that the 'Stopped due to' column contains "Domain Completed"
  12. Check the following for the domain '''kaarefc.dk''': (Using page Harvest Status -> All jobs per domain)
    1. Check that the domain has been harvested by 1 job of the name <eh. name>
    2. Check that the job has configuration <eh. name>_frontpages_plus_2levels
    3. Check that the 'Start time' and 'End time' columns approximately correspond to the time of the test with the <eh. name> harvest
    4. Check that the 'Run number' and 'Job ID' columns contain positive numbers
    5. Check that the 'Bytes Harvested' and 'Documents Harvested' columns contain positive numbers
    6. Check that the 'Stopped due to' column contains "Domain Completed"
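
The per-domain checks above repeat the same few conditions for each job row, so they could be written down once as a small helper. The following is only a sketch: the `JobRow` record, the `check_job_row` function and the 10-minute slack are hypothetical names and values chosen for illustration, and the row values are assumed to be copied by hand from the 'All jobs per domain' page. Nothing here calls the application itself.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class JobRow:
    """Hypothetical record for one row copied from 'All jobs per domain'."""
    harvest_name: str
    configuration: str
    run_number: int
    job_id: int
    start_time: datetime
    end_time: datetime
    bytes_harvested: int
    documents_harvested: int
    stopped_due_to: str


def check_job_row(row, harvest_name, test_start, test_end,
                  slack=timedelta(minutes=10)):
    """Return a list of problems found; an empty list means the row passes."""
    problems = []
    if row.harvest_name != harvest_name:
        problems.append("unexpected harvest name: " + row.harvest_name)
    # Assumption: run number 0 is accepted, since the procedure itself
    # refers to 'Run Number 0'; the job ID is expected to be positive.
    if row.run_number < 0 or row.job_id <= 0:
        problems.append("'Run number' or 'Job ID' is not a valid number")
    # 'Approximately corresponds' is modelled as a window with some slack.
    if row.start_time < test_start - slack or row.end_time > test_end + slack:
        problems.append("start/end time outside the test window")
    if row.end_time < row.start_time:
        problems.append("end time is before start time")
    if row.bytes_harvested <= 0 or row.documents_harvested <= 0:
        problems.append("harvested bytes/documents are not positive")
    if row.stopped_due_to != "Domain Completed":
        problems.append("stopped due to: " + row.stopped_due_to)
    return problems
```

A row passes when `check_job_row(...)` returns an empty list. Checks on the configuration name are left out because the expected name differs per domain and per job.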

Follow the schedule of the next job

...