...
- Make a new selective (event) harvest definition with a name you can remember
- Click 'Definitions'->'Selective Harvests' in the left menu
- Click 'Create new harvestdefinition' in the bottom of the main window
- Fill in the Harvest name and note the name for later use (from now referred as <eh. name>)
- Choose '''Once_an_hour''' in the drop down list for 'Schedule'
- Click Save (DO NOT CLICK ACTIVATE YET)
- Add seeds to the selective (event) harvest
- Click 'Edit' in column 6 on the line with the <eh. name>
- Write domain list from 'Seed list 1' given below to a file on your desktop e.g. notepad)
- Click 'Add seeds from a file' at the bottom of the main page
- Click 'Browse" and pick up the just created file with seeds
- Choose '''frontpages''' in the drop-down list for 'Harvest template' (set maxobjects pr domain to 500)
- Click 'Insert'
- Now click 'Add seeds'
- Choose '''frontpages_plus_2levels''' in the drop-down list for 'Harvest template'
- Write domain list from 'Seed list 2' given below (you can cut and paste from this page) (set maxobjects pr domain to 500)
- Click 'Insert'
- *Click 'Save'
- Check that seed lists for domains in Seed list 1 has changed correspondingly (You have to click on Show unused configurations/seedlists show all)
- For each of the domains =raeder.dk=, =statsbiblioteket.dk=, =netarkivet.dk= do:
- Click 'Definitions'->'Find Domain(s)'
- Search for domain by writing its name as text and click 'Search'
- Check that there exists a configuration with the name "<eh. name>_frontpages__" __"
- Check that there exists a seed list with the name "<eh. name>_frontpages
- Click 'Edit' in the line with seed list "<eh. name>_frontpages__" __",
- Check that the seed list shown corresponds to the seed list for the domain (see below)
- Check that seed lists for domains in Seed list 2 has changed correspondingly (you have to click on Show unused configurations/seedlists show all)
- For the domains =kaarefc.dk=, =netarkivet.dk= do:
- Click 'Definitions'->'Find Domain(s)'
- Search for =netarkivet.dk= by writing this text and click Search
- Check that there exists a configuration with name "<eh. name>_frontpages_plus_2levels
- Check that there exists a seed list with the name "<eh. name>_frontpages_plus_2levels__" __"
- Click 'Edit' in the line with seed list "<eh. name>_frontpages_plus_2levels
- Check that the seed list shown corresponds to the seed list for the domain (see below)
- Activate the harvest
- Click 'Definitions'->'Selective Harvests' in the left menu
- Click 'Activate' in column 5 on the line with the <eh. name>
- Check harvest status of the event harvest using menu "All Jobs"
- Click 'Harvest status'->'All Jobs' in the left menu
- Select "All" in "Only display job status" to the rigth from the menu
- Click the "Show" button, until the <eh. name> appears in a new job line (approx. after a minute)
- Check that two jobs appears and that they both have Harvest name <eh. name>
- Check the menu "Running jobs", that the jobs appears and that you can go to the Heritrix GUI. by clicking on the host link and by using the login/password: "admin"/"adminPassword" and close the window again.
...