...

Start an hourly selective harvest for the 'netarkivet.dk' domain.

Create a new template:

  • Under Definitions → Edit Harvest Templates, download the template "default_orderxml" by choosing "Save to disk" in the pull-down menu and clicking "Retrieve".
  • Edit the template so that max-size-bytes is 5000 in the WARCWriterProcessor. Do this as follows.
    • Find <bean id="simpleOverrides" and then, under bean → property → value, insert:

      metadata.jobName=default_orderxml_smallwarcs
      metadata.description=Default Profile generating small warc-files (5000 bytes)
      warcWriter.maxFileSizeBytes=5000
      disposition.maxPerHostBandwidthUsageKbSec=30
    • Under <bean id="metadata", modify the jobName property line to read:
      <property name="jobName" value="default_orderxml_smallwarcs" />
  • Upload the template with the filename changed to "default_orderxml_smallwarcs.xml".
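
The two template edits above can also be scripted rather than done by hand. The following is only a minimal sketch, not part of NetarchiveSuite: the function name patch_template and the trimmed SAMPLE template are hypothetical stand-ins, and the sketch assumes the downloaded file contains a <value> block inside the simpleOverrides bean and a single-line jobName property inside the metadata bean. It appends the override lines at the end of the <value> block and rewrites the jobName property.

```python
import re

# Override lines copied from the manual steps above.
OVERRIDES = (
    "metadata.jobName=default_orderxml_smallwarcs\n"
    "metadata.description=Default Profile generating small warc-files (5000 bytes)\n"
    "warcWriter.maxFileSizeBytes=5000\n"
    "disposition.maxPerHostBandwidthUsageKbSec=30\n"
)
NEW_JOB_NAME = "default_orderxml_smallwarcs"

# Hypothetical, heavily trimmed stand-in for the downloaded template
# (the real default_orderxml file contains many more beans and attributes).
SAMPLE = """<beans>
 <bean id="simpleOverrides">
  <property name="properties">
   <value>
metadata.jobName=default_orderxml
   </value>
  </property>
 </bean>
 <bean id="metadata">
  <property name="jobName" value="default_orderxml" />
 </bean>
</beans>
"""


def patch_template(xml_text: str) -> str:
    """Return a copy of the template text with both edits applied."""
    # 1. Append the overrides just before the </value> that closes the
    #    simpleOverrides block, so they come after any existing entries.
    bean_start = xml_text.index('<bean id="simpleOverrides"')
    value_end = xml_text.index("</value>", bean_start)
    patched = xml_text[:value_end] + "\n" + OVERRIDES + xml_text[value_end:]

    # 2. Rewrite the first jobName property found after the metadata bean opens.
    meta_start = patched.index('<bean id="metadata"')
    head, tail = patched[:meta_start], patched[meta_start:]
    tail, count = re.subn(
        r'<property name="jobName" value="[^"]*"\s*/>',
        f'<property name="jobName" value="{NEW_JOB_NAME}" />',
        tail,
        count=1,
    )
    if count != 1:
        raise ValueError("jobName property not found in the metadata bean")
    return head + tail
```

To use it on a real download, read the saved file into a string, pass it to patch_template, and write the result to "default_orderxml_smallwarcs.xml" before uploading. Treating the template as text instead of parsing it as XML keeps the rest of the file byte-for-byte unchanged, at the cost of relying on the assumed layout.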

...