Standalone H3 standard behavior is to pause the job.
The Harvest Controller ignores the pauseAtStart and unpauses every job.
Checklist
Activity
Show:
Sara AubryFebruary 23, 2017 at 1:47 PM
To test this fix: 1) Edit a harvest template. 2) On the crawlController bean, set the pauseAtStart property to true. 3) Run a job using this template. 4) On the Running Jobs page, check the job is in Paused status and hasn't harvested anything.
The crawlController bean is between the disposition chain and the frontier :
<!-- DISPOSITION CHAIN (END) -->
<!-- CRAWLCONTROLLER (START)
Control interface, unifying context
-->
<bean id="crawlController" class="org.archive.crawler.framework.CrawlController">
<property name="maxToeThreads" value="250" />
<property name="recorderOutBufferBytes" value="4096" />
<property name="recorderInBufferBytes" value="65536" />
<property name="pauseAtStart" value="false" />
<property name="runWhileEmpty" value="false" />
<property name="scratchDir" value="scratch" />
</bean>
<!-- CRAWLCONTROLLER (START) -->
<!-- FRONTIER (START)
Standalone H3 standard behavior is to pause the job.
The Harvest Controller ignores the pauseAtStart and unpauses every job.