4th broadcrawl step 2 - 2022 started a few weeks ago. More than 100 harvesters used concurrently (120 harvester capacity, 77 broadcrawlers)
Also working on other
big crawls.
part of the broadcrawl with selective harvesters.
Bytelimit downgraded 61K shops to 10K maxobjects and 499MB maxbyte
Event harvest
General election still running but will end soon
World Championship Soccer in Quatar- needs more seeds and then to be ended
IIPC WAC 2023
4 proposals approved:
SolrWayback: Best practice, community usage and engagement
Run your own full stack SolrWayback
Browser-Based Crawling For All: Getting Started with Browsertrix Cloud
Browser-Based Crawling For All: The Story So Far
JWAT for validation of Warc-files updated - there might be some more work on documentation.
Browserbased crawling for all IIPC-project proceeding. UX update will come soon with enhancements of exclusions and also using more explanations for each step/input.