...
Status of the production sites
Netarkivet
Panel | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|
We have finished our “little” broad crawl:
We have nearly finished the reorganization of our selective crawls according to the new strategy:
We renewed our account at Archive-IT, it is supposed to be used for Facebook crawls NAS 5.2 is released for developers test. Test for curators is planned for the end of October. We are upgrading the citrix installation, which gives access to wayback. We have testet Ilya Kraemers W/ARC player for displaying https pages: it works fine, but there are some security issues to be fixed. |
BnF
Panel |
---|
Start of our 2017 broad crawl on October 10th (4,4 million domains, 3500 URLs per domain, 40 crawlers working with 10 threads each during the day and 30 threads during the night because of bandwidth constraints). We expect to harvest 80 TB of data. We redesigned our Wayback and will give access to our full text indexed 1996-2000 collection with Shine in November. |
...