- Our 2022 broad crawl ended on November 22nd. The harvest lasted around six weeks, one week longer than last year, with a budget of 2700 URLs per domain (up from 2100 URLs in 2021). In total, 3 billion URLs were crawled, amounting to 151 TB.
- Next week, we will launch the "Social movements" and "Solidarity" harvests, for which 1037 and 473 websites have been selected, respectively. Each harvest will last two weeks, with a provisional budget of 1 TB each.
- Our internal harvesting workshop dedicated to podcasts began in November and will end on December 16th. We studied several podcast platforms, such as SoundCloud, Ausha and podCloud.
- On November 25th, a webinar was held on the harvesting of the electoral web and the research practices built around it, highlighting 20 years of election harvests. It was organized within the framework of the ResPaDon project, which aims to set up a network around web archives. The presentations provided feedback from the librarians who take part in the selection process and from the researchers working on the subject.
|