2021-03-09 Statusmeeting

Agenda for the joint NetarchiveSuite tele-conference 2021-03-09, 13:00-14:00.

Participants

  • BNF: Sara, Auriane, Clara
  • ONB: Andreas
  • KB/DK - Copenhagen: Tue, Stephen, Anders
  • KB/DK - Aarhus: Kristian, Colin
  • BNE: José, Alicia
  • KB/Sweden: Pär, Peter

Update on NAS latest tests and developments


Status of the production sites

Netarkivet

  • SolrWayback in production 
    • SolrWayback IIPC webinars last week and tomorrow at 08.30-09.30 CET:
      • RSS webinar: https://netpreserve.org/events/iipc-rss-webinar-solrwayback4/?occurrence=2021-03-10
      • Thomas Egense, Toke Eskildsen, Anders Klindt Myrvoll, Jesper Lauridsen, and Jørn Thøgersen of the Royal Danish Library, will be demonstrating the new SolrWayback. Youssef Eldakar and Mohamed Elsayed of the Bibliotheca Alexandrina, will talk about publishing the IIPC Covid-19 Collection using SolrWayback. Their presentations will be followed by a Q&A session chaired by Ben O’Brien, National Library of New Zealand and Peter Stirling, BnF.
  • Broad crawl step 2 just started- Fixed crawl time pr. job – 15 days!
  • Bitmagasin-development/testing is progressing
  • Smurf N-gram – Netarkivet - for the public https://labs.statsbiblioteket.dk/netarchive/ngram/#/
  • New website - https://www.kb.dk/en/find-materials/collections/netarkivet
  • Stephen on paternity leave until may
  • Kristian Bak leaving to be a robotics programmer

BnF

This month, one year after the beginning of the Covid-19 epidemic, the BnF organizes an event, around the Web Archives, in order to increase the visibility of the content harvested on this subject. A virtual guided tour, made up of about fifteen themes, will be published on March 17, on our Internet Archives. This date represents the anniversary of the first lockdown in France. A slide show will come with this tour and be published on the BnF website.

This week, we are also launching our first in-house harvesting workshop of the year 2021. The purpose is to carry out improvements to the harvest of websites with flash animations. We aim to study a way to automate the harvest of these websites, as much as possible.

Lastly, within the framework of the development of an offer of services for researchers, the BnF Data Lab, we are going to launch, this week, a selective crawl as part of the BodyCapital research program. Its object of study is the audiovisual representations of the body in the twentieth century, up to the birth of YouTube in 2005. So the common project that has been defined is to archive, preserve and document these audiovisual representations of the body on the internet.

ONB


BNE

  • This week we are working in a special harvesting about women’s day and we have created a form which people can collaborate with web sites about this subject by Twitter
  • We start testing SolrWayback like access tool
  • We are studying upgrade NAS from 5.4.2 to a new version
  • We are working for adapting the option “Bulk create records” of CWeb adding two own fields
  • We have planned to launch our annual broad crawl in April

KB-Sweden


Next meetings

  • April 6th
  • May 4th
  • June 8th
  • July 6th
  • September 7th
  • October 5th
  • November 2nd
  • December 14th
  • January 11th, 2022

Any other business?

·