- Systems
- Overall, our systems work
- Firewall / network upgrade and consequent crashes in the Network Archive have filled a lot. ITD is working at high pressure to get a stable infrastructure in KBH, but we will probably have to enter 2021 before everything has been renewed and updated.
- Heritrix IIPC standard is in production
- SolrWayback is soon on its way into production and in a new updated version.
- Who uses our systems
- Browsing in the Online Archive:
- Statistics 6 months back: At least 1 external (but issues with seeing how many) and correspondingly 13 internal (for QA, development and much more).
- 40+ external user have access to our systems
- Delivery of data from Netarkivet takes place on an ongoing basis and I only expect it to be more comprehensive in the future.
- Collection
- Netarkivet has made a great effort in relation to Corona event harvesting
- Heritrix standard version may mean more efficient harvests from the 2nd cross-sectional harvest 2020, which is underway.
- Much of the interesting content at the moment. requires manual flows: Facebook (especially when it comes to comments)
- Still an issue to get certain types of dynamically generated content. Until we have other solutions e.g. browser-based harvesting uses Archive-IT and various work arounds (eg use of XML sitemaps that give us the URLs Heritrix does not immediately see).
- We are looking at how / if we can get Warc files from webrecorder / conifer.org in Netarkivet. It looks promising.
- Preservation
- We are well underway with the major projects in relation to DKM-077 - one online copy (Closed and part of DKM-085) and DKM-085 - Bitmagasine. Schedules have been made and special work is being done to refine the cutover process.
- Access
- Solrwayback on the way in production. Internal and external rejoice. It looks promising.
- Organization
- It goes well. Our more agile approach with daily stand-up meetings, review, retrospective and planning with a small group of Netarkiv people, provides added value.
- Cooperation
- More and more interest from external. Several from the Netarkivet have participated in a seminar on research in the Netarkivet for researchers at KU (KUB)
- I think there will be a higher demand for Netarkivet's content in the future.
- 40+ external users the last years
- The future
- NAS development and/vs. external solutions (decision proposals must be made)
- Webdanica analysis
- BcWeb
- SolrWayback further development / project setting.
- Workshop with ITU / DKU in relation to our access solutions - PyWb for playback of the Netarkivet in tandem with SolrWayback - IIPC will in future support 1 solution: PyWb (instead of Open Wayback as before)
- More dialogue with researchers about their wishes in relation to e.g. access solutions
- A long way to go to get enough resources / competencies in relation to the Network Archive's tasks (ITU + DKU). Part of 5-year plan.
|