2019 NAS workshop
The 2019 NAS workshop will take place on February 20-22 and will be hosted by the National Library of Spain in Madrid.
Location: National Library of Spain (entrance on the ground floor)
Address: Paseo de Recoletos, 20-22 - 28071-Madrid
Tourist information: recommended things to see in Madrid
Hotels: see below
Participants:
Organization | Technical | Curator |
---|---|---|
Netarkivet | Colin Samuel Rosenthal Knud Aage Hansen Tue Hejlskov Larsen Kristian Bak | Anders Klindt Myrvoll Sabine Schostag Stephen Hunt |
ONB (via Skype) | Andreas P | Michaela |
BnF | Sara Aubry Clara Wiatrowski | Géraldine Camile |
BNE | Juan Carlos García José María Martín Fernando Monzón Luis Sánchez | Alicia Pastrana María Bueno María Ezquerra Yasmín Rommaneh Mar Pérez |
NL of Sweden | Thomas Roos | Pär Nilsson Peter Svanberg |
Topics to be discussed:
Technical | Curatorial |
---|---|
|
|
Agenda
Schedule for 20.02.2019 (12:30-17:30)
12:30 - 14:00 Arrival, sandwiches and coffee
14:00 - 14:15 Welcome (Ana Santos Aramburo, director of BNE)
14:15 - 14:30 Workshop introduction (Mar, Sara)
14:30 - 16:00 Institution updates and plans for 2019 (15 min each)
- Update from ONB (Michaela)
- Update from BNE (Mar, Juan Carlos)
- Update from KB Sweden (Pär)
- Update from BnF (Géraldine)
- Update from KB Denmark (Sabine, Tue, Anders)
16:00 - 16:15 Coffee break
16:15 - 17:30 NetarchiveSuite 5.5: demo and discussion of latest features including Umbra (Colin), Umbra usage and experiences, Feedback on tests with input from Clara (ppt), Tue (ppt)
20:00 Dinner (at own expense). Inclan Brutal Bar (Calle Álvarez Gato, 4)
Schedule for 21.02.2019 (9:00-17:00)
09:00 - 12:00
Technical track:
- Share NAS deployment and configuration in our institutions to identify used/unused components: See form.
- Discuss state of the art of current bugs and possible fixes
- Current bugs identified during ONB Daily Crawls & Domain Crawl 2018 (Andreas)
- Current bugs identified during BnF Domain Crawl 2018 (Clara)
- Review lists of NAS bugs and missing features and internal lists: NAS curator roadmap (NASC), BnF 2019 list
- JIRA issues labelled "Madrid":
- Discuss possible integration of OpenJava, latest H3 stable release and WARC 1.1
- Brainstorm on priorities and NAS codebase evolution for future developments
- Discuss the possibility to submit an IIPC project
Curator track:
- Review and update the NAS curator roadmap (NASC)
- Brainstorm on priorities for future developments from a curatorial perspective
- Discuss practices and challenges in coordinating external selections (Géraldine, Sabine, Mar)
10:30 - 10:45 Coffee break
12:00 - 12:30 Summary of curatorial and technical priorities
12:30 - 14:00 Lunch
14:00 - 17:00 Complex harvesting
Share experiences, practices and questions in the management of broad crawls:
- How do we monitor jobs during broad or big “deep” crawls?
- How do we manage huge webhotels (companies that host many websites)?
- How do we track parked domains (web parking)?
- How do we manage byte/object limits for different groups of domains?
Share experiences and practices in crawling and giving access to YouTube videos (Sara)
Share experiences in crawling social media (Facebook, Twitter, SlideShare, Flickr, Instagram)
Discuss possible further cooperation on these topics, common tools integration
15:30 - 15:45 Coffee break
17:00 - 19:00 Guided tour of the BNE
Schedule for 22.02.2019 (9:00-14:30)
09:00 - 10:30 Update on BCweb (Géraldine, Clara) - CSV sample
- Demo of BCweb new functionalities
- Update on BnF current and upcoming developments
- Update on open source status
- Discuss interest in upgrading and possible community developments
10:30 - 10:45 Coffee break
10:45 - 12:30 Access tools for web archives
- Demo of the SolrWayback search interface and playback engine for WARCs (Anders)
- Discuss perspectives, projects and questions in the different institutions (input Sara, Mar, ?)
- Browser, OpenWayback and CDX creation issues and development; experiences with other tools, e.g. pywb, SolrWayback
12:30 - 13:00 Community next steps
13:00 - 14:30 Lunch and goodbye
Hotels nearby: We suggest looking for offers on www.booking.com for the hotels below, as the Library cannot provide special rates.
- Hotel Gran Versalles ****
- Hotel Mediodía **
http://www.mediodiahotel.com/
- Hotel NH Collection Madrid Colón ****
https://www.nh-hoteles.es/hotel/nh-collection-madrid-colon
How to get there: https://www.google.com/maps/dir/Hotel+NH+Collection+Madrid+Col%C3%B3n,+Calle+Marqu%C3%A9s+de+Zurgena,+Madrid/Biblioteca+Nacional+de+Espa%C3%B1a,+Paseo+de+Recoletos,+20-22,+28001+Madrid,+Espa%C3%B1a/@40.4247258,-3.6917452,17.25z/data=!4m14!4m13!1m5!1m1!1s0xd422890e6513b27:0x1f5a70ccb1a0c263!2m2!1d-3.688885!2d40.426117!1m5!1m1!1s0xd4228907e039627:0xd5b5764d8a0a53f7!2m2!1d-3.6894324!2d40.4235049!3e3
- Hotel NH Madrid Alonso Martínez ***
https://www.nh-hoteles.es/hotel/nh-madrid-alonso-martinez
How to get there: https://www.google.com/maps/dir/Hotel+NH+Madrid+Alonso+Mart%C3%ADnez,+Calle+de+Santa+Engracia,+Madrid/Biblioteca+Nacional+de+Espa%C3%B1a,+Paseo+de+Recoletos,+20-22,+28001+Madrid,+Espa%C3%B1a/@40.4261357,-3.6950199,17z/data=!3m1!4b1!4m14!4m13!1m5!1m1!1s0xd42288c26ac374b:0xafe322614fff2c1!2m2!1d-3.6962236!2d40.4287247!1m5!1m1!1s0xd4228907e039627:0xd5b5764d8a0a53f7!2m2!1d-3.6894324!2d40.4235049!3e2
- Hotel Mora
http://www.hotelmora.com/
- Hotel One Shot Recoletos 04 ****
http://www.hoteloneshotrecoletos04.com/
- Hotel Ibis Styles Madrid Prado ***
http://www.ibis.com/es/hotel-8052-ibis-styles-madrid-prado/index.shtml
- Hotel Leonardo City Center
https://www.leonardo-hotels.es/leonardo-hotel-madrid-city-center
- Hostal Pizarro
http://www.hostalpizarro.eu/es/index.html
- Hostal Residencia Don Diego ***
http://www.hostaldondiego.com/
- Hostal Gallardo ***
www.hostalgallardo.com
- Hostal Retiro
http://www.hostalretiro.com
- Hostal Salamanca
http://hostalsalamanca.com/