...

Agenda for the joint BNF, ONB, SB and KB NetarchiveSuite tele-conference August the 14th 21th 2012, 13:00-14:00.

Practical

...

Skype-conference

  • Mikis will establish the Skype conference at 13:00 (please do not connect yourself).
  • TDC tele-conference (fallback if the Skype conference cannot be established):
    • Dial in number (+45) 70 26 50 45
    • Dial in code 9064479#
  • BridgeIT: The BridgeIT conference will be available about 5 minutes before the start of the meeting. The Bridgit URL is konf01.statsbiblioteket.dk. The Bridgit password is sbview.

Participants

  • BNF: Nicholas, Sara
  • ONB: Michaela and Andreas
  • KB: Tue, Søren and Nicholas
  • SB: Colin and Mikis, Sabine
  • Any other issues to be discussed on today's tele-conference?

Heritrix 3 in NetarchiveSuite


JhoNAS status (Nicholas)

A status update from the beginning of August was sent to the PWG and is accessible from this link: jhonas-project-status-aug.pdf

...

JHoNas NAS status

  • NAS-1965: Done, needs unit testing.
  • NAS-1960: Done, needs unit testing. Besides a WARCBatchJob also ArchiveBatchJob has been implemented for batch jobs running on both ARC and WARC.
  • NAS-1958: Tested in local installation.
  • NAS-1959: Done, needs unit testing.
  • NAS-1962: Done, needs unit testing. Problems with WARC and content-length=0.
  • NAS-1964: Done, needs unit testing. Problems with WARC and content-length=0.
  • NAS-2091: N/A
  • NAS-2090: N/A
  • NAS-2061: Currently it is a mirror of the ARC file.
  • NAS-2055: N/A
  • NAS-2070: N/A
  • NAS-1961: N/A
  • NAS-1720

 

Move source code to GitHub?

I think we should consider moving the code to GitHub because:

Iteration 52 (3.21 development release) (Mikis)

NAS-2018

Status of the production sites

  • Netarkivet: TLR

    The second broad crawl of 2012 (NR 15) finished at the beginning of July.
    The third broad crawl of 2012 (NR 16) was started this morning, August the 14th, using 3.18.3.
    Version 3.20.* is currently being tested, and we are preparing for production in mid-October.
    Our Wayback is now indexed up to July 2012, and I'm preparing and testing automatic indexing in production.
    Thanks to Jon and his son, we have downloaded thousands of YouTube videos over the last month.

    During the summer we had 2 production issues without major impact on the system:

    1) The SB SAN pillar was down for one day without affecting any harvesting, because the KB site was running and all harvesters at SB were independent servers with their own disk storage.
    2) We lost 1 day of harvesting because our admin server ran out of process resources. We are still investigating the logs for further explanations.

    2 questions for BNF:

    1) Can you show "Show comments" for harvested facebook.com sites?
    2) If you harvest YouTube, how do you link the YouTube "metadata" page with the actual video URL?

  • Netarkivet: SAS (status from a month ago)

...

As for our selective crawls: “business as usual”, that is to say: analysis of “candidates” (new sites proposed for selective crawls), QA of selective crawls, monitoring of harvest jobs, and revision of harvest profiles.

  • BNF:

 

  • ONB:

 

Date for NAS workshop at SB

Mid-October?

Date for next joint tele-conference.


September 11th?

Any other business?