Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  • Full-text indexing and presentation
  • Tool-support for mass-processing
    • Corpus extraction
    • Derived formats
    • analysis + visualisation
  • index-server API
  • harvesting API
  • Discovery API + Services
  • WARC standard + usage
    • Deduplication/revisits
    • Standards and tools for metadata + provenance
  • Integration of web- and nonweb- collections
  • more automation of QA (crawl.log analyse)