...
- Full-text indexing and presentation
- Tool-support for mass-processing
- Corpus extraction
- Derived formats
- analysis + visualisation
- index-server API
- harvesting API
- Discovery API + Services
- WARC standard + usage
- Deduplication/revisits
- Standards and tools for metadata + provenance
- Integration of web- and nonweb- collections