Subprojects and tools
(review 2021 ANKM) A component based re-design (review 2021 ANKM)
A component based re-design
A Service Oriented Architecture for the NetarchiveSuite Frontend
Bitmagasinet som arkiv for Netarkivet
Browsertrix-cloud Installation At KB
Heritrix 3
Contain the thought on the inevitable migration to Heritrix 3
JhoNAS project - WARC support in JHOVE2 and NetarchiveSuite
IIPC sponsored project to leverage support for the WARC archiving format.
NAS Preload Tool
NAS-integration w/ Umbra
20240504 / TLR liste hentet fra Lazlo
* [warcdb](https://github.com/florents-Tselai/warcdb) - A command line utility (Python) for importing WARC files into a SQLite database. *(Stable)*
* [warcdedupe](https://gitlab.com/taricorp/warcdedupe) - WARC deduplication tool (and WARC library) written in Rust. (In Development)
* [warc-safe](https://github.com/natliblux/warc-safe) - Automatic detection of viruses and NSFW content in WARC files.
* [WarcPartitioner](https://github.com/helgeho/WarcPartitioner) - Partition (W)ARC Files by MIME Type and Year. *(Stable)*
* [warcrefs](https://github.com/arcalex/warcrefs) - Web archive deduplication tools. *Stable*
* [webarchive-indexing](https://github.com/ikreymer/webarchive-indexing) - Tools for bulk indexing of WARC/ARC files on Hadoop, EMR or local file system.