Excerpt |
---|
The Bitrepository platform is used for long-term preservation of the newspaper data |
The bitrepository for newspapers will consist of:
...
In-house wiki description of the Bitrepository: https://sbprojects.statsbiblioteket.dk/display/DIGSAM/4.5+Bitbevaring+Avis
Ingester
The bitrepository ingester archives the jp2 files in the bitrepository archive. It does this by traversing the batch structure and performing the following steps for each jp2 file:
- Generate a unique FileID identifying the file in the repository. The FileID is constructed from the file name including its path in the batch structure, with the batch number as the root element. Path elements are separated by the '_' (underscore) character. The maximum length allowed is 250.
- Ingest the file into the bitrepository, verifying the ingest using the file's checksum.
- Register the archived file in the DOMS system.
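
The FileID construction and checksum check described above can be sketched as follows. This is a minimal illustration, not the ingester's actual code: the function names, the example path, and the choice of SHA-256 are assumptions, since this page does not name the checksum algorithm or show the implementation.

```python
import hashlib
from pathlib import PurePosixPath

# Limit stated in the step list above.
MAX_FILE_ID_LENGTH = 250


def make_file_id(path_in_batch: str) -> str:
    """Build a FileID from a relative path whose first element is the batch number.

    Hypothetical helper: path elements are joined with '_' as described above.
    """
    parts = PurePosixPath(path_in_batch).parts
    if not parts or parts[0] == "/":
        raise ValueError("expected a relative path rooted at the batch number")
    file_id = "_".join(parts)
    if len(file_id) > MAX_FILE_ID_LENGTH:
        raise ValueError(f"FileID is {len(file_id)} long, limit is {MAX_FILE_ID_LENGTH}")
    return file_id


def checksum(data: bytes, algorithm: str = "sha256") -> str:
    """Return the hex digest of the data (the algorithm is an assumption)."""
    return hashlib.new(algorithm, data).hexdigest()


def ingest_verified(local_data: bytes, reported_checksum: str) -> bool:
    """Compare the locally computed checksum with the one reported after ingest."""
    return checksum(local_data) == reported_checksum


# Hypothetical batch path, for illustration only.
example_id = make_file_id("B1/film-1/page-0001.jp2")
```

Because '_' may also occur inside file or directory names, a FileID built this way cannot always be mapped back to a unique path; the sketch does not attempt to handle that case.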
Data processing
As part of the general architecture, files are processed once they have been ingested into the repositories, i.e. the Bitrepository for data files (JPEG 2000) and DOMS for metadata.
...