Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migrated to Confluence 5.3

...

The fileID must conform to the regular expression: [a-zA-z0Z0-9\-_.] {5,250}

Example: The file

...

Code Block
B400022028241-RT1_400022028241-14/_1795-06-13-01_adresseavisen1759-1795-06-13-01-0006.jp2

...

  • File and directory name format: Most of the elements making up the directory and file names are either predefined name (UNMATCHED, ISO, TargerTarget) or number formats (0002, 1, 1890-10-18-01, 4000220289521). The only slightly complexed case is the newspaperID element, where the (current unwritten) format is [a-zA-z0Z0-9] (this is apparently not written down anywhere).
  • Bit repository The bitrepository only accepts FileIDs of the format [a-zA-z0Z0-9\-_.] {5,250}.

Note that the constraint on newspaperIDs must be implemented in the file structure check and shared with the MFPak people. The ingester itself also needs to protect itself from invalid newspaperIDs, as the file structure checker will only be run on the received files after ingest (should have been run at ninestars though).