Storage | ||
---|---|---|
Active data | During the research project data will be collected and registrations of the method have to be documented. | |
In which way are collections and methods organised? (Hint: Organizing files and folders) | ||
How much storage space will the data consume? | ||
Which data formats get used (ARC, WARC, CDX, PDF, CSV, ...)? | ||
Version Control | Which files get version controlled (sensitive data are not allowed to be that)? | |
Which version control system is used? | ||
Which version of the version control system is used? | ||
Durable formats | Endorsed and published by standards agencies (http://www.digitalpreservation.gov/formats/fdd/descriptions.shtml, http://www.digitalpreservation.gov/formats/) | |
Publicly documented, i.e. complete authoritative specifications are available | ||
Widely used and accepted as best practice | ||
Data validation and authentication | Current data volume - total size in MB/GB/TB - and likely rate of growth | |
Number of files and folders, and how they are organised | ||
Platform - Mac/windows/Linux | ||
Applications used to access and work with your data | ||
Frequency of update, e.g. working data that changes daily, or data from project that needs to be retained but would not be used often | ||
Data type(s): spreadsheets, database, documents, images, datasets, etc. | ||
Any special security needs, e.g. personal data, commercial potential | ||
Access control: Who needs access to which areas? Do they have access to Netarkivet? If not, where are they from and who are they, e.g. journalist, lawyers, journals etc. | ||
Backup | Is there a backup strategy? | |
How many copies are there? | ||
Are they placed in another place than the main data storage? | ||
Are they placed secure? | ||
On which devices are they placed? | ||
Hvilke værktøjer anvendes (navn, version og systemkrav)? (gerne link til værktøjet) Hvilken platform og versionsnummer anvendes (windows, linux, mac)? | Per / Kulturarvscluster |
Backup
Researchers and research administrators must have a backup strategy to recover data after loss and/or to recover data from a particular time. It may not be possible to store the data in the State and University Library. In such cases a suitable place has to be found for backup of the data. Backups of more than one copy should be performed regularly, and should be housed remotely from the main data storage. The backups should be labelled and well organised to facilitate any data restoration process.
Backup security requires further mention. If the data is sensitive then it should not be stored on a computer that is connected to the internet, and preferably not connected to any network. If the data needs to be destroyed at the end of a project then consider what level is required – a hard drive will need to be overwritten several hundred times to ensure that no data can be recovered.
The lifetime of backups should also be considered. Burned optical discs have average lifetime of two years, and five years if kept in a cool dark place.
It is a good idea to check with the IT staff to find out how often they backup, what is the maximum amount of data they can backup, and how long they keep old backups.
You may need to maintain your own backups if:
- There are no services available to you
- You have valuable data that you do not trust with other people
- You have sensitive data that you cannot store on unsecure computers