...
Digital object category | Properties to be retained in preservation format | Original format | Original format is preservation format? | Preservation copy | Intermediate access copy | Archivematica normalisation for preservation | Archivematica normalisation for access |
---|---|---|---|---|---|---|---|
Raster image RGB (24/48 bits), grayscale (8/16 bits) and bitonal (1 bit) | Gray- or colour values, bit depth, resolution, colour space, if used: ICC profile | TIFF Baseline | Yes | N/A | JPEG | No | Yes |
TIFF/EP | Yes | N/A | JPEG | No | Yes | ||
TIFF/IIT | Yes | N/A | JPEG | No | Yes | ||
TIFF/FX | Yes | N/A | JPEG | No | Yes | ||
JPEG | Yes | N/A | JPEG | No | No | ||
PNG | Yes | N/A | PNG | No | No | ||
GIF | Yes | N/A | GIF | No | No | ||
JP2 (JPEG 2000 part 1) | No | TIFF | JPEG | Yes | NoYes | ||
Other | N/A | Baseline TIFF | JPEG | Yes | Yes | ||
Raw camera files | Gray- or colour values, bit depth, resolution, colour space, if used: ICC profile. | 3FR, ARW, CR2, CRW, DCR, ERF, KDC, MRW, NEF, ORF, PEF, RAF, RAW, X3F etc. | No | Baseline TIFF 6.0 uncompressed | TIFF | Yes | Yes |
DNG | No | Baseline TIFF 6.0 uncompressed | TIFF | Yes | Yes | ||
2D vector images (2D) | Hard to say as vector files can have many origins [2]. Some properties are: points, lines and areas. | SVG | Yes | SVG | N/A | Yes | No |
Other (AI, EPS etc) | N/A | SVG | SVG | Yes | Yes | ||
| DOC | No | N/A | N/A (no normalisation tool available) | No | N/A | |
DOCX | Yes | N/A | N/A | No | No | ||
ODT | Yes | N/A | N/A | No | No | ||
RTF | Yes | N/A | N/A | No | No | ||
WPD | No | N/A (no normalisation tool available) | N/A (no normalisation tool available) | N/A | N/A | ||
Yes | PDF (as is) | PDF (as is) | No | No | |||
Other | N/A | N/A (no normalisation tool available) | N/A (no normalisation tool available) | N/A | N/A | ||
PDF files | Yes | N/A | N/A | No | No | ||
PDF/A | Yes | N/A | N/A | No | No | ||
PDF/X | Yes | N/A | N/A | No | No | ||
PDF/E | Yes | N/A | N/A | No | No | ||
Text mark-up files | Tags, text | HTML | Yes | HTML | HTML | No | No |
XML | Yes | XML | XML | No | No | ||
Other | N/A | Original format | Original format | No | No | ||
Plain text | Content: text | TXT | Yes | TXT | TXT | No | No |
Other | N/A | Original format | Original format | No | No | ||
E-books |
| EPUB | Yes | EPUB | EPUB | No | No |
MOBI | No? | N/A | N/A | No | No | ||
Other | N/A | N/A (no normalisation tool available) | N/A (no normalisation tool available) | N/A | N/A | ||
Workflow chosen Nov 2017: Mailboxes: PST files are pre-Archivematica converted to MBOX. Individual mails: .msg files are stored as such, other formats (prereably converted to eml) |
| PST → should we ingest this as original format? | Yes? |
|
| N/A (no tool included) | N/A (no tool included) |
MBOX | Yes |
| MBOX | N/A (no tool included) | N/A (no tool included) | ||
MSG | Yes |
| MSG. Attached files are delivered as separate files. | N/A | N/A | ||
EML | Yes |
| EML. Attached files are delivered as separate files. | No | No | ||
Other → conversion of mailboxes are made before Archivematica | N/A |
| MBOX, EML. Attached files are delivered as separate files. | N/A (no tool included) | N/A (no tool included) | ||
Spreadsheets |
| XLS | No | N/A | N/A | No | No |
XLSX | No | N/A | N/A | No | No | ||
ODS | Yes | N/A | N/A | No | No | ||
Other | N/A | N/A (no normalisation tool available) | N/A (no normalisation tool available) | N/A | N/A | ||
Presentation files |
| PPT | No | N/A | N/A | No | No |
PPTX | Yes | N/A | N/A | No | No | ||
ODP | Yes | N/A | N/A | No | No | ||
Other | N/A | N/A | N/A | N/A | N/A | ||
Audio files |
| WAV | Yes | N/A | MP3 | No | Yes |
AIFF | Yes | N/A | MP3 | No | Yes | ||
MP3 | Yes | N/A | MP3 | No | No | ||
FLAC | Yes | N/A | MP3 | No | Yes | ||
M4A, AAC | Yes | N/A | MP3 | No | Yes | ||
Other | N/A | N/A | MP3 | Yes | Yes | ||
Video files | Audio:
Video:
| MKV-container file | Yes | MKV-container file with lossless FFV1-encoding for the video signal and LPCM -encoding for the audio signal. | MP4-container with a H.264-videostream and a AAC-audiostream | N/A | Yes |
Generic MXF container file | No | MKV-container file with lossless FFV1-encoding for the video signal and LPCM -encoding for the audio signal. (Was: MXF-container file with lossless JPEG2000-encoding for the video signal and LPCM-encoding for the audio signal.) | MP4-container with a H.264-videostream and a AAC-audiostream | N/A | Yes | ||
AVI | No | MKV-container file with lossless FFV1-encoding for the video signal and LPCM -encoding for the audio signal | MP4-container with a H.264-videostream and a AAC-audiostream | Yes | Yes | ||
MOV | No | MKV-container file with lossless FFV1-encoding for the video signal and LPCM -encoding for the audio signal | MP4-container with a H.264-videostream and a AAC-audiostream | Yes | Yes | ||
MPEG-2 | No | MKV-container file with lossless FFV1-encoding for the video signal and LPCM -encoding for the audio signal | MP4-container with a H.264-videostream and a AAC-audiostream | No | Yes | ||
MPEG-4 | No | MKV-container file with lossless FFV1-encoding for the video signal and LPCM -encoding for the audio signal | MP4-container with a H.264-videostream and a AAC-audiostream | No | Yes | ||
Other | N/A | MKV-container file with lossless FFV1-encoding for the video signal and LPCM -encoding for the audio signal | MP4-container with a H.264-videostream and a AAC-audiostream | Yes | Yes | ||
WARC The WARC file is the end product of a website harvesting proces (mostly by the use of Heretrix tool). | Yes | N/A | N/A | N/A | N/A | ||
Packed files (ZIP, RAR) | ZIP | No | ZIP file is unpacked, unpacked files are (pre-)ingested and original ZIP file deleted. | N/A | N/A | N/A | |
RAR | No | RAR file is unpacked, unpacked files are (pre-)ingested and original RAR file deleted. | N/A | N/A | N/A | ||
Other | No | Original packaged file is (if possible) unpacked, unpacked files are (pre-)ingested and original package file deleted. | N/A | N/A | N/A | ||
Databases (more research needed) | SIARD | Yes | N/A | N/A | No | No | |
CSV | Yes | N/A | N/A | No | No | ||
Microsoft Access database MDB (different versions - before 2000 problematic???) | No | N/A | N/A | No | No | ||
Microsoft Access database ACCDB | No | N/A | N/A | No | No | ||
Other | N/A | No normalisation | No normalisation | N/A | N/A | ||
Geographical information (GIS) - more research needed | GeoTIFF | Yes | N/A | N/A | No | No | |
ESRI Shapefiles (.shp en bijbehorende bestanden), GML??? | Geojson, TopoJSON | ? | ? | ||||
Unknown file format | Unknown file formats are stored as such. | As these file formats are unknown no access format can be made. | N/A | N/A |
...