Transcript
Page 1: IPTC Photo Metadata Conference - Digital images and digital preservation

@dart_ulcc http://dablog.ulcc.ac.uk

Digital images and digital preservation
IPTC Photo Metadata Conference, Zagreb, 2016

Ed Pinsent, ULCC

Page 2: IPTC Photo Metadata Conference - Digital images and digital preservation

About me

• Digital Archivist at ULCC since 2004
• Teaches digital preservation on the DPTP
• Background as archivist / records manager
• Experience in web-archiving, repository management, metadata projects, migration, digitisation, project management, etc.

• See more at digital archives blog http://dart.blogs.ulcc.ac.uk/

Page 3: IPTC Photo Metadata Conference - Digital images and digital preservation

Digital preservation

• Digital preservation means ensuring sustained, continued access to your content over a long time; it is not just backing up

• For image collections, this might mean:
1. Preservation of digital image files
2. Preservation (and continued management) of metadata

This presentation will define these targets of preservation, and propose some interventions

Page 4: IPTC Photo Metadata Conference - Digital images and digital preservation

Drivers for doing digital preservation

• You might want it, e.g.
– To respond to threats caused by software changes / format changes
– To respond to an entire system failure
– To have a definitive preservation copy, in case of exports going wrong
• If you’re managing an image collection, then you might want to:
– Protect your work
– Protect your investment
– Recognise that digitisation / image creation takes time and money
– Recognise that metadata curation takes time and money
• Other possible scenarios:
– You have a user community whose needs you must meet
– You have a commercial operation which would be strengthened by good preservation of the assets

Page 5: IPTC Photo Metadata Conference - Digital images and digital preservation

Production chain

Creators

Digital images

Archived digital images

Access copies

Authoritative library-approved metadata

Online DAM

Page 6: IPTC Photo Metadata Conference - Digital images and digital preservation

Metadata creation - embedded

Digital images

Photographer

Image software

IPTC Metadata

EXIF Metadata

Technical Metadata

Creators

Camera

Page 7: IPTC Photo Metadata Conference - Digital images and digital preservation

Metadata creation – not embedded, descriptive

Authoritative library-approved metadata

Online DAM

Descriptive metadata

Names, Titles, Dates, Keywords, Places

Page 8: IPTC Photo Metadata Conference - Digital images and digital preservation

Intervention 1: preservation storage

Archived digital images

Fixity / checksum

Page 9: IPTC Photo Metadata Conference - Digital images and digital preservation

Long-term storage

• Ideally, we’d want preservation-standard storage
• This is not the same as “backing-up” a network, or storing content on HD / USB Drives
• Preservation-standard storage is:
– Dedicated archival storage
– Little if any network traffic
– Fixity, validation, monitoring and reporting
– Multiple independent geo-redundant copies
• A place to keep our “master copies” of digital images, probably hi-res images stored in lossless formats, so they have a big footprint

Page 10: IPTC Photo Metadata Conference - Digital images and digital preservation

Fixity

• Crucial part of archival storage / long-term preservation

• Means of detecting change or corruption
• We must generate a checksum for each file
• Checksum = “fingerprint” of digital object; also called hash or fixity
• If checksum changes, this is an indicator that object has changed
• Regular validation of checksums is strongly recommended
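The fixity routine described above can be sketched in a few lines of Python. This is only an illustration, not a method named in the talk: the choice of SHA-256, the in-memory manifest, and the function names `sha256_of`, `build_manifest` and `validate` are all assumptions.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(folder: Path) -> dict[str, str]:
    """Map each file's path (relative to folder) to its checksum."""
    return {
        p.relative_to(folder).as_posix(): sha256_of(p)
        for p in sorted(folder.rglob("*"))
        if p.is_file()
    }


def validate(folder: Path, manifest: dict[str, str]) -> list[str]:
    """Return the files that are missing or whose checksum no longer matches."""
    problems = []
    for name, expected in manifest.items():
        target = folder / name
        if not target.is_file() or sha256_of(target) != expected:
            problems.append(name)
    return problems
```

In practice the manifest would be generated once at ingest, stored alongside (or separately from) the master copies, and `validate` would be run on a regular schedule; an empty result means no change was detected.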

Page 11: IPTC Photo Metadata Conference - Digital images and digital preservation

Intervention 2: metadata management

Technical Metadata

Metadata Extraction

Archived digital images

Preservation Database

Page 12: IPTC Photo Metadata Conference - Digital images and digital preservation

Technical metadata

• Metadata about the digital image, very often embedded in the file header

• Helps us identify the format with high degree of certainty

• <identity format="Graphics Interchange Format" mimetype="image/gif">

Page 13: IPTC Photo Metadata Conference - Digital images and digital preservation

Technical metadata

• Helps us identify specific elements of the image encoding that are necessary for rendition

• <byteOrder>little endian</byteOrder>
• <compressionScheme>LZW</compressionScheme>
• <imageWidth>550</imageWidth>
• <imageHeight>428</imageHeight>
• <colorSpace>RGB Palette</colorSpace>
• <orientation>normal*</orientation>
• <bitsPerSample>8</bitsPerSample>
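Elements like these are straightforward to read into a structured record once a characterisation tool has written them out as XML. The sketch below uses only the element names and values shown on the slide; the wrapping `<image>` element, the `NUMERIC_FIELDS` set and the function name `read_technical_metadata` are assumptions made so the example is well-formed and runnable.

```python
import xml.etree.ElementTree as ET

# Sample built from the elements shown on the slide.
# The <image> wrapper is an assumption, added so the fragment parses as XML.
SAMPLE = """
<image>
  <byteOrder>little endian</byteOrder>
  <compressionScheme>LZW</compressionScheme>
  <imageWidth>550</imageWidth>
  <imageHeight>428</imageHeight>
  <colorSpace>RGB Palette</colorSpace>
  <orientation>normal*</orientation>
  <bitsPerSample>8</bitsPerSample>
</image>
"""

# Fields converted to integers; everything else is kept as text.
NUMERIC_FIELDS = {"imageWidth", "imageHeight", "bitsPerSample"}


def read_technical_metadata(xml_text: str) -> dict:
    """Turn each child element into a key/value pair."""
    root = ET.fromstring(xml_text.strip())
    record = {}
    for child in root:
        value = (child.text or "").strip()
        record[child.tag] = int(value) if child.tag in NUMERIC_FIELDS else value
    return record
```

A record in this form can be stored, compared across copies, or checked against what a rendering tool needs (for example, byte order and compression scheme).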

Page 14: IPTC Photo Metadata Conference - Digital images and digital preservation

EXIF metadata

• EXIF metadata: very detailed record of camera (or scanner) information

• Embedded (automatically) in the file when the image is created

• May have some value / meaning as:
– a history of the file’s creation
– a history of hardware use
– a history of the production chain of the image

Page 15: IPTC Photo Metadata Conference - Digital images and digital preservation

IPTC metadata

• IPTC metadata: rights metadata and descriptive metadata; used for expressing copyright, IPR and ownership of a digital image

• Tends to be authored / created by the photographer, agency, image owner

• In some image file formats, such as JPEG, TIFF, and PSD, embedding IPTC metadata is standardised and supported

• Likely to have long-term value:
– Protects the owner’s rights
– Protects the image from unauthorised copying
– Adds meaning and context to the image
– Translates into a business / commercial value

Page 16: IPTC Photo Metadata Conference - Digital images and digital preservation

Intervention 2: metadata management

Technical metadata + EXIF metadata + IPTC metadata + UID

Metadata Extraction

Archived digital images

Preservation Database
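One possible shape for such a preservation database is sketched below with SQLite. The talk does not specify a schema: the table name `image_metadata`, its columns, the JSON encoding of extracted fields, and the example UID format are all hypothetical.

```python
import json
import sqlite3


def open_database(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS image_metadata (
            uid    TEXT NOT NULL,  -- identifier shared with the archived image
            source TEXT NOT NULL,  -- 'technical', 'exif' or 'iptc'
            fields TEXT NOT NULL,  -- extracted values, stored as JSON
            PRIMARY KEY (uid, source)
        )
        """
    )
    return conn


def store(conn: sqlite3.Connection, uid: str, source: str, fields: dict) -> None:
    """Save one block of extracted metadata, replacing any earlier version."""
    conn.execute(
        "INSERT OR REPLACE INTO image_metadata (uid, source, fields) VALUES (?, ?, ?)",
        (uid, source, json.dumps(fields, sort_keys=True)),
    )
    conn.commit()


def fetch_all(conn: sqlite3.Connection, uid: str) -> dict:
    """Return every metadata block held for one image, keyed by source."""
    rows = conn.execute(
        "SELECT source, fields FROM image_metadata WHERE uid = ?", (uid,)
    ).fetchall()
    return {source: json.loads(fields) for source, fields in rows}
```

Keeping each metadata type as its own row under the same UID means the extracted record survives independently of the image file, which matters if a later migration strips embedded metadata.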

Page 17: IPTC Photo Metadata Conference - Digital images and digital preservation

Intervention 3: migration

Archived digital images

Access copies

Migration

Migration

Another file format

Another file format

Migration

Page 18: IPTC Photo Metadata Conference - Digital images and digital preservation

Migration

• Common and well-understood approach to digital preservation

• E.g. migrate TIFF to JPEG 2000, if JP2 gives you more confidence as a long-term format for preservation

• E.g. migrate TIFF to PNG, to create access copies for dissemination or sale

• You would need to support migration tools, and target formats

• However, migration also introduces risks of loss, especially metadata
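A simple safeguard against that risk is to compare the metadata extracted before migration with the metadata extracted from the migrated file. The sketch below assumes both sets have already been read into plain dictionaries by whatever extraction tool is in use; the function name `lost_metadata` and the field names in the usage example are illustrative only.

```python
def lost_metadata(before: dict, after: dict) -> dict:
    """Report fields that were present before migration and are now missing or changed.

    Each entry maps a field name to (status, old_value, new_value).
    """
    report = {}
    for key, old_value in before.items():
        if key not in after:
            report[key] = ("missing", old_value, None)
        elif after[key] != old_value:
            report[key] = ("changed", old_value, after[key])
    return report
```

For example, comparing `{"Creator": "A. Photographer", "CopyrightNotice": "(c) Agency"}` with `{"Creator": "A. Photographer"}` reports `CopyrightNotice` as missing. Some technical fields, such as the compression scheme, are expected to change when the format changes, so a report like this needs human review rather than automatic rejection.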

Page 19: IPTC Photo Metadata Conference - Digital images and digital preservation

Risks of loss

Digital images

IPTC Metadata

EXIF Metadata

Technical Metadata

Migration process

Digital images

Page 20: IPTC Photo Metadata Conference - Digital images and digital preservation

Intervention 4: Preserve metadata from the DAMS

Online DAM

Descriptive metadata

Names, Titles, Dates, Keywords, Places

Export action

Any metadata created here goes beyond the original IPTC description, and is separate.

Add this to the Preservation Database

Page 21: IPTC Photo Metadata Conference - Digital images and digital preservation

How this all joins up…

Online DAM

Linking UID

Archival digital images

All metadata about digital images

UID

PRESERVATION ENVIRONMENT

Page 22: IPTC Photo Metadata Conference - Digital images and digital preservation

View of your preserved assets and metadata in preservation storage

Page 23: IPTC Photo Metadata Conference - Digital images and digital preservation

Next steps…

• How do we do all this?
• What is the software / who are the service providers?
• Is it expensive?
• What are the steps towards doing preservation?
• Who is usually responsible for preservation?
• How can we as archivists/image library people get the process going?
• What do we need to say to IT people to achieve best preservation?
• Who else is doing it?
• Is it commercially viable to preserve images in commercial image libraries?
• How do we adjust our workflow now to encompass the need for preservation later?
• How does a DAM system relate to preservation?

Page 24: IPTC Photo Metadata Conference - Digital images and digital preservation

Free ULCC Resources

• AOR Toolkit: info.ulcc.ac.uk/aortoolkit-iptc
• Free OAIS Course: info.ulcc.ac.uk/oais-iptc
• Cheat Sheet: info.ulcc.ac.uk/metadata-iptc16