Dublin Core Metadata
-
Upload
olga-ratliff -
Category
Documents
-
view
63 -
download
1
description
Transcript of Dublin Core Metadata
Besser--Dublin Core Metadata 2/14/02 1
Dublin Core Metadata
Howard Besser
UCLA School of Education & Information
http://www.gseis.ucla.edu/~howard
Besser--Dublin Core Metadata 2/14/02 2
Metadata for Digital Libraries- Models for Digital Libraries Importance of Metadata Standards Types and Uses of Metadata Discovery Metadata: The Dublin Core
Besser--Dublin Core Metadata 2/14/02 3
Key problems we’re facing
Discovery Longevity- Interoperability-
Besser--Dublin Core Metadata 2/14/02 4
Traditional Digital Library Model
DL
DL
DL
DL
useruser
search & presentation
search & presentation
search & presentation
search & presentation
Besser--Dublin Core Metadata 2/14/02 5
Ideal Digital Library Model
DL
DL
DL
DL
useruser
search & presentation
Besser--Dublin Core Metadata 2/14/02 6
For Interoperability Digital Libraries Need Standards
Descriptive Metadata for consistent description
Discovery Metadata for finding Administrative Metadata for viewing and
maintaining Structural Metadata for navigation ... Terms & Conditions Metadata for
controlling access...
Besser--Dublin Core Metadata 2/14/02 7
Why are Standards and Metadata consensus
important? Managing digital files over time Longevity Interoperability Veracity Recording in a consistent manner Will give vendors incentive to create
applications that support this
Besser--Dublin Core Metadata 2/14/02 8
Why Standards? Why do we need standards?
– To make information universally available to users– facilitate sharing and interchange of information– To preserve information (make it safe from
changes in hardware and software) Standards only work if communities widely
accept them, but they’re necessary for communities to work together
Besser--Dublin Core Metadata 2/14/02 9
Why are you Managing this Information?
Organizational mission & type Users Uses
Besser--Dublin Core Metadata 2/14/02 10
Questions to Ask
What communities is this standard designed for? What type of information is this standard designed to
handle? What functions is this standard designed to serve? What previous standards is it built upon? Does the standard prescribe how to create new records (or
parts of records), or how to map from existing records? How far does the standard go? Semantics: Does it define
element sets? Rules? Syntax?-
Besser--Dublin Core Metadata 2/14/02 11
What is Metadata
_ Structured data describing other data used to find or help manage information resources
_ Aids in interoperability_ Titles, dates, captions, cataloging and
indexing data, file headers, rights info, provenance, code books, transaction logs, ...
_ One person’s metadata is another’s data
Besser--Dublin Core Metadata 2/14/02 12
Sorting through the Standards Morass
_ Data Structures (DC, CDWA, MARC, VRA Core, TEI, EAD, MESL data dict)
_ Data Interchange (Z39.50)
_ Data Values/vocabularies (LCSH, AAT, ULAN, TGN)
_ Data Content/syntax (AACR2)
Besser--Dublin Core Metadata 2/14/02 13
Semantics/Syntax/Structure
_ Semantics– meaning, as defined by a community to meet their particular needs
(DC)
_ Syntax– a systematic arrangement of data elements for machine processing
– facilitates the exchange and use of metadata among various applications (HTML, XML, RDF)
_ Structure– a formal arrangement of the syntax with the goal of consistent
representation of the semantics (rules defining field contents like 1/11/99)
Besser--Dublin Core Metadata 2/14/02 14
What is MetadataTypes & Uses
lots of different ways of dividing the clusters
Besser--Dublin Core Metadata 2/14/02 15
Uses of Metadata
_ Discovery & Retrieval_ Identification/Provenance_ Rights Management_ Viewing_ Integrity_ Longevity_ Content rating
Besser--Dublin Core Metadata 2/14/02 16
Containers and Packages of Metadata
Warwick, not MARC
_ modular_ overlapping_ extensible_ community-based_ designed for a networked world to aid
commonality btwn communities while still providing full functionality within each community
Besser--Dublin Core Metadata 2/14/02 17
Some different schemes where Metdata is kept
_ embedded withing the object (HTML tags)_ in a separate related DB maintained by same
organization (OPAC, MOA II)_ in a separate DB maintained by a separate
organization (Books in Print, ratings systems)
_ derived on-the-fly from a different scheme (MARC-to-DC)
Besser--Dublin Core Metadata 2/14/02 18
Collaborative Metadata Projects
Dublin Core NSF/ERCIM Digital Collaboratory OCLC CORC Project- Visual Resources Association (VRA) Core Encoded Archival Description (EAD) Computerized Interchange of Museum Information
(CIMI)- Records Export for Art and Cultural Heritage
(REACH)
Besser--Dublin Core Metadata 2/14/02 19
Dublin Core (3/95)
_ improve resource discovery_ anticipate precision problems of Web Crawler-
based searching tools_ existing metadata could be “dumbed down”_ elements should be simple to understand and use,
so that any individual should be able to assign terms him/herself
_ software might eventually automatically generate very base-level metadata
Besser--Dublin Core Metadata 2/14/02 20
Dublin Core
Title Creator Subject Description Publisher Contributors Date Type
Format Identifier Source Language Relation Coverage Rights
Besser--Dublin Core Metadata 2/14/02 21
Dublin Core
every element is both optional and repeatable elements are cross-disciplinary elements are extensible by organized communities can employ a syntax such as html’s
<META> tagset for use by Spiders and HarvestersMay 2000 DLF Metadata Harvesting Project
Besser--Dublin Core Metadata 2/14/02 22
DC Qualifiers
_ allows one community to express important nuances and qualifications, while still making the basic importance available to communities with simple needs
_ our community can reflect alternate title, transliterated title, and main title, yet they will all be found under a simple Web search under “title”
Besser--Dublin Core Metadata 2/14/02 23
Discovery Metadata:Recent History
_ Dublin Core (3/95)_ Warwick Framework (4/96)_ Image Metadata Workshop (9/96)_ Canberra, Helsinki, ... DC (98)_ Digital Library Collaboratory (97-)_ DC-8, Frankfurt 10/99
Besser--Dublin Core Metadata 2/14/02 24
Dublin Core--further work
_ Warwick Framework– metadata packages for extensible functions
– layed groundwork for RDF
_ Canberra Qualifiers– refining the semantics of the element set to provide more precise info
– SUBELEMENT, SCHEME, LANG
_ Granularity– no hierarchical relationships w/i a given DC record; only one record
per discrete object (collection or item-level), and relationship field plus qualifier links them
The Research Process and Functional Categories of
Metadata_ Discovery_ Retrieval_ Collation_ Analysis_ Re-presentation
Besser--Dublin Core Metadata 2/14/02 26
Metadata Mapping-
Crosswalks Resource Description Framework (RDF)
Besser--Dublin Core Metadata 2/14/02 27
Crosswalks
mapping btwn differing metadata structures eliminate the need for monolithic,
universally adopted standards focus on flexibility and interoperatiblity RDF-based metadata registries
Besser--Dublin Core Metadata 2/14/02 28
Crosswalk ExampleCDWA Object ID
CIMISchema
FDAVRA CoreCategories
USMARCDUBLINCORE
OBJECT/WORK (core)
DocumentClassification-CatalogLevel (core)DocumentClassification-Group Type
Object/Work-Type (core)
Type ofObject
objectNAME DocumentClassification- DocumentType (core)Purpose-Purpose(Broad) (core)Purpose-Purpose(Narrow)
W1. WorkType
655 Genre-Form
Type
Object/Work-Components
quantity DocumentClassification-Extent
300a PhysicalDescription-Extent
ORIENTATION/ARRANGEMENT
Description
TITLES ORNAMES(core)
Title objectTitlebibliographicTitle
Group/ItemIdentification-RepositoryTitleGroup/ItemIdentification-DescriptiveTitle (core)Group/ItemIdentification-InscribedTitle
W2. Title 24Xa Titleand Title-RelatedInformation
Title
Besser--Dublin Core Metadata 2/14/02 29
Resource Description Framework (RDF, spec released 2/99)
_ W3C Metadata activity_ designed to move the Web beyond simple links to
semantically-rich relationships btwn resources_ metadata application using XML as a common syntax for
exchange and processing_ flexible architecture for managing diverse application-
specific metadata packets that can be processed by machines_ associates resources, property types, and corresponding
values_ http://www.w3.org/RDF/
Besser--Dublin Core Metadata 2/14/02 30
RDF
_ Resources (character strings, names, digital objects)
_ Property (“is the author of”)_ Value
_ resources+properties=relationships_ many different relationships can be reflected
Besser--Dublin Core Metadata 2/14/02 31
XML-encoded RDF
_ <?xml:namespace ns=http://www.w3.org/RDF/RDF prefix="RDF" ?>
_ <?xml:namespace ns=http://purl.oclc.org/DC/ prefix="DC" ?>
_ <RDF:RDF>_ <DC:Creator>Howard Besser</DC:Creator>_ </RDF:Description>_ </RDF:RDF>
Besser--Dublin Core Metadata 2/14/02 32
Should you start building with RDF today?
_ Tools are primitive_ Standard still likely to evolve
Besser--Dublin Core Metadata 2/14/02 33
Metadata for Digital LibrariesHoward Besser
UCLA School of Education & Information
Baca, Murtha (ed). Introduction to Metadata, Los Angeles: Getty Information Institute, 1998
http://www.getty.edu/gri/standard/intrometadata/
http://sunsite.Berkeley.EDU/Imaging/Databases/#standards
http://sunsite.Berkeley.EDU/moa2/
http://sunsite.Berkeley.EDU/Longevity/
http://www.ifla.org/II/metadata.htm
http://purl.oclc.org/metadata/dublin_core/
http://purl.oclc.org/corc/
http://lcweb.loc.gov/ead/
http://www.gseis.ucla.edu/~howard/image-meta.html
http://www.gseis.ucla.edu/~howard/Metadata/UC-May00/
http://sunsite.berkeley.edu/Metadata/sp2000.html