Adlug annual meeting 2013
Transcript of Adlug annual meeting 2013
Copyright 2009-2010 @CULT. All rights reserved
Tiziana Possemato
32st ADLUG ANNUAL MEETING 2013
ARTIUM and the Fundación Sancho El Sabio – Vitoria-Gasteiz
16th – 18th October 2013
RDA, Bibframe, Linked data: follow the evolution of cataloging inside the LMS
Copyright 2009-2010 @CULT. All rights reserved 2
The global environment
Functional Requirements for Authority Data
Functional Requirements for Bibliographic Records
Resource Description and Access
International Cataloguing Principles Semantic web/Linked data
Bibframe
Where we are moving…
Copyright 2009-2010 @CULT. All rights reserved 3
FRBR’s Entity-Relationship Model
• Entities
• Relationships
• Attributes
One Entity Another Entity
• National level required elements
relationship
Copyright 2009-2010 @CULT. All rights reserved 4
FRBR’s Entity-Relationship Model
Shakespeare Hamlet
created
was created by
Pers
on
Work
Copyright 2009-2010 @CULT. All rights reserved
is realized through
is embodied in
is exemplified by
recursive
one
many
Work
Expression
Manifestation
Item
Group 1
FRBR: inherent Group 1 Relationships
Copyright 2009-2010 @CULT. All rights reserved
Group 2
is owned by
is produced by
is realized by
is created by
Work
Expression
Manifestation
Item
Person
Corporate Body
Family
FRBR: Relationships between Groups 1 & 2
Copyright 2009-2010 @CULT. All rights reserved 7
ICP: Statement of International Cataloguing Principles
ICP general principles
• Convenience of user
• Representation
• Common usage
• Accuracy
• Sufficiency and necessity
• Significance
• Economy
• Consistency and Standardization
• Integration
• Defensible, not arbitrary
• If contradict, take a defensible, practical solution
Copyright 2009-2010 @CULT. All rights reserved 8
RDA: Resource Description and Access
Based on IFLA’s international models and principles
• Functional Requirements for Bibliographic Records (FRBR; 1998)
• Functional Requirements for Authority Data (FRAD; 2009)
• Statement of International Cataloguing Principles (ICP; 2009)
Copyright 2009-2010 @CULT. All rights reserved 9
RDA: goals*
RDA is:
• A new standard for resource description and access
• Designed for the digital world:• Optimized for use as online product
• Description and access of all resources (all types of content and media)
• Resulting records usable in the digital environment (Internet, WebOPACs, etc.)
• Rules should be easy to use and interpret
• Be applicable to an online, networked environment
*Barbara Tillett, Seminar on RDA
Copyright 2009-2010 @CULT. All rights reserved 10
RDA: goals*
• Provide effective bibliographic control for all types of media
• Encourage use beyond the library community
• Be compatible with other similar standards
• Have a logical structure based on internationally agreedprinciples
• Separate content and carrier data
• Examples – more of them, more appropriate
Basic goals: identify and relate (from FRBR/FRAD user tasksand ICP)
Copyright 2009-2010 @CULT. All rights reserved 11
RDA: Summary of differences*
• Based on principles• ICP
• Based on conceptual models• FRBR/FRAD
• Content, not encoding, standard:• not a display standard,
• not an encoding standard: use whatever schema you prefer, such as Marc21,Dublin Core etc.
• Strictly related to the Bibliographic Framework Transition Initiative
• More international
• Wider scope and resources
*Barbara Tillett, Seminar on RDA
Copyright 2009-2010 @CULT. All rights reserved 12
RDA: Summary of differences*
• Includes authority data• based on attributes and relationships in FRAD.
• Authorized/variant access point and elements continue to be documented inauthority records
• Has controlled vocabularies• Only a few closed: content, media and carrier types; mode of issuance; etc.
• Most are open: cataloguer can supply term if needed term not in list
• Vocabularies being registered on the Web -- goal of multiple languages and/orscripts (http://metadataregistry.org/rdabrowse.htm)
• Changes in instruction vocabulary
• Core elements
*Barbara Tillett, Seminar on RDA
Copyright 2009-2010 @CULT. All rights reserved 13
RDA in Marc21
RDA: “use whatever schema you prefer, such as Marc21, Dublin Core etc.”
I can catalogue in RDA using Marc21… but
some updates are needed!
Copyright 2009-2010 @CULT. All rights reserved
RDA record in Marc21 format
040 $e rda
content, media, carrier
264 as publisher
Copyright 2009-2010 @CULT. All rights reserved
Full update of Marc21 Bibliographic and Authority: numbers
• Marc21 Bibliographic Format updates: 252tags/subfields
• Marc21 Authority Format updates: 167tags/subfields (total of 3249 elements checked and revised)
(strong effort of LibriCore)
• Integration of Authority manage in OliSuite/WeCat
Copyright 2009-2010 @CULT. All rights reserved
Marc must die? From record to data
http://www.loc.gov/marc/transition/pdf/marcld-report-11-21-2012.pdf
Bibliographic Framework as a Web of Data: Linked Data Model and Supporting Services
Copyright 2009-2010 @CULT. All rights reserved
Bibliographic Framework
Bibliographic Framework as a Web of Data: Linked Data Model and Supporting Services
(Library of Congress – November 21, 2012)
• The result of a reflection and study process about the functions of the catalog, the actuality of Marc, of cataloging standards, the new types of resources.
• It‟s the first step of a road map that will lead to a review and proposal of a new environment for bibliographic libraries, a new bibliographic ecosystem
• Convertion of bibliographic data in linked data: this process has to be an evolution and not a revolution
Copyright 2009-2010 @CULT. All rights reserved
Bibliographic Framework
• Reflections on new cataloging rules
• Greater level of identification and analysis of the data
• More attention to the controlled vocabularies
• Increased use of terms with respect to codes
• Emphasis on relationships
• Greater flexibility within checked headings
Copyright 2009-2010 @CULT. All rights reserved 19
Library data and Semantic web
As Karen Coyle suggests us, the question that we must face is:
“How we can best transform our data so that it can become part ofthe dominant information environment that is the Web”*
How we can transform our data so that they can be OF THE WEB andnot on the web
*Karen Coyle, Understanding the Semantic web: bibliographic data and metadata
Copyright 2009-2010 @CULT. All rights reserved 20
Library data and Semantic web
Web of document (traditional web)
vs Web of data (Semantic web)
Web of document: identity issues and interoperability
Copyright 2009-2010 @CULT. All rights reserved
Web: identity issues and interoperability
2006: Tim Berners-Lee, the father of the world wide web (www), offers the Linked Data as elegant and effective method to simplify and standardize the solutions to the problems of identity and interoperability.
The method aims to create a web of data (or entities or things) and is part of the technology for the Semantic Web
[http://www.w3.org/DesignIssues/LinkedData.html]
Copyright 2009-2010 @CULT. All rights reserved
The Semantic web or Web of data
*17th International World Wide Web Conference W3C Track @ WWW2008, Beijing, China 23-24 April 2008 -Linked Data: Principles and State of the Art
Il web di dati o web semantico
”A web of things in the world, described by data on the web”
Copyright 2009-2010 @CULT. All rights reserved
The four rules for linked data
“The Semantic Web isn't just about putting data on the web. It is about
making links, so that a person or machine can explore the web of
data. With linked data, when you have some of it, you can find
other, related, data.”
Tim Berners-Lee proposes four rules to create linked data on the web:
1. Use URIs as names for things.
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful information, using
the standards (RDF*, SPARQL).
4. Include links to other URIs so that they can discover more things.
Copyright 2009-2010 @CULT. All rights reserved
Linked data and RDF (ResourceDescription Framework)
The Linked data are usually published on the web in RDF
Each statement in RDF is constituted by a triple formed by
subject -- predicate -- object
Copyright 2009-2010 @CULT. All rights reserved
Published byCarlo
Collodi
Le avventure di
Pinocchio
Poggi
Un burattino di
nome Pinocchio
1881Pinocchio
Author of
date
Dramatized as
Adapted as
MazzantiIllustrated by
Una struttura estremamente semplice che rende
possibile a ciascuno formulare asserzioni su
qualsiasi cosa
The RDF triples
Copyright 2009-2010 @CULT. All rights reserved
The focus of our attention now is…
Copyright 2009-2010 @CULT. All rights reserved
…on the bibliographic record
Copyright 2009-2010 @CULT. All rights reserved
Centrality of the bibliographic record –Drawbacks
• Redundancy of information
• High costs
• Production of closed silos, and mostly not usable outside
of the library world
Copyright 2009-2010 @CULT. All rights reserved
Marc21 and Linked data
In the semantic web MARC record is seen as a
primary source of information, for the
enormous amount of data contained therein and to
the high degree of semantics present in it
Copyright 2009-2010 @CULT. All rights reserved
Marc21 and Linked data
xv, 240 p. : ill. ; 24 cm.
Updated ed.
Includes bibliographical references (p. 240).
edited by Michael Hattaway.
Fathers and daughters--Drama.
Cambridge University Press, 2009.
But the semantics expressed in the record is only comprehensible
to the human eye. The individual elements, isolated from the record,
lose their meaning
Copyright 2009-2010 @CULT. All rights reserved
One bibliographic record…
Copyright 2009-2010 @CULT. All rights reserved
… many statements or triples
has author Shakespeare,William
Has subject
has publication date
has title
has publisher
As you like it
Cambridge U Press
Padri e figlie
2009
Triple
The resource
Copyright 2009-2010 @CULT. All rights reserved
RDF Triple
http://lccn.loc.gov/2009025024 (the book)
http://RDVocab.info/roles/HasAuthor (has
author)
http://lcna.loc.gov/n 78095332 (the author)
Controlled vocabulary and ontology
Copyright 2009-2010 @CULT. All rights reserved
Linked data vs record
As you like it
Cambridge University Press, 2009.
Michael Hattaway.
Fathers and daughters--Drama
Shakespeare, William, 1564-1616
Controlled vocabulary and ontology
Copyright 2009-2010 @CULT. All rights reserved
Linked data vs bibliographic record
• Linked data => not focused on records but on individual
assertions or statements of atomic structure (triples)
• Each statement (RDF triple) thus produced can be
recorded with URI/URIs, which connects the data to
others on the same dataset or in external datasets
• Moving these statements understandable to machines
on the web transforms the data contained in the original
Marc record in integral part of the Semantic Web
Copyright 2009-2010 @CULT. All rights reserved
Try to summarize core concepts
• Resource of the Web identified through URI
• Atomic elements related one another through links:
entity/relations model
• Less interest to description (“Take what you see” and
“Accept what you get”) and more relevance to access
control => authority record, controlled vocabulary
and ontology
• Data understandable and re-usable by machines: the
Resource Description Framework (RDF) as framework for
representing information in the Web.
Copyright 2009-2010 @CULT. All rights reserved
Authority module in WeCat
The full authority manage into WeCat
Copyright 2009-2010 @CULT. All rights reserved
Authority module in WeCat
Copyright 2009-2010 @CULT. All rights reserved
Authority module in WeCat
Authority record
in WeCat
Copyright 2009-2010 @CULT. All rights reserved
Authority module in WeCat
Copyright 2009-2010 @CULT. All rights reserved
Manage ontology in OliSuite/WeCat
See here how each tag is “mapped” with URI coming from different ontology
Copyright 2009-2010 @CULT. All rights reserved
Manage ontology in OliSuite/WeCat
An authorized person can associate each tag to one or more entity from
vocabulary and ontology
Copyright 2009-2010 @CULT. All rights reserved
Manage ontology in OliSuite/WeCat
Tag 110 (Name as Main entry) producersName in RDA vocabulary
Copyright 2009-2010 @CULT. All rights reserved
Checking ontology in OliSuite/WeCat
The cataloguer click on the tag
label to open the screen where
the mapping with ontology
element is reported
Copyright 2009-2010 @CULT. All rights reserved
Checking ontology in OliSuite/WeCat
Tag 110 (Name as Main entry) producersName in RDA vocabulary
and M1001_a in Marc 21 Element Set
Copyright 2009-2010 @CULT. All rights reserved
Checking ontology in OliSuite/WeCat
The URI refers directly to the specific element present on the Open Metadata
Registry, where many of „library‟ ontologies and vocabularies are published
Copyright 2009-2010 @CULT. All rights reserved
GMD as content, media and carrier in RDA
The GMD – General Material Designator of AACR2 and ISBD
is coded in three fields in RDA, and the content of each field is
taken from controlled vocabularies:
• Tag 336: CONTENT
• Tag 337: MEDIA
• Tag 338: CARRIER
In OliSuite/WeCat the cataloguer can choose the content of each
field from e list, that refers to controlled vocabulary on Open
Metadata Registry
Copyright 2009-2010 @CULT. All rights reserved
GMD as content, media and carrier in RDA
Tag 336: CONTENT
Copyright 2009-2010 @CULT. All rights reserved
GMD as content, media and carrier in RDA
Tag 336: CONTENT
Copyright 2009-2010 @CULT. All rights reserved
GMD as content, media and carrier in RDA
Tag 336: CONTENT
Copyright 2009-2010 @CULT. All rights reserved
The content concepts…
…from controlled vocabulary
Copyright 2009-2010 @CULT. All rights reserved
GMD as content, media and carrier in RDA
The same for tag 337: MEDIA
Copyright 2009-2010 @CULT. All rights reserved
GMD as content, media and carrier in RDA
The same for tag 338: CARRIER
Copyright 2009-2010 @CULT. All rights reserved
Include ID (URI) to identify heading
URIs reported in tag 1xx/7xx $0: see hereUmberto, Eco associated to VIAF URI, to make a link with VIAF –Virtual International Authority File
Copyright 2009-2010 @CULT. All rights reserved
Include ID (URI) to identify heading
The URI associated to an heading is repeatable and identify each heading. See here it in the authority record
Copyright 2009-2010 @CULT. All rights reserved
Include ID (URI) to identify heading
The URI can be used both by cataloguer – to check the heading on VIAF or other projects, and to end user, aftern, on the OPAC
Copyright 2009-2010 @CULT. All rights reserved
Just to close…
…giving a look to “economy” principle of ICP(see some example to make easier the cataloguing process: the autocomplete function in heading creation)