Linking library data

Linking Library Data ELAG 2011 Workshop Jindřich Mynarz @jindrichmynarz

description

Slides accompanying the Linking Library Data workshop at European Libraries Automation Group conference 2011.

Transcript of Linking library data

Page 1: Linking library data

Linking Library Data
ELAG 2011 Workshop

Jindřich Mynarz @jindrichmynarz

Page 2: Linking library data

linked data is sooo 2009

Page 3: Linking library data

Workshop

• Introduction
  o Motivation
  o Involved technologies
• Discussion
  o Key questions
  o Potential issues

• Practical linking

Shared document: bit.ly/linking-library-data
Twitter hashtag: #elag2011

Page 4: Linking library data

lots about conversion

Page 5: Linking library data

little about linking

Page 6: Linking library data

library links

Page 7: Linking library data
Page 8: Linking library data
Page 9: Linking library data
Page 10: Linking library data

raw data

Page 11: Linking library data

linked data

Page 12: Linking library data

Key technologies

• URIs
• RDF
• SPARQL
• Linked data

Page 13: Linking library data

URIs

• Uniform Resource Identifiers
  o <http://example.com>
• "Cool URIs"
  o resolvable
  o stable
  o implement content negotiation
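Content negotiation can be illustrated with a minimal sketch: the server inspects the Accept header of a request and picks the best representation it has for the resource. The function names and the list of available media types below are assumptions made for illustration, and the parsing is simplified (it ignores partial wildcards such as text/*).

```python
def parse_accept(header):
    """Parse an HTTP Accept header into (media_type, q) pairs, best first."""
    prefs = []
    for part in header.split(","):
        fields = [f.strip() for f in part.split(";")]
        media_type = fields[0]
        if not media_type:
            continue
        q = 1.0  # default quality when no q parameter is given
        for param in fields[1:]:
            if param.startswith("q="):
                try:
                    q = float(param[2:])
                except ValueError:
                    q = 0.0  # treat a malformed q value as "not acceptable"
        prefs.append((media_type, q))
    # sorted() is stable, so equal q values keep their header order
    return sorted(prefs, key=lambda p: p[1], reverse=True)


def negotiate(accept_header, available):
    """Return the best available media type for the request, or None."""
    for media_type, q in parse_accept(accept_header):
        if q <= 0:
            continue
        if media_type in available:
            return media_type
        if media_type == "*/*" and available:
            return available[0]  # fall back to the server's default
    return None


# Hypothetical representations of a single resource
AVAILABLE = ["text/html", "application/rdf+xml", "text/turtle"]

print(negotiate("text/turtle, text/html;q=0.5", AVAILABLE))
print(negotiate("application/json, */*;q=0.1", AVAILABLE))
```

A browser asking for text/html and an RDF client asking for text/turtle can then use the same URI and still receive a suitable document.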

Page 14: Linking library data

Learn what RDF looks like

• Data format for formalizing directed graphs.
• Standard for data interchange on the Web.
• Unit of RDF is a triple.
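As a rough sketch of the idea that a triple is the unit of RDF and that a set of triples forms a directed graph, the data model can be imitated with plain Python tuples. The example.com URIs, the book and the person are made up for illustration; the Dublin Core and FOAF property URIs are real vocabularies chosen here as an assumption, not taken from the slides.

```python
from collections import namedtuple

# One RDF statement: a directed edge from subject to object, labelled by predicate.
Triple = namedtuple("Triple", ["subject", "predicate", "object"])

EX = "http://example.com/"           # hypothetical namespace
DCT = "http://purl.org/dc/terms/"    # Dublin Core terms
FOAF = "http://xmlns.com/foaf/0.1/"  # FOAF vocabulary

# A single triple
title = Triple(EX + "book/1", DCT + "title", "An Example Book")

# More triples that share nodes with the first one
creator = Triple(EX + "book/1", DCT + "creator", EX + "person/1")
name = Triple(EX + "person/1", FOAF + "name", "Jane Doe")

# A graph is simply a set of triples
graph = {title, creator, name}


def edges_from(graph, node):
    """Return the (predicate, object) pairs leaving a node, sorted."""
    return sorted((t.predicate, t.object) for t in graph if t.subject == node)


for predicate, obj in edges_from(graph, EX + "book/1"):
    print(predicate, "->", obj)
```

Because the object of one triple (the person) is the subject of another, the statements connect into a graph rather than staying isolated records.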

Page 15: Linking library data

Step 1: a triple

Page 16: Linking library data

Step 2: triples

Page 17: Linking library data

Step 3: a graph

Page 18: Linking library data

Step 4: linked data

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

Page 19: Linking library data

Learn how to SPARQL

• Query language for RDF

PREFIX ex: <http://example.com>
SELECT [DISTINCT] ?what
[FROM ?where]
WHERE {
  ?triplePattern
}
[ORDER BY ?variable]
[LIMIT ?limit]
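To show what a triple pattern in a WHERE clause does, here is a minimal sketch of pattern matching over an in-memory list of triples. It is not a SPARQL engine: the function names are hypothetical, terms are plain strings, and the prefixed names (ex:, dcterms:) are stand-ins that are never expanded.

```python
def is_variable(term):
    """Variables are written like SPARQL variables: ?name."""
    return isinstance(term, str) and term.startswith("?")


def match_pattern(graph, pattern):
    """Yield one dict of variable bindings per triple that matches the pattern."""
    for triple in graph:
        binding = {}
        for pattern_term, value in zip(pattern, triple):
            if is_variable(pattern_term):
                # A variable used twice in one pattern must bind the same value
                if binding.setdefault(pattern_term, value) != value:
                    break
            elif pattern_term != value:
                break
        else:
            yield binding


# Hypothetical data
graph = [
    ("ex:book1", "dcterms:creator", "ex:person1"),
    ("ex:book2", "dcterms:creator", "ex:person1"),
    ("ex:book3", "dcterms:creator", "ex:person2"),
]

# Roughly: SELECT ?book WHERE { ?book dcterms:creator ex:person1 } ORDER BY ?book LIMIT 10
pattern = ("?book", "dcterms:creator", "ex:person1")
books = sorted(b["?book"] for b in match_pattern(graph, pattern))[:10]
print(books)
```

In this analogy ORDER BY corresponds to the sorted() call and LIMIT to the slice, which is why the ordering is applied before the limit.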

Page 20: Linking library data

Linked data

1. Use URIs as names for things.
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL).
4. Include links to other URIs, so that they can discover more things.

Page 21: Linking library data

Linking data

• linking...
  o is a continuous integration of heterogeneous dataspaces?
  o creates context?
  o is a job for librarians? Or machines?
  o is good?

Page 22: Linking library data

@href is a blunt instrument

Page 23: Linking library data

typed links

Page 24: Linking library data

=    identity
~    similarity
>    hierarchy
?    aboutness

typed links
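One possible way to express these four link types in RDF is to pick a predicate for each. The mapping below is an assumption, not something the slides specify: owl:sameAs, skos:closeMatch, skos:broader and dcterms:subject are existing properties that are often used for these purposes, and the example.com URIs are made up.

```python
# Assumed mapping from the slide's link types to existing RDF properties
LINK_TYPES = {
    "identity": "http://www.w3.org/2002/07/owl#sameAs",
    "similarity": "http://www.w3.org/2004/02/skos/core#closeMatch",
    "hierarchy": "http://www.w3.org/2004/02/skos/core#broader",
    "aboutness": "http://purl.org/dc/terms/subject",
}


def typed_link(source, link_type, target):
    """Build a (subject, predicate, object) triple for a typed link."""
    try:
        predicate = LINK_TYPES[link_type]
    except KeyError:
        raise ValueError("unknown link type: " + link_type)
    return (source, predicate, target)


# Hypothetical URIs: a local record and a concept in another dataset
link = typed_link(
    "http://example.com/book/1",
    "aboutness",
    "http://example.org/concept/linked-data",
)
print(link)
```

Unlike a bare @href, the predicate tells a consumer how the two resources relate, so identity links can be merged while aboutness links are only followed.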

Page 25: Linking library data

Discussion

• How to find datasets suitable for interlinking?
• How to make my dataset worth linking to?
• How to encourage others to link to my data?
• What is the added value of links?
• How to determine the quality of a link?
• How to maintain links?

Page 26: Linking library data

find and examine data

Page 27: Linking library data

added value of links

Page 28: Linking library data

link baiting

Page 29: Linking library data

link maintenance

Page 30: Linking library data

Linking

• Record linkage, identity resolution, duplicate detection, instance matching, co-reference detection
• Determinism:
  o Deterministic (e.g., dictionary-based)
  o Probabilistic (e.g., graph matching)
• Level:
  o Schema (e.g., ontology mapping)
  o Instances (e.g., record linkage)
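The difference between a deterministic and a score-based approach to instance matching can be sketched as follows. The score-based function uses string similarity from Python's difflib with a threshold; this is a crude stand-in chosen for brevity, not the graph matching the slide mentions and not a true probabilistic model. The dictionary, URIs, function names and the 0.85 threshold are all assumptions.

```python
from difflib import SequenceMatcher


def normalize(label):
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(label.lower().split())


def deterministic_match(label, dictionary):
    """Dictionary-based: link only when the normalized label is a known key."""
    return dictionary.get(normalize(label))


def scored_match(label, dictionary, threshold=0.85):
    """Score-based: link to the most similar known label if it is similar enough."""
    best_uri, best_score = None, 0.0
    for known_label, uri in dictionary.items():
        score = SequenceMatcher(None, normalize(label), known_label).ratio()
        if score > best_score:
            best_uri, best_score = uri, score
    if best_score >= threshold:
        return best_uri, best_score
    return None, best_score


# Hypothetical authority file: normalized label -> URI
PLACES = {
    "prague": "http://example.com/place/prague",
    "vienna": "http://example.com/place/vienna",
}

print(deterministic_match("  Prague ", PLACES))  # exact hit after normalization
print(deterministic_match("Pragu", PLACES))      # no exact hit
print(scored_match("Pragu", PLACES))             # close enough to "prague"
```

The deterministic variant never produces a wrong link for an unknown label but misses typos; the scored variant catches them at the cost of having to choose and tune a threshold.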

Page 31: Linking library data

Linking

1. Untyped links to typed links.
2. Literals to links.
3. Links to other links.
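The second item, turning literals into links, can be sketched as a rewrite over triples: wherever a chosen property has a string value that a lookup table recognizes, the string is replaced by the URI of the thing it names. The function name, the lookup table and the abbreviated identifiers are hypothetical.

```python
def literals_to_links(triples, predicate, lookup):
    """Replace literal objects of `predicate` with URIs found in `lookup`.

    Triples with other predicates, or with literals missing from the lookup,
    are passed through unchanged.
    """
    result = []
    for s, p, o in triples:
        if p == predicate and o in lookup:
            result.append((s, p, lookup[o]))
        else:
            result.append((s, p, o))
    return result


# Hypothetical data: creators recorded as plain strings
triples = [
    ("ex:book1", "dcterms:creator", "Jane Doe"),
    ("ex:book1", "dcterms:title", "An Example Book"),
    ("ex:book2", "dcterms:creator", "Unknown Author"),
]

# Hypothetical name authority: literal -> URI
authority = {"Jane Doe": "ex:person1"}

for triple in literals_to_links(triples, "dcterms:creator", authority):
    print(triple)
```

Only the first triple changes; the title stays a literal because it has a different predicate, and the unrecognized creator is left as it was rather than guessed at.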

Page 32: Linking library data

Interlinking with Silk

• Silk is an interlinking framework for instance matching.
• Uses the link specification language to describe the interlinking process.
• Powerful and relatively easy to use.

Page 33: Linking library data

Link specification language

• Your turn!

Page 34: Linking library data

connect the dots

Page 35: Linking library data

get this

Page 36: Linking library data

or this

Page 37: Linking library data

the end.