Forging New Links: Libraries in the Semantic Web

84
Forging New Links: Libraries in the Semantic Web Lisa Goddard & Gillian Byrne Memorial University Libraries Computers in Libraries, Washington D.C. March 23 rd , 2011

Transcript of Forging New Links: Libraries in the Semantic Web

Page 1: Forging New Links: Libraries in the Semantic Web

Forging New Links: Libraries in the Semantic Web

Lisa Goddard & Gillian Byrne

Memorial University Libraries

Computers in Libraries, Washington D.C.

March 23rd, 2011

Page 2: Forging New Links: Libraries in the Semantic Web

The Gist

General Semantic Web

• How it works.

• A few tools.

• Who’s involved?

Lisa

Libraries & Linked Data

• What it solves.

• Issues & obstacles.

• Where we are now.

Gillian

Page 3: Forging New Links: Libraries in the Semantic Web

Web Search Problems

Page 4: Forging New Links: Libraries in the Semantic Web

High Recall, Low Precision

Page 5: Forging New Links: Libraries in the Semantic Web

Vocabulary Dependent

Page 6: Forging New Links: Libraries in the Semantic Web

Returns Single Web Pages

Page 7: Forging New Links: Libraries in the Semantic Web

Access to Deep Web

Page 8: Forging New Links: Libraries in the Semantic Web

Identity

Page 9: Forging New Links: Libraries in the Semantic Web

Comparisons

Academic Staff Member (University College London)

Faculty Member (McGill)Equivalent to?

Page 10: Forging New Links: Libraries in the Semantic Web

Complex Queries

Find all soccer players, who played as goalkeeper for a club that has a stadium with more than 40,000 seats and who are born in a country with more than 10 million inhabitants.

Page 11: Forging New Links: Libraries in the Semantic Web

Structured Databases

Page 12: Forging New Links: Libraries in the Semantic Web
Page 13: Forging New Links: Libraries in the Semantic Web

The Semantic Solution

1. Structured data

2. Controlled vocabularies

3. Linking

Page 14: Forging New Links: Libraries in the Semantic Web

Machine-Actionable Data

“Our top two users are computers.”

- Martin Kalfatovic, Smithsonian

Page 15: Forging New Links: Libraries in the Semantic Web

Structured Data: RDF

Data model for writing simple statements about web objects.

RDF statements are written as “triples”.

Subject Object

Shakespeare Macbeth

Predicate

Wrote

V

Statement

Page 16: Forging New Links: Libraries in the Semantic Web

RDF Triples

Subject Predicate Object

ShakespeareShakespeareAnne HathawayShakespeareStratfordMacbethEnglandScotland

WroteWroteMarriedLived inIs inSet in Part ofPart of

King LearMacbethShakespeareStratfordEnglandScotlandUKUK

Page 17: Forging New Links: Libraries in the Semantic Web

RDF Graph: A Semantic Net

AnneHathaway

Shakespeare

Stratford

UK

Macbeth

KingLear

Scotland

England

married

wrote

wrote

isIn

setIn

lived

In

partO

f

partOf

Page 18: Forging New Links: Libraries in the Semantic Web

Unique Identifiers: URIs

Shakespeare Macbethwrote

http://www.mun.ca/project#shakespeare http://www.mun.ca/project#macbeth

http://www.mun.ca/project#wrote

URIs should resolve.

Page 19: Forging New Links: Libraries in the Semantic Web

RDF Description: FOAF<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:foaf="http://xmlns.com/foaf/0.1/"<foaf:Person rdf:ID="me">

<foaf:name>Lisa Goddard</foaf:name><foaf:title>Ms.</foaf:title><foaf:givenname>Lisa</foaf:givenname><foaf:family_name>Goddard</foaf:family_name><foaf:homepage rdf:resource="http://twitter.com/lisagoddard"/><foaf:workplaceHomepage rdf:resource="http://www.library.mun.ca"/><foaf:schoolHomepage rdf:resource="http://www.queensu.ca"/><foaf:knows>

<foaf:Person><foaf:name>Gillian Byrne</foaf:name><foaf:mbox rdf:resource="mailto:[email protected]"/>

</foaf:Person></foaf:knows></foaf:Person></rdf:RDF>

Page 20: Forging New Links: Libraries in the Semantic Web

Resolvable URIs

xmlns:foaf="http://xmlns.com/foaf/0.1/"

<foaf:Person rdf:ID="me"><foaf:name>Lisa Goddard</foaf:name><foaf:workplaceHomepage rdf:resource="http://www.library.mun.ca"/></foaf:Person>

http://xmlns.com/foaf/0.1/#term_workplaceHomepage

Page 21: Forging New Links: Libraries in the Semantic Web

Resolvable URIs

Page 22: Forging New Links: Libraries in the Semantic Web

An ontology describes a particular domain of knowledge (e.g. bikes, whiskey).

•Establishes controlled vocabulary.

•Models relationships between entities & concepts.

•Built-in rules and datatypes that support reasoning.

Ontologies

Page 23: Forging New Links: Libraries in the Semantic Web

Controlled Vocabulary

Terms and definitions are posted online, so they can be shared by many different organizations.

http://www.mun.ca/lit.owl

#wrote

#setIn

#play #book

#poem

#narrated

http://www.mun.ca/lit.owl#wrote

http://www.mun.ca/lit.owl/#book

http://www.mun.ca/lit.owl#play

Page 24: Forging New Links: Libraries in the Semantic Web

Shared Ontologies

Namespaces allow us to combine several vocabularies while maintaining distinct meaning of each element.

#person

#partOf

http://gmu.edu/bio.owl

http://mit.edu/geo.rdf

http://mun.ca/lit.owl

#wrote

#setIn

#married

#play #book

#poem

#narrated#isIn

#country

#city#region

#birthdate

#deathdate

Page 25: Forging New Links: Libraries in the Semantic Web

Ontologies & Reasoning

Page 26: Forging New Links: Libraries in the Semantic Web

Reasoning: Inverse

lit:wrote owl:inverseOf lit:writtenBy

bio:Shakespeare lit:wrote lit:Macbeth

lit:Macbeth lit:writtenBy bio:Shakespeare

Axiom

ExplicitFact

ImplicitFact

Page 27: Forging New Links: Libraries in the Semantic Web

Reasoning: Symmetrical

bio:married rdf:type owl:SymmetricProperty

bio:AnneHathaway bio:married bio:Shakespeare

bio:Shakespeare bio:married bio:AnneHathaway

Axiom

ExplicitFact

ImplicitFact

Page 28: Forging New Links: Libraries in the Semantic Web

Reasoning: Equivalent

lit:WilliamShakespeare owl:sameAs bio:Shakespeare

bio:Shakespeare lit:wrote lit:Macbeth

lit:WilliamShakespeare lit:wrote lit:Macbeth

ExplicitFacts

Axiom

ImplicitFacts

Page 29: Forging New Links: Libraries in the Semantic Web

Finding Ontologies

Page 30: Forging New Links: Libraries in the Semantic Web

RDF Browser

Page 31: Forging New Links: Libraries in the Semantic Web

Linking Distributed Data

Page 32: Forging New Links: Libraries in the Semantic Web

Linking Data

Page 33: Forging New Links: Libraries in the Semantic Web

Linking Data

Page 34: Forging New Links: Libraries in the Semantic Web

Linking Entities

Directors Moviesdbpedia:director

foaf:made

Page 35: Forging New Links: Libraries in the Semantic Web

Linking Hubs

Page 36: Forging New Links: Libraries in the Semantic Web

Semantic Linking Hubs

Page 37: Forging New Links: Libraries in the Semantic Web

Natural Language Processing

Identify people, places, things, concepts in unstructured text files.

Disambiguate terms.

Suggest links to existing entities.

Page 38: Forging New Links: Libraries in the Semantic Web
Page 39: Forging New Links: Libraries in the Semantic Web
Page 40: Forging New Links: Libraries in the Semantic Web
Page 41: Forging New Links: Libraries in the Semantic Web

RDF Publishing Tools

Page 42: Forging New Links: Libraries in the Semantic Web

Drupal 7 CMS

Page 43: Forging New Links: Libraries in the Semantic Web

Semantic MediaWiki

Page 44: Forging New Links: Libraries in the Semantic Web

Semantic Blogging: Zemanta

Page 45: Forging New Links: Libraries in the Semantic Web

Linked Data Adoption

Page 46: Forging New Links: Libraries in the Semantic Web

Growth of RDFa

Usage of RDFa increased 510% between Mar 2009 and Oct 2010

From: http://tripletalk.wordpress.com/2011/01/25/rdfa-deployment-across-the-web/

Page 47: Forging New Links: Libraries in the Semantic Web

Websites with RDF

Page 48: Forging New Links: Libraries in the Semantic Web

Semantic Technologies

Page 49: Forging New Links: Libraries in the Semantic Web
Page 50: Forging New Links: Libraries in the Semantic Web

GoogleFaceted Recipe Search

Page 51: Forging New Links: Libraries in the Semantic Web

Library Linked Data

Page 52: Forging New Links: Libraries in the Semantic Web

Disconnected Data

Page 53: Forging New Links: Libraries in the Semantic Web

Silos in the Library

Source: Provincial Archives of Newfoundland & Labrador

Page 54: Forging New Links: Libraries in the Semantic Web
Page 55: Forging New Links: Libraries in the Semantic Web
Page 56: Forging New Links: Libraries in the Semantic Web

Weak Links

Page 58: Forging New Links: Libraries in the Semantic Web

Lost Content

Page 59: Forging New Links: Libraries in the Semantic Web

Enhanced Content

Page 60: Forging New Links: Libraries in the Semantic Web

Missed Opportunities

Page 61: Forging New Links: Libraries in the Semantic Web

Enhanced personalization

Julie

History 1012

List ID: 12

Newfoundland to confederation

MemberOf

HasList

Contains

@prefix aiiso: <http://purl.org/vocab/aiiso/schema#>. @prefix resource: <http://purl.org/vocab/resourcelist/schema# >.@prefix foaf: <http://xmlns.com/foaf/0.1/> .@prefix bibo: <http://purl.org/ontology/bibo/> . @prefix dcterms: <http://purl.org/dc/terms/> .

Page 62: Forging New Links: Libraries in the Semantic Web

What’s our value?

• 2.0 thinking: our value is linked to our data

• 3.0 thinking: our value is linked to our (re)useable, shareable data?

Page 63: Forging New Links: Libraries in the Semantic Web

Obstacles

Page 64: Forging New Links: Libraries in the Semantic Web

Competing Vocabularies

...how many ways to describe a book, journal article or a place?

Ian Millard, Hugh Glaser, Manuel Salvadores, Nigel Shadbolt http://eprints.ecs.soton.ac.uk/21681/5/cold2010-slides.pdf

Page 65: Forging New Links: Libraries in the Semantic Web

Co-referencing

1. http://dbpedia.org/resource/Cluny_MacPherson

2. http://dbpedia.org/resource/Dr._Cluny_MacPherson

3. http://mpii.de/yago/resource/Cluny_MacPherson

4. http://rdf.freebase.com/ns/guid.9202a8c04000641f80000000005e34c1

5. http://viaf.org/viaf/76097050

6. http://umbel.org/umbel/ne/wikipedia/Cluny_MacPherson

Page 66: Forging New Links: Libraries in the Semantic Web
Page 67: Forging New Links: Libraries in the Semantic Web

Discovering

• Lots (and lots and lots) of linked data out there

• How to find it?

Page 68: Forging New Links: Libraries in the Semantic Web
Page 69: Forging New Links: Libraries in the Semantic Web

Querying

Page 70: Forging New Links: Libraries in the Semantic Web
Page 71: Forging New Links: Libraries in the Semantic Web

Trust

The largest hurdle to library adoption of Linked Data, though, may not be educational or technological …The sticking point for librarians may be an issue of trust. 

- Ross Singer, “Linked Data Now!”

Page 72: Forging New Links: Libraries in the Semantic Web

Preservation

What happens when a ontology or linking hub disappears?

Page 73: Forging New Links: Libraries in the Semantic Web

Data ownership

Library

Catalogue

Digital Archive

Database

Repository

Ejournal

Page 74: Forging New Links: Libraries in the Semantic Web

Licensing

“You shall not use the data made available through the GC Open Data Portal in any way which, in the opinion of Canada, may bring disrepute to or prejudice the reputation of Canada."

Page 75: Forging New Links: Libraries in the Semantic Web

VoID

:DBpedia a void:Dataset ;

dcterms:license <http://www.gnu.org/copyleft/fdl.html> .

• schema to describe linked datasets

Page 76: Forging New Links: Libraries in the Semantic Web

Oh - One more thing…

“who’s minding the ranch?”

Page 77: Forging New Links: Libraries in the Semantic Web

RDA

• Works with in MARC, but also works as a linked data Metadata Vocabulary

Page 78: Forging New Links: Libraries in the Semantic Web
Page 79: Forging New Links: Libraries in the Semantic Web

RDF Converters

Page 80: Forging New Links: Libraries in the Semantic Web

Publishing Tools

Page 81: Forging New Links: Libraries in the Semantic Web

Where we are now

Page 82: Forging New Links: Libraries in the Semantic Web

Age of Chaotic Innovation?

LIBRIS (Swedish Union Catalog)

Library of Congress (LCSH, OSI)

German National Library

Hungarian National Library

British Library

Europeana

Linked Periodicals Data

Virtual International Authority File

Dewey Decimal Classification

BIBSYS’ authority files Thesaurus for Economics

Rameau

Swedish Subject Headings

German Subject Headings

Metadata Authority Description Schema (MADS)

Page 83: Forging New Links: Libraries in the Semantic Web

The Chaos Tamers

• W3C Linked Library Data Incubator Group

• IFLA Semantic Web Interest Group

• CKAN Linked Library Data Group

• LITA/ALCTS Linked Library Data Interest Group

Page 84: Forging New Links: Libraries in the Semantic Web

Questions?

Source: http://www.jenniferbetman.com