The Web of Data is Our Oyster

133
Richard Wallis OCLC Technology Evangelist @rjw The Web of Data is Our Oyster

Transcript of The Web of Data is Our Oyster

Page 1: The Web of Data is Our Oyster

Richard  Wallis  OCLC  Technology  Evangelist  

@rjw

The  Web  of  Data  is  Our  Oyster

Page 2: The Web of Data is Our Oyster
Page 3: The Web of Data is Our Oyster
Page 4: The Web of Data is Our Oyster

Image  courtesy  of:  Shropshire  County  Council1779  (c.)

The Industrial Revolution

Page 5: The Web of Data is Our Oyster
Page 6: The Web of Data is Our Oyster

The  Web  of  …

Page 7: The Web of Data is Our Oyster

The  Web  of  …

Documents

Page 8: The Web of Data is Our Oyster

The  Web  of  …

Documents

Active  Documents

Page 9: The Web of Data is Our Oyster

The  Web  of  …

Documents

Active  Documents

Discovery

Page 10: The Web of Data is Our Oyster

The  Web  of  …

Documents

Active  Documents

Discovery

Data

☌☌

Page 11: The Web of Data is Our Oyster

The  Web  of  …

Documents

Active  Documents

Discovery

Data

☌☌

Page 12: The Web of Data is Our Oyster

The  Web  of  …

Documents

Active  Documents

Discovery

Data

☌☌

Page 13: The Web of Data is Our Oyster

The  Web  of  …

Documents

Active  Documents

Discovery

Data

☌☌

Page 14: The Web of Data is Our Oyster

The  Web  of  …

Documents

Active  Documents

Discovery

Data

☌☌

Page 15: The Web of Data is Our Oyster

The  Web  of  …

Documents

Active  Documents

Discovery

Data

☌☌

✔✗

Page 16: The Web of Data is Our Oyster

The  Web  of  …

Documents

Active  Documents

Discovery

Data

Knowledge

☌☌

✔✗

Page 17: The Web of Data is Our Oyster

The  Web  of  …

Documents

Active  Documents

Discovery

Data

Knowledge

☌☌

✔✗

?☌

Page 18: The Web of Data is Our Oyster

http://www.opte.org/

The  Web  of  Data  

Page 19: The Web of Data is Our Oyster

http://www.opte.org/

The  Web  of  Data  

A  Web  of  related  entities

Page 20: The Web of Data is Our Oyster

http://www.opte.org/

The  Web  of  Data  

Page 21: The Web of Data is Our Oyster

http://www.opte.org/

The  Web  of  Data  

A  Library  Shaped  Black  Hole  ?

Page 22: The Web of Data is Our Oyster
Page 23: The Web of Data is Our Oyster
Page 24: The Web of Data is Our Oyster
Page 25: The Web of Data is Our Oyster

record  /ˈrɛkɔːd/  noun  !

a  thing  constituting  a  piece  of  evidence  about  the  past,  especially  an  account  kept  in  writing  or  some  other  permanent  form.

Page 26: The Web of Data is Our Oyster

entity  /ˈɛntɪti/  noun  

a  thing  with  distinct  and  independent  existence.

Page 27: The Web of Data is Our Oyster

entity  /ˈɛntɪti/  noun  

a  thing  with  distinct  and  independent  existence.

relationship  /rɪˈleɪʃ(ə)nʃɪp/  noun  

the  way  in  which  two  or  more  people  or  things  are  connected  

Page 28: The Web of Data is Our Oyster

RecordTitle:    "War  and  Peace"  Author:    "Leo  Tolstoy  1828-­‐1910"  ISBN:  0307266931

Type:  Work  Name:    "War  and  Peace"  Author:    http://worldcat.org/entity/person/id/1234

Entity  (http://worldcat.org/entity/work/id/115206288)

Page 29: The Web of Data is Our Oyster

RecordTitle:    "War  and  Peace"  Author:    "Leo  Tolstoy  1828-­‐1910"  ISBN:  0307266931

Type:  Work  Name:    "War  and  Peace"  Author:    http://worldcat.org/entity/person/id/1234

Entity  (http://worldcat.org/entity/work/id/115206288)

Page 30: The Web of Data is Our Oyster

RecordTitle:    "War  and  Peace"  Author:    "Leo  Tolstoy  1828-­‐1910"  ISBN:  0307266931

Type:  Work  Name:    "War  and  Peace"  Author:    http://worldcat.org/entity/person/id/1234

Entity  (http://worldcat.org/entity/work/id/115206288)

Type:  Person  Name:    "Leo  Tolstoy  "  Born:    1828  Died:  1910  Birthplace:  http://worldcat.org/entity/place/id/8976

Entity  (http://worldcat.org/entity/person/id/1234) ⤵

Page 31: The Web of Data is Our Oyster

RecordTitle:    "War  and  Peace"  Author:    "Leo  Tolstoy  1828-­‐1910"  ISBN:  0307266931

Type:  Work  Name:    "War  and  Peace"  Author:    http://worldcat.org/entity/person/id/1234

Entity  (http://worldcat.org/entity/work/id/115206288)

Type:  Person  Name:    "Leo  Tolstoy  "  Born:    1828  Died:  1910  Birthplace:  http://worldcat.org/entity/place/id/8976

Entity  (http://worldcat.org/entity/person/id/1234)

Type:  Place  Name:    "Yasnaya  Polyana"  SameAs:    http://geonames.org/468686

Entity  (http://worldcat.org/entity/place/id/8976)

⤵⟶

Page 32: The Web of Data is Our Oyster
Page 33: The Web of Data is Our Oyster
Page 34: The Web of Data is Our Oyster
Page 35: The Web of Data is Our Oyster
Page 36: The Web of Data is Our Oyster
Page 37: The Web of Data is Our Oyster
Page 38: The Web of Data is Our Oyster
Page 39: The Web of Data is Our Oyster

Many great LD Projects

So today …..

Where are

we on t

he web?

Page 40: The Web of Data is Our Oyster

Where are

we on t

he web?

Page 41: The Web of Data is Our Oyster
Page 42: The Web of Data is Our Oyster
Page 43: The Web of Data is Our Oyster
Page 44: The Web of Data is Our Oyster
Page 45: The Web of Data is Our Oyster
Page 46: The Web of Data is Our Oyster
Page 47: The Web of Data is Our Oyster
Page 48: The Web of Data is Our Oyster
Page 49: The Web of Data is Our Oyster

Invisible

on the w

eb!

Page 50: The Web of Data is Our Oyster

Invisible

on the w

eb!

Page 51: The Web of Data is Our Oyster

Open  Linked  Data  -­‐  Silos

Library  Linked  Data

Page 52: The Web of Data is Our Oyster

British  Library

German  National  Library

Spanish  National  Library

Swedish  National  Library

Open  Linked  Data  -­‐  Silos

Library  Linked  Data

Page 53: The Web of Data is Our Oyster

British  Library

German  National  Library

Spanish  National  Library

Swedish  National  Library

Open  Linked  Data  -­‐  Silos

Library  Linked  Data

Page 54: The Web of Data is Our Oyster

British  Library

German  National  Library

Spanish  National  Library

Swedish  National  Library

Open  Linked  Data  -­‐  Silos

Library  Linked  Data

Page 55: The Web of Data is Our Oyster

British  Library

German  National  Library

Spanish  National  Library

Swedish  National  Library

Open  Linked  Data  -­‐  Silos

Library  Linked  Data

Page 56: The Web of Data is Our Oyster

British  Library

German  National  Library

Spanish  National  Library

Swedish  National  Library

Open  Linked  Data  -­‐  Silos

Library  Linked  Data

Page 57: The Web of Data is Our Oyster

British  Library

German  National  Library

Spanish  National  Library

Swedish  National  Library

Open  Linked  Data  -­‐  Silos

Library  Linked  Data

Page 58: The Web of Data is Our Oyster

British  Library

German  National  Library

Spanish  National  Library

Swedish  National  Library

Open  Linked  Data  -­‐  Silos

Library  Linked  Data

Page 59: The Web of Data is Our Oyster

British  Library

German  National  Library

Spanish  National  Library

Swedish  National  Library

Open  Linked  Data  -­‐  Silos

Library  Linked  Data

Page 60: The Web of Data is Our Oyster

British  Library

German  National  Library

Spanish  National  Library

Swedish  National  Library

Open  Linked  Data  -­‐  SilosBehind  A  Vocabulary  Barrier

Library  Linked  Data

Page 61: The Web of Data is Our Oyster
Page 62: The Web of Data is Our Oyster
Page 63: The Web of Data is Our Oyster

A  general  purpose  vocabulary  for  describing  things  on  the  web

Page 64: The Web of Data is Our Oyster

A  general  purpose  vocabulary  for  describing  things  on  the  web

"Used  by  5  million  

domains"

Page 65: The Web of Data is Our Oyster

A  general  purpose  vocabulary  for  describing  things  on  the  web

"Used  by  5  million  

domains" "25%  o

f  pages

 in  our  

indexe

s"

Page 66: The Web of Data is Our Oyster

A  general  purpose  vocabulary  for  describing  things  on  the  web

"Used  by  5  million  

domains" "25%  o

f  pages

 in  our  

indexe

s"

"15%  of  the  Web"

Page 67: The Web of Data is Our Oyster

A  general  purpose  vocabulary  for  describing  things  on  the  web

"Used  by  5  million  

domains" "25%  o

f  pages

 in  our  

indexe

s"

de  facto

y

"15%  of  the  Web"

Page 68: The Web of Data is Our Oyster

A  general  purpose  vocabulary  for  describing  things  on  the  web

"Used  by  5  million  

domains" "25%  o

f  pages

 in  our  

indexe

s"

de  facto

y

• Linked  Data  

"15%  of  the  Web"

Page 69: The Web of Data is Our Oyster

A  general  purpose  vocabulary  for  describing  things  on  the  web

"Used  by  5  million  

domains" "25%  o

f  pages

 in  our  

indexe

s"

de  facto

y

• Linked  Data  • Embedded  in  HTML

"15%  of  the  Web"

Page 70: The Web of Data is Our Oyster

A  general  purpose  vocabulary  for  describing  things  on  the  web

"Used  by  5  million  

domains" "25%  o

f  pages

 in  our  

indexe

s"

de  facto

y

• Linked  Data  • Embedded  in  HTML• RDFa,  Microdata,  JSON-­‐LD

"15%  of  the  Web"

Page 71: The Web of Data is Our Oyster

A  general  purpose  vocabulary  for  describing  things  on  the  web

"Used  by  5  million  

domains" "25%  o

f  pages

 in  our  

indexe

s"

de  facto

y

• Linked  Data  • Embedded  in  HTML• RDFa,  Microdata,  JSON-­‐LD• Descriptive  data

"15%  of  the  Web"

Page 72: The Web of Data is Our Oyster

A  general  purpose  vocabulary  for  describing  things  on  the  web

"Used  by  5  million  

domains" "25%  o

f  pages

 in  our  

indexe

s"

de  facto

y

• Linked  Data  • Embedded  in  HTML• RDFa,  Microdata,  JSON-­‐LD• Descriptive  data• Active  links

"15%  of  the  Web"

Page 73: The Web of Data is Our Oyster
Page 74: The Web of Data is Our Oyster

• Foundation  for  the  future  of  bibliographic  description

Page 75: The Web of Data is Our Oyster

• Foundation  for  the  future  of  bibliographic  description

• Eventual  replacement  for  Marc  21

Page 76: The Web of Data is Our Oyster

• Foundation  for  the  future  of  bibliographic  description

• Eventual  replacement  for  Marc  21

• Identify  information  entities

Page 77: The Web of Data is Our Oyster

• Foundation  for  the  future  of  bibliographic  description

• Eventual  replacement  for  Marc  21

• Identify  information  entities

• Conversion  from  Marc

Page 78: The Web of Data is Our Oyster

• Foundation  for  the  future  of  bibliographic  description

• Eventual  replacement  for  Marc  21

• Identify  information  entities

• Conversion  from  Marc

• Publish  in  RDF  –  Linked  Data

Page 79: The Web of Data is Our Oyster

• Foundation  for  the  future  of  bibliographic  description

• Eventual  replacement  for  Marc  21

• Identify  information  entities

• Conversion  from  Marc

• Publish  in  RDF  –  Linked  Data

• White  PaperCommon Ground: Exploring Compatibilities between the Linked Data Models of the Library of Congress and OCLC

http://oc.lc/CommonGround

Page 80: The Web of Data is Our Oyster
Page 81: The Web of Data is Our Oyster

Why  Catalog?

Page 82: The Web of Data is Our Oyster

Why  Catalog?So  we  can  find  things

Page 83: The Web of Data is Our Oyster

Why  Catalog?So  we  can  find  things

Why  Share  on  the  Web?

Page 84: The Web of Data is Our Oyster

Why  Catalog?So  we  can  find  things

Why  Share  on  the  Web?

So  today’s  users  can  find  our  things

Page 85: The Web of Data is Our Oyster

Where  are  our  users?

Page 86: The Web of Data is Our Oyster

Where  are  our  users?

Page 87: The Web of Data is Our Oyster

Getting  from  here  to  there

Page 88: The Web of Data is Our Oyster

Data from oneconverted record doesnot an entity make

Getting  from  here  to  there

Page 89: The Web of Data is Our Oyster

Data from oneconverted record doesnot an entity make

Transformation  into  Linked  Data  is  just  a  beginning  …

Getting  from  here  to  there

Page 90: The Web of Data is Our Oyster

Data from oneconverted record doesnot an entity make

Transformation  into  Linked  Data  is  just  a  beginning  …• Mine  and  analyze  the  aggregate

Getting  from  here  to  there

Page 91: The Web of Data is Our Oyster

Data from oneconverted record doesnot an entity make

Transformation  into  Linked  Data  is  just  a  beginning  …• Mine  and  analyze  the  aggregate• Identify,  map,  merge  -­‐  evidence  based

Getting  from  here  to  there

Page 92: The Web of Data is Our Oyster

Data from oneconverted record doesnot an entity make

Transformation  into  Linked  Data  is  just  a  beginning  …• Mine  and  analyze  the  aggregate• Identify,  map,  merge  -­‐  evidence  based• Relate  to  external  sources

Getting  from  here  to  there

Page 93: The Web of Data is Our Oyster

Data from oneconverted record doesnot an entity make

Transformation  into  Linked  Data  is  just  a  beginning  …• Mine  and  analyze  the  aggregate• Identify,  map,  merge  -­‐  evidence  based• Relate  to  external  sources• Establish  the  entities

Getting  from  here  to  there

Page 94: The Web of Data is Our Oyster
Page 95: The Web of Data is Our Oyster
Page 96: The Web of Data is Our Oyster
Page 97: The Web of Data is Our Oyster
Page 98: The Web of Data is Our Oyster
Page 99: The Web of Data is Our Oyster
Page 100: The Web of Data is Our Oyster

Entities  and  library  workflowsDiscovery

The  Name  of  the  Rose

Summary:  The  year  is  1327.  Franciscans  in  a  wealthy  Italian  abbey  are  suspected  of  heresy,  and  Brother  William  of  Baskerville  arrives  to  investigate.  His  delicate  mission  is  suddenly  overshadowed  by  seven  bizarre  deaths  that  take  place  in  seven  days  and  nights  of  apocalyptic  terror.  

Subjects

Borrowing  Options  eBooks  |  Printed  Books  |  Audio  Books  

Other  Languages  

!

Monastic  libraries  -­‐-­‐  Italy  –  Fiction  |  Semiotics  -­‐-­‐  Fiction  

Page 101: The Web of Data is Our Oyster

http://www.opte.org/

A  Web  of  Data  

Page 102: The Web of Data is Our Oyster

http://www.opte.org/

A  Web  of  Data  

Page 103: The Web of Data is Our Oyster

person place

object concept

organization work

The  solution  starts  here.

The  library  knowledge  graphA  graph  of  relationships

Page 104: The Web of Data is Our Oyster

person place

object concept

organization work

author

The  solution  starts  here.

The  library  knowledge  graphA  graph  of  relationships

Page 105: The Web of Data is Our Oyster

person place

object concept

organization work

author

subject

The  solution  starts  here.

The  library  knowledge  graphA  graph  of  relationships

Page 106: The Web of Data is Our Oyster

person place

object concept

organization work

author

subjectitem availability

The  solution  starts  here.

The  library  knowledge  graphA  graph  of  relationships

Page 107: The Web of Data is Our Oyster

The  library  knowledge  graphA  graph  of  relationships

person place

object concept

organization work

Page 108: The Web of Data is Our Oyster

What  will  be  better?

Page 109: The Web of Data is Our Oyster

The  library  knowledge  graphLots  of  things….if  we  do  it  right.

ILL  and  AnalyticsCataloging

Discovery Integration  with  the  web

What  will  be  better?

Page 110: The Web of Data is Our Oyster

Entities  and  library  workflowsCataloging

Cataloging  will  be  different…  

▪ Managing  the  quality  of  Works  

• Improving  clusters  

▪ Managing  the  quality  of  Persons  

• Links  to  works,  Other  IDs

Page 111: The Web of Data is Our Oyster

What  has  OCLC  done?

Page 112: The Web of Data is Our Oyster

What  has  OCLC  done?

So  what  progress  have  we  made?

Page 113: The Web of Data is Our Oyster

• 197+  million  Work  descriptions  and  URIs  • Schema.org  +  BiblioGraph.net  • RDF  Data  formats  

• RDF/XML,  Turtle,  Triples,  JSON-­‐LD  

• Links  to  WorldCat  manifestations  • Links  to  Dewey,  LCSH,  LCNAF,  VIAF,  FAST  • Open  Data  license  via  Linked  Data  Explorer  •  2015:  Discovery  API,  Metadata  API  

• Released  April  2014

http://www.oclc.org/dataThe  Work  Entity

Page 114: The Web of Data is Our Oyster

• 98+  million  Person  descriptions  and  URIs  • Person  entities  with  authority:  20.2  million  

• Person  entities  without  authority:  78.3  million  

• Schema.org  +  BiblioGraph.net  • Harvested  from  WorldCat  data  and  enriched  from  other  hubs  RDF  Data  formats  • RDF/XML,  Turtle,  Triples,  JSON-­‐LD  

• Links  to  WorldCat  Works.    Added  links  from  WC  Works.  • Open  Data  license  via  Linked  Data  Explorer  •  2015:  Linked  Data  Explorer,  Discovery  API

http://www.oclc.org/dataThe  Person  Entity

Page 115: The Web of Data is Our Oyster
Page 116: The Web of Data is Our Oyster

Can  we  measure  impact?

Page 117: The Web of Data is Our Oyster
Page 118: The Web of Data is Our Oyster

Monthly  Unique  Visitors

Page 119: The Web of Data is Our Oyster

OCLC  Entity  Based  Data  Strategy

2012  

2013

2010

Page 120: The Web of Data is Our Oyster

OCLC  Entity  Based  Data  Strategy✓VIAF,  ISNI,  FAST  Publish  Linked  Data✓WorldCat.org  Linked  Data  Release  –  using  Schema.org✓Data  mining  of  WorldCat  resources✓WorldCat  Works  Released  –  using  Schema.org✓Schema.org  added  to  VIAF  RDF✓WorldCat  Discovery  API  Returns  Schema.org  RDF  (Beta)

2012  

20142013

2010

Page 121: The Web of Data is Our Oyster

OCLC  Entity  Based  Data  Strategy✓VIAF,  ISNI,  FAST  Publish  Linked  Data✓WorldCat.org  Linked  Data  Release  –  using  Schema.org✓Data  mining  of  WorldCat  resources✓WorldCat  Works  Released  –  using  Schema.org✓Schema.org  added  to  VIAF  RDF✓WorldCat  Discovery  API  Returns  Schema.org  RDF  (Beta)

2012  

2014

➢Application  Integration  ➢WorldCat  Discovery  ➢Analytics  ➢Discovery  API  ➢Cataloging

2015

2013

2010

Page 122: The Web of Data is Our Oyster

OCLC  Entity  Based  Data  Strategy✓VIAF,  ISNI,  FAST  Publish  Linked  Data✓WorldCat.org  Linked  Data  Release  –  using  Schema.org✓Data  mining  of  WorldCat  resources✓WorldCat  Works  Released  –  using  Schema.org✓Schema.org  added  to  VIAF  RDF✓WorldCat  Discovery  API  Returns  Schema.org  RDF  (Beta)

2012  

2014

➢Application  Integration  ➢WorldCat  Discovery  ➢Analytics  ➢Discovery  API  ➢Cataloging

2015

➢More  Entities  Released  ➢Person  ➢Organization  ➢Event  ➢Concept

2013

2010

Page 123: The Web of Data is Our Oyster

OCLC  Entity  Based  Data  Strategy✓VIAF,  ISNI,  FAST  Publish  Linked  Data✓WorldCat.org  Linked  Data  Release  –  using  Schema.org✓Data  mining  of  WorldCat  resources✓WorldCat  Works  Released  –  using  Schema.org✓Schema.org  added  to  VIAF  RDF✓WorldCat  Discovery  API  Returns  Schema.org  RDF  (Beta)

2012  

2014

➢Application  Integration  ➢WorldCat  Discovery  ➢Analytics  ➢Discovery  API  ➢Cataloging

2015

➢More  Entities  Released  ➢Person  ➢Organization  ➢Event  ➢Concept

➢New  Products              ➢Continuing  Evangelism

➢New  Services➢Continuing  Innovation

2013

2016

2010

Page 124: The Web of Data is Our Oyster

!Many great Library Linked

Data Initiatives

Page 125: The Web of Data is Our Oyster

but!

Many great Library Linked Data Initiatives

Page 126: The Web of Data is Our Oyster

but!

Many great Library Linked Data Initiatives

If  users  can't  discover  our  resources

Page 127: The Web of Data is Our Oyster

but!

Many great Library Linked Data Initiatives

If  users  can't  discover  our  resources

What  is  the  point?

Page 128: The Web of Data is Our Oyster

but!

Many great Library Linked Data Initiatives

If  users  can't  discover  our  resources

What  is  the  point?

Give  the  Web  what  it  wants!

Page 129: The Web of Data is Our Oyster
Page 130: The Web of Data is Our Oyster

Linked  Data  has  benefits  for  library  workflows  ….

….by  giving  the  Web  what  it  wants

Page 131: The Web of Data is Our Oyster

The  Web  of  Data  is  Our  Oyster

Linked  Data  has  benefits  for  library  workflows  ….

….by  giving  the  Web  what  it  wants

Page 132: The Web of Data is Our Oyster

Richard  Wallis  OCLC  Technology  Evangelist  

@rjw

The  Web  of  Data  is  Our  Oyster

Page 133: The Web of Data is Our Oyster

Richard  Wallis  OCLC  Technology  Evangelist  

@rjw

The  Web  of  Data  is  Our  Oyster

http://slideshare.net/rjw