1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam...

13
1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms “appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli et omnes bestias terrae...“ Genesis, 2:20 Orri Erling - Program Manager, Virtuoso Yrjänä Rankka - Developer, Virtuoso

Transcript of 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam...

Page 1: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

1© 2008 OpenLink Software, All rights reserved.

On The Evolution of Terms

“appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli et omnes bestias terrae...“

Genesis, 2:20

Orri Erling - Program Manager, VirtuosoYrjänä Rankka - Developer, Virtuoso

Page 2: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

2© 2008 OpenLink Software, All rights reserved.

“Adam called all things by their names”

We are not the first to tryThe history of the "search for the perfect language"

favors natural languagesWhat gets used becomes fit for the taskLanguage building from scratch has generally not

been successfulForcing nature to fit preconceived, idealized

taxonomies has generally failed

2

Page 3: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

3© 2008 OpenLink Software, All rights reserved.

Usage for Linked Data

New layer to the document webUse de-referenceable HTTP URIsUse #this to distinguish subject matter from

documentReuse terms where canHuman readable URI's are bestFrom HTML to XML, most formats are somewhat

human readable, same for the data web

Page 4: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

4© 2008 OpenLink Software, All rights reserved.

Universal “Data Language”?

RDF will do for grammar Useful vocabulary can only evolve in a

community of practiceDistributed evolution of vocabulary and

diversity of names for things is a givenApplication determines the data structure and

vocabulary

Page 5: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

5© 2008 OpenLink Software, All rights reserved.

Emerging Hubs

SIOC/FOAF For Web Data SpacesDbpedia/UMBEL/OpenCYC for names of real

world entitiesBFO for epistemology of structures and

processes

Page 6: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

6© 2008 OpenLink Software, All rights reserved.

Can’t Put Genie Back in the Bottle

Once data is out, there it staysSingle identifiers for entities are possible only

in closed, application specific DWs Heterogeneity of names, overlap of

descriptions, taxonomies etc is a givenMeaning exists only in context, so make this

explicit

6

Page 7: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

7© 2008 OpenLink Software, All rights reserved.

Infrastructure Scenarios

Application-specific warehouse or mapped RDBMS

General warehouse with lots of graphs, a la search engine or Billion Triples Challenge

Query driven harvesting a la OpenLink Sponger

On-line discovery and federated SPARQL

Page 8: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

8© 2008 OpenLink Software, All rights reserved.

Implications for Query

Make it explicitReport what data sets and SameAs's and

graphs went into producing an answerAllow the app to explicitly choose what

graphs, SameAs's, taxonomies, etc. are considered

Search and discoverability will drive vocabulary convergence

Page 9: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

9© 2008 OpenLink Software, All rights reserved.

Sameness

SameAs and equivalent are necessary and permanent features

What can be considered identical depends on context Universal agreement will not happen, so let people

choose whose SameAs they trust SameAs adds query cost and must be resolved at time

of query SameAs cannot be forward chained at web scale

because which of them are relevant is not fixed, Malicious/SPAM SameAs is inevitable

Page 10: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

10

© 2008 OpenLink Software, All rights reserved.

Implications For Publishers

SPARQL and data self description practices are urgently needed

Describe what names are used and what other data meshes with yours

Explicit license

Page 11: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

11

© 2008 OpenLink Software, All rights reserved.

Implications for Entity NS

DNS is good because of distributed, resilient storage and admin

Convergence cannot be forced but should be encouraged

Make administration compartmentalized a la DNS for No SPAM, No censorship

People can say things in their own spacesOffer classification, e.g. UMBEL In searching for terms, rank most reused the

highest

11

Page 12: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

12

© 2008 OpenLink Software, All rights reserved.

Conclusion

Since uniformity is impossible, make diversity of identifiers explicit

Build alongside the document web, de-referenceably and with #this

Encourage reuse but allow innovationNeeds of communities will differ according to

stage of development

Page 13: 1 © 2008 OpenLink Software, All rights reserved. On The Evolution of Terms appellavitque Adam nominibus suis cuncta animantia et universa volatilia caeli.

13

© 2008 OpenLink Software, All rights reserved.

OpenLink Software

Thank You!http://www.openlinksw.com