Introduction to the Semantic Web to the Semantic Web… · Introduction to the Semantic Web Erich...
Transcript of Introduction to the Semantic Web to the Semantic Web… · Introduction to the Semantic Web Erich...
Introduction to the Semantic Web
Erich BremerJanuary 20, 2015
W3CWorld Wide Web Consortium
395 Members as of today(this includes Stony Brook University)
The Semantic WebA new form of Web content that is meaningful to computers will
unleash a revolution of new possibilitiesBy Tim Berners-Lee, James Hendler and Ora Lassila
May 17, 2001
What is the Semantic Web trying to accomplish?
It is trying to enable the exchange of data between people and/or machines.
Semantic Web Layer Cake
Resource Description Framework (RDF)SPARQL (query language of the RDF)RDFS/OWL – Ontology LanguagesRules Interchange Format (RIF)
Resource Description Framework (RDF)
SUBJECT PREDICATE OBJECT<http://www.w3.org/People/Berners-Lee/card#i> <http://xmlns.com/foaf/0.1/knows> <http://www.cs.rpi.edu/~hendler/foaf.rdf>
An RDF Triple
Ontology – (RDFS/OWL)More RDF on TimBL More RDF on Hendler
Multiple triples form a multi-dimensional directed graph
RDF/XML
RDFa
Turtle
N-Triples
JSON-LDN3
TriG
N-Quads
I’m new!
TriX
Microdata
HDT
The Many Faces of RDF
The Many Faces of RDF
Turtle@prefix : http://xmlns.com/foaf/0.1/<http://www.w3.org/People/Berners-Lee/card#i> a :Person;
:knows <http://www.cs.rpi.edu/~hendler/foaf.rdf>;:lastName “Berners-Lee”;:firstName “Tim” .
N-Triples<http://www.w3.org/People/Berners-Lee/card#i> <http://xmlns.com/foaf/0.1/knows> <http://www.cs.rpi.edu/~hendler/foaf.rdf> .<http://www.w3.org/People/Berners-Lee/card#i> <http://xmlns.com/foaf/0.1/lastName> “Berners-Lee” .<http://www.w3.org/People/Berners-Lee/card#i> <http://xmlns.com/foaf/0.1/firstName> “Tim” .<http://www.w3.org/People/Berners-Lee/card#i> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Person> .
JSON-LD{ “@id”: http://www.w3.org/People/Berners-Lee/card#i,
@type: “:Person”,“knows”: http://www.cs.rpi.edu/~hendler/foaf.rdf,“lastName”: “Berners-Lee”,“firstName”: “Tim”,
“@context”: {“firstName”: “http://xmlns.com/foaf/0.1/firstName”,“lastName”: “http://xmlns.com/foaf/0.1/lastName”,“knows”: “http://xmlns.com/foaf/0.1/knows”,“Person”: “http://xmlns.com/foaf/0.1/Person”}
}
SPARQL Query Language
prefix : http://xmlns.com/foaf/0.1/ .select ?uri ?lastName ?firstName ?phonewhere {?uri
:lastName ?lastName;:firstName ?firstName;:phone ?phone}
limit 100
Linked Data Platform (LDP)
Create a RESTful style of accessing, adding, updating, and deleting RDF data
Web Ontology Language (OWL)
• Defines a particular domain of knowledge• Define relationships between predicates• Can limit domain and range of relationships• Enables inferencing of new data based on ontologies and known assertions
• The Nation Center for Biomedical Ontology (http://www.bioontology.org/)• The Open Biological and Biomedical Ontologies (http://www.obofoundry.org/)
Linked Data
Semantic Web in the Wild
VIVO
A web database of researchers and educators containing their publications, research interests, grants, presentations, and courses
Development was funded by a 12.2 million dollar stimulus grant from NIH
Created by Cornell, Indiana, University of Florida, Washington University in St. Louis, Weill Medical College, The Scripps Research Institute, Ponce School of Medicine - Puerto Rico
Cornell working with a new $1 million dollar grant to develop scholarly works ontology (2013)
Diagram maintained by Richard Cyganiak (DERI, NUI Galway) and Anja Jentzsch (Freie Universität Berlin)
LINKINGOPENDATAW3C SWEO Community Project
Diagram maintained by Richard Cyganiak (DERI, NUI Galway) and Anja Jentzsch (Freie Universität Berlin)
Diagram maintained by Richard Cyganiak (DERI, NUI Galway) and Anja Jentzsch (Freie Universität Berlin)
Diagram maintained by Richard Cyganiak (DERI, NUI Galway) and Anja Jentzsch (Freie Universität Berlin)
> 30,000,000,000 triples
Diagram maintained by Richard Cyganiak (DERI, NUI Galway) and Anja Jentzsch (Freie Universität Berlin)
Linked Data Milestones• Best Buy represents product catalogs in web as RDFa (2008)• Eagle-i consortium ($15 million from NIH) (2009)• Facebook Open Graph protocol (RDFa-based) (2010)• Google Buys Freebase (2010)• Google Recommends Using RDFa and the Good Relations Vocabulary (2010)• RDF supported in core of Drupal 7.0, Drupal powers over 634,580 web sites (2011)• IBM Watson in-part consumes Linked Data (2011)• NY Times rNews protocol (RDFa-based) (2011)• CTSA Consortium Recommends VIVO Ontology (2011)• Schema.org – Google, Yahoo, Bing (2011) and now Yandex too!• Library of Congress to move from MARC to Linked Open Data (2011) – see BIBFRAME• OCLC Linked Data Research – WorldCat – over 80 million triples (2012)• ORCID - http://orcid.org/ (2012)• CTSAconnect – (http://ctsaconnect.org) medical expertise ontology (2012)• Wikidata (2012) – regular exports in RDF (as of 1/5/2015 - > 500 million triples)• Google support JSON-LD within Gmail (2013)
Continued…
• (2013) Linked TCGA– ~20.4 billion triples (http://tcga.deri.ie/)• (Jan 2014) Bio2RDF - Release 2: 1 billion triples in 19 datasets• Freebase being retired in favor of Wikidata (June 2015)
schema.org
Who is using schema.org? (Good Relations)
GoogleYahoo!BestBuySears (15 Million items)Kmart (250,000 items)and 10,000 more:http://www.heppnetz.de/projects/goodrelations/
Who is else is using the SemWeb Tech?
Pretty much every webpage on the BBC website (heard of them?) now hits an RDF triple-store. Examples: http://www.bbc.co.uk/programmes/developers, http://www.bbc.co.uk/nature/feedsanddata, 60.000(!) BBC news items annotated with RDF (http://www.ontoba.com/blog/bbc-news-labs), etc.
NXP is one of the world's biggest makers of microprocessors (4.3b$). On data.nxp.com they have data on 26.000(!) products, internal triplestore (Dydra, 250K entitles, 2.5m triples) to drive a website, this is externally available, to make it part of a broader ecosystem.
Renault publishes configuration options for its cars in RDF http://www.slideshare.net/fpservant/ldow2013
Electricity de France generates 300.000 personalised energy bills using SemWeb technology: http://data.semanticweb.org/conference/iswc/2013/proceedings-2/paper-04/html
New York Times publishes Linked Open Data http://open.blogs.nytimes.com/2010/06/24/more-subject-headings-published-as-linked-open-data/?_r=0
Ad.ly (adds from celebreties) goes RDF http://www.slideshare.net/testac/how-hollywood-learned-to-love-the-semantic-web
Monster Board goes RDF: http://semanticweb.com/monster-offers-more-semantic-enabled-help-to-job-seekers-and-hr-staffers_b19673
Bill and Melinda Gates foundation goes RDF: http://priyankmohan.blogspot.com/2010/02/bill-and-melinda-gates-foundation_26.html
– Frank Van Harmele
• AllegroGraph – Franz Inc.• Virtuoso - Openlink Software• Apache TDB• Sesame• BigOWLIM• StarDog – Clark and Parsia• Bigdata – Systap, LLC – Bryan Thompson• 4Store - Garlik• Oracle• IBM DB2• On your web server in RDF-serialized Documents
Where do I put all of my triples?
• Apache Jena – includes TDB, Jena Core tools, ARQ, Fuseki, OWI API, Reasoning (Java)
• dotNetRDF – (C#)• N3.js – RDF Turtle parser (Javascript) • Rdflib – (Python)• Redland librdf – (C++)• Ruby RDF - https://github.com/ruby-rdf• Gruff – Free 2D RDF graph visualizer (end-user tool)
Semantic Software Tools
http://reach.suny.edu
vivo.stonybrookmedicine.edu
How many passwords do you have?
The hell if I can remember, click here to logon
Unhappy and demoralized user
WebIDWeb-scale Single Sign-on authenticationNo Passwords (uses digital certificates)Central Identity
Being developed by the W3C WebID Community Group75 Members, Henry Story – Chair
Originally, it went under the name foaf+sslThe term "WebID" was coined by Dan Brickley and Tim Berners-Lee in 2000
A Sample WebID profile<http://www.ebremer.com/foaf.rdf#me>
a foaf:Person ;foaf:family_name "Bremer" ;foaf:firstName "Erich" ;foaf:knows <http://bblfish.net/people/henry/card#me> ;foaf:knows http://tdiprima.com/foaf.rdf#me;foaf:status "Happy" ;cert:key [ a cert:RSAPublicKey ;
cert:exponent 65537 ;cert:modulus
"B8CD08202DB1C18A704AE4ABBF0C2C7A23C7FD9C8D85BC957645D91922BC79BF81C468A75A80629DD134CB639213B661E1762B365A85EFC37E110E6946A882A429103E2601C1EBC42F612635E8E5B9C968D53940FA9EDC4644B20A89E9CA2963232B0625D2FFED66AB5BB97E700D2ACCE91D972D37005370B01B3632CA43E42B0DD8DD283D5B811059547FA772480A2B5DB8781E69CF7DBD6BDC61A5165A2F659DE7904006A5AC4C70AA226FDCCEC514B9AE081C40E917368B4882F769F9A8BAA818E8BD72DBEDD3ABCF9BCB66BD9AAC916CD56F47FCAA3DA4337E99B4064DF4AF4F2869D751E35F324C8E06996D7B1491F55F1C7B05E5BD5D31C95919D25801"^^<http://www.w3.org/2001/XMLSchema#hexBinary> ] ;
W3CSemantic Web Health Care and Life Sciences (HCLS) Interest Group
http://www.w3.org/blog/hcls/The mission of the Semantic Web Health Care and Life Sciences Interest Group (HCLS IG) is to develop, advocate for, and support the use of Semantic Web technologies across health care, life sciences, clinical research and translational medicine. These domains stand to gain tremendous benefit from intra- and inter-domain application of Semantic Web technologies as they depend on the interoperability of information from many disciplines.
Linked open drug data for pharmaceutical research and development (2011)Matthias Samwald, Anja Jentzsch, Christopher Bouton, Claus Stie Kallesøe, Egon Willighagen, Janos Hajagos, M Scott Marshall, Eric Prud'hommeaux, Oktie Hassanzadeh, Elgar Pichler and Susie Stephens
Semantically enabling pharmacogenomic data for the realization of personalized medicine (2012)Samwald M, Coulet A, Huerga I, Powers RL, Luciano JS, Freimuth RR, Whipple F, Pichler E, Prud'hommeauxE, Dumontier M, Marshall MS.
A familiar name
• http://www.dataversity.net/category/webcasts/
• Yosemite Manifesto - http://yosemitemanifesto.org/ (2013)
The Yosemite Project: An RDF Roadmap for Healthcare Information Interoperability
Haylyn (haylyn.io)
WebID: http://www.w3.org/wiki/WebID
Tim Berners-Lee: The next Web of open, linked data - www.ted.com (2009)
VIVO – http://vivoweb.org
W3C – http://w3.org/standards/semanticweb
http://vivo.cornell.eduhttp://vivo.ufl.eduhttp://vivo.iu.eduhttp://reach.suny.edu
Additional Reading and Educational Links
BooksSemantic Web for the Working Ontologist – Hendler (2nd edition)Learning SPARQL – DuCharme (2nd edition)
LinksSemantic University - http://www.cambridgesemantics.com/semantic-universityW3C - http://www.w3.org/2013/data/
Questions?