The Search for Meaning and Semantics: Taxonomies Get It Done

30
Strategies Taxonomy June 9, 2014 Copyright 2014 Taxonomy Strategies. All rights reserved. The Search for Meaning and Semantics: Taxonomies Get It Done Joseph Busch – Why Semantics Matter

description

The Search for Meaning and Semantics: Taxonomies Get It Done. Joseph Busch – Why Semantics Matter . Agenda. Why semantics matter (… a quick review from 2001) What is semantic search, SKOS and Linked Data? Some semantic search examples?. Why Semantics Matter May 20, 2001. - PowerPoint PPT Presentation

Transcript of The Search for Meaning and Semantics: Taxonomies Get It Done

Page 1: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

StrategiesTaxonomy

June 9, 2014 Copyright 2014 Taxonomy Strategies. All rights reserved.

The Search for Meaning and Semantics: Taxonomies Get It Done

Joseph Busch – Why Semantics Matter

Page 2: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

2Taxonomy Strategies The business of organized information

Agenda

v Why semantics matter (… a quick review from 2001)v What is semantic search, SKOS and Linked Data?v Some semantic search examples?

Page 3: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

3Taxonomy Strategies The business of organized information

Why Semantics MatterMay 20, 2001

Page 4: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

4Taxonomy Strategies The business of organized information

When you own a Rembrandt you can spell his name any way you want.

Page 5: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

5Taxonomy Strategies The business of organized information

But when you want to find a Rembrandt … you better spell his name correctly.

Page 6: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

6Taxonomy Strategies The business of organized information

Vocabulary resources can help find the right artist even if their name is typed incorrectly.

Page 7: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

7Taxonomy Strategies The business of organized information

Users cannot type in the complex queries needed to find all the relevant items... But this can be done automatically.

Page 8: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

8Taxonomy Strategies The business of organized information

Complex queries are even more important when you search the entire web.

Page 9: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

9Taxonomy Strategies The business of organized information

So you find Rembrandt the Dutch guy...

Page 10: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

10Taxonomy Strategies The business of organized information

… And not Rembrandt the toothpaste.

Page 11: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

11Taxonomy Strategies The business of organized information

Getty Vocabularies Linked Data ServicesFebruary 19, 2014

Page 12: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

12Taxonomy Strategies The business of organized information

Agenda

v Why semantics matterv What is semantic search, SKOS and Linked Data?v Some semantic search examples?

Page 13: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

13Taxonomy Strategies The business of organized information

Search Failure

v 19% Character errors. (Young, et al)

v 40% Vocabulary errors. (Seaman. Norgard, et al)

v 20% Index confusion.v 21% Successful (Nielsen)

40%20%

19%21%

Page 14: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

14Taxonomy Strategies The business of organized information

Page 15: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

15Taxonomy Strategies The business of organized information

Semantic search solution

v Semantic search improves search accuracy by inferring the contextual meaning of terms via:

Disambiguation Part of speech (POS) analysis Synonyms, variations and quasi-synonyms Concept matching Natural language query analysis Key sentence detection

v Generate more consistent content to search on. v Correct user errors.v Map the language of users to the language of the target content.v Augment search results with linked data.

Page 16: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

16Taxonomy Strategies The business of organized information

What semantics do for search?

Function Description

Related search Query corrections … did you mean?

Concept search Query expansion with synonyms, abbreviations, acronyms, etc. … do you also want?

Ontology-based search Query expansion with narrower or broader terms; scoping exhaustive search results

Faceted search Dynamic filtering of search results; online shopping

Clustering Dynamically bucketing search results into pre-defined categories

Stored queries RSS feeds, alerts, SDI (selective dissemination of information), etc.

Personalization Weighting search results based on explicit profiles and implicit data (where you’ve been and what you’ve done)

Page 17: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

17Taxonomy Strategies The business of organized information

What is SKOS?

v Provides the basis for any user, tool, or program to identify, define and link concept vocabularies.

Relationship Definition

Concept A unit of thought, an idea, meaning, or category of objects or events. A Concept is independent of the terms used to label it.

Preferred Label A preferred lexical label for the resource such as a term used in a digital asset management system.

Alternate Label An alternative label for the resource such as a synonym or quasi-synonym.

Broader Concept Hierarchical link between two Concepts where one Concept is more general than the other.

Narrower Concept Hierarchical link between two Concepts where one Concept is more specific than the other.

Related Concept Link between two Concepts where the two are inherently "related", but that one is not in any way more general than the other.

Page 18: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

18Taxonomy Strategies The business of organized information

Subject Predicate Objectlc:sh85052028 skos:prefLabel Fringe parking

lc:sh85052028 skos:altLabel Park and ride systems

lc:sh85052028 skos:altLabel Park and ride

lc:sh85052028 skos:altLabel Park & ride

lc:sh85052028 skos:altLabel Park-n-ride

trt:Brddf skos:prefLabel Fringe parking

trt:Brddf skos:altLabel Park and ride

trt:Brddf skos:altLabel P&R system

Trt:Brdd skos:broader Parking

lc:sh85052028

Fringe parking

Park and ride

systems

Park and ride

CONCEPT

trt:BrddfPark &

ride

Park-n- ride

altLabel

altLabel

altLabel

prefLabel

prefLabel

altLabel

altLabel

P&R system

altLabel

broader

Parking

trt:Brdd

prefLabel

Page 19: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

19Taxonomy Strategies The business of organized information

Why SKOS? According to Alistair Miles* (SKOS co-author)

v Ease of combination with other standards Vocabularies are used in great variety of contexts.

– E.g., databases, faceted navigation, website browsing, linked open data, spellcheckers, etc.

Vocabularies are re-used in combination with other vocabularies.– E.g., Library of Congress Subject Headings +

Transportation Research Thesaurus; USPS states + USPS zip codes + US Congressional districts; etc.

v Flexibility and extensibility to cope with variations in structure and style

Variations between types of vocabularies– E.g., list vs. classification scheme

Variations within types of vocabularies – E.g., Z39.19-2005 monolingual controlled vocabularies and the

Transportation Research Thesaurus

* Head of Epidemiological Informatics at Oxford University Wellcome Trust Centre for Human Genetics (formerly OUP Senior Computing Officer)

Page 20: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

20Taxonomy Strategies The business of organized information

Why SKOS? (2)

v Publish managed vocabularies so they can readily be consumed by applications

Identify the concepts– What are the named entities?

Describe the relationships– Labels, definitions and other properties

Publish the data– Convert data structure to standard format– Put files on an http server (or load statements into an RDF server)

v Ease of integration with external applications Use web services to use or link to a published concept, or to one or more

entire vocabularies.– E.g., Google maps API, NY Times article search API, Linked open data; etc.

v A W3C standard like HTML, CSS, XML and RDF, RDFS, and OWL.

Page 21: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

21Taxonomy Strategies The business of organized information

Agenda

v Why semantics matterv What is semantic search, SKOS and Linked Data?v Some semantic search examples?

Page 22: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

22Taxonomy Strategies The business of organized information

Taxonomy browser

Page 23: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

23Taxonomy Strategies The business of organized information

Taxonomy-powered search results

Page 24: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

24Taxonomy Strategies The business of organized information

Audience ProductsLocationOrganization Content Type

Product Line

Application

Technology

Industry Solution

Person

Oracle.com top-level taxonomy

Has a

Is a

Page 25: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

25Taxonomy Strategies The business of organized information

Oracle event finderhttp://events.oracle.com/

Filter on Location and Language

More filters based on this result

Results shown on Google maps UI

Subscribe to RSS feed based on the criteria set on this page

Page 26: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

26Taxonomy Strategies The business of organized information

APS Taxonomy browser

Page 27: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

27Taxonomy Strategies The business of organized information

Linked data example

APS TaxonomyBroad Subject AreasMethods & TheoriesPhenomenaPhysical Systems Astronomical systems

Atomic-scale objectsBeamsComplex systemsDynamical systemsElectric & magnetic fieldsEngineered materialsFundamental particlesGases deleteInformation systemsLiquids deleteMaterialsNonlinear systemNucleiPlasmaQuasiparticles

Materials by CompositionMaterials by DimensionalityMaterials by PropertyMaterials by Structure

Elements by GroupGroup 1Group 2Group 3Group 4Group 5Group 6Group 7Group 8Group 9Group 10Group 11Group 12Group 13Group 14Group 15Group 16Group 17Group 18

Elements of the periodic table, and common isotopes

CadmiumCoperniciumMercuryZinc 194Hg

196Hg198Hg199Hg200Hg201Hg202Hg204Hg

A faceted taxonomy of concepts in physics

Page 28: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

28Taxonomy Strategies The business of organized information

Paper submission tagging (prototype)

Page 29: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

29Taxonomy Strategies The business of organized information

QUESTIONS

Joseph A BuschMobile 415-377-7912

[email protected]

Page 30: The Search  for Meaning  and Semantics: Taxonomies Get It  Done

30Taxonomy Strategies The business of organized information

Session description

v Semantic search – a phrase that is increasingly used in the popular as well as the professional literature. What does it look like, and how will it work. Panelists will present their visions of semantic search. Program is designed to be interactive with audience participation – suggestions for functions and features they see in the future.

What is semantic search? What are the components of semantic search? How can it be used in libraries?