Semantic search

Post on 14-Jan-2015

3.818 views 2 download

Tags:

description

Semantic search based on thesaurus management. The use of thesauri improves search engines. Scenarios will be shown in this presentation.

Transcript of Semantic search

Semantic Search

Andreas Blumauer

powered by

Agenda

• Semantic search scenarios

• The role of thesauri in semantic search

• PoolParty Semantic Search

– Live Demo – http://bit.ly/semantic_search

2

3

Semantic search scenarios

Semantic search has many faces

htt

p:/

/ww

w.f

lickr

.co

m/p

ho

tos/

tech

bu

rst/

27

96

42

12

48

/

Situations in which semantic search can help

4

I can´t remember how

to spell the search term

I can´t remember how

to spell the search term

I can´t remember

exactly what I was looking for

I can´t remember

exactly what I was looking for

I want to gain background

knowledge to a certain

document

I want to gain background

knowledge to a certain

document

I want to know more about

this entity in a certain context.

I want to know more about

this entity in a certain context.

I want to see facts from

different sources describing this

entity.

I want to see facts from

different sources describing this

entity.

I want to search in different

languages simultaneously

I want to search in different

languages simultaneously

I forgot some of the names for the entity I´m looking for

I forgot some of the names for the entity I´m looking for

I want the software to understand

what I mean by „Jaguar“

I want the software to understand

what I mean by „Jaguar“

Types of Search

5

(1) Pinpoint-Search (factual knowledge) i.e. „Eyjafjallajökull“

(2) Collecting and Researchi.e. „volcanoes in Iceland“

(3) Associative Search / Discovery of interrelationsi.e. Volcanic eruptions and climate change

(4) Questions & Answersi.e. „Which volcanoes in Iceland erupted in 2010?“

(5) Serendipity Discovery of unexpected, but relevant contexts and coherencesLe

arni

ng/N

avig

atin

g

S

earc

h

Four demands for a smarter search

1. Find information faster Provide search assistants

2. Reveal hidden information Enrich the search index with background knowledge

3. Find more specific informationQuery the semantic web

4. Find linked informationIntegrate data sources

6

Find information faster – Auto-Complete

7

I can´t remember how

to spell the search term

I can´t remember how

to spell the search term

To provide powerful auto-complete also for enterprise searchscenarios you need to establish an enterprise vocabulary.To provide powerful auto-complete also for enterprise searchscenarios you need to establish an enterprise vocabulary.

Find information faster – Status quo

8

hydropower plantsSearc

h

I can´t remember

exactly what I was looking for

I can´t remember

exactly what I was looking for

Small hydro

Search

Find information faster with related search terms

9

hydropower plantsSearc

h

http://www.reegle.info/clean-energy-search

Reveal hidden information – Status quo

10

SNCRSearc

h

SNCR OR „Selective non-

Search

I forgot some of the names for the entity I´m looking for

I forgot some of the names for the entity I´m looking for

Reveal hidden information with query expansion

11

SNCRSearc

hOR "selective non catalytic reduction"

SNCR

selective non catalytic reduction

alternative Label

preferred Label

Multi-lingual search based on a thesaurus

12

clean energy Searc

hOR energía limpia

clean energy

energía limpiapreferredLabel @es

preferred Label @en

I want to search in different

languages simultaneously

I want to search in different

languages simultaneously

Reveal hidden information and relations

13

Find documentsor images relatedto any other text.

http://poolparty.punkt.at/demozone

I want to gain background

knowledge to a certain

document

I want to gain background

knowledge to a certain

document

Find more specific information – Status quo

14

Goldman SachsSearc

h

3 different contexts for„Goldman Sachs“:• Bond issuer• Analyst• Stock

3 different contexts for„Goldman Sachs“:• Bond issuer• Analyst• Stock

I want to know more about

this entity in a certain context.

I want to know more about

this entity in a certain context.

Find more specific information with faceted search

15

facets supportstructured queries

facets helpto drill down search results,adapt dynamically

Zero-result querieswon´t happen anymore

Complex queries with faceted search over linked data

16

„Show me all airlines whose parent company is Lufthansa“

„Show me all airlines whose parent company is Lufthansa“

http://dbpedia.neofonie.de/

My Energy-Dossier about

Find linked information – Status quo

17

I want to see facts from

different sources describing this

entity.

I want to see facts from

different sources describing this

entity.

The user has to put together manually energy-relatedinformation about a country.

The user has to put together manually energy-relatedinformation about a country.

360O views: Find linked information

18

Energy-relatedinformation about countriesare „mashed“ automaticallyby using „linked data“

Energy-relatedinformation about countriesare „mashed“ automaticallyby using „linked data“

http://www.reegle.info/countries

Add personal context to the search

19

I want the software to understand

what I mean by „Jaguar“

I want the software to understand

what I mean by „Jaguar“

JaguarSearc

h

20

The role of thesauri in semantic search

How vertical search can benefit from knowledge models

htt

p:/

/ww

w.f

lickr

.co

m/p

ho

tos/

tech

bu

rst/

27

96

42

12

48

/

The role of thesauri in semantic search

21

The role of thesauri in semantic search (contd.)

22

Thesaurus as the central pointto control:

•labels & query expansion•facets•refine search mechanisms•metadata integration

Thesaurus as the central pointto control:

•labels & query expansion•facets•refine search mechanisms•metadata integration

Data integration and schema mapping based on thesauri

23

<person> Thomas Miller</person>

Source 1

<employee> Tom Miller</employee>

Source 2

24

PoolParty Semantic Search (PPSS)

Make semantic search come true!

htt

p:/

/ww

w.f

lickr

.co

m/p

ho

tos/

tech

bu

rst/

27

96

42

12

48

/

PoolParty System Architecture

25

Search Services

Search Application

Collector<xml>

Semantic Indexer

Document Index

Cartridge

Indexing and Mapping with PoolParty

• Metadata Standards

– Rich metadata in a standardized, extensible format (SKOS / RDF)

– Document metadata is mapped to concepts in the thesaurus

• Cost efficient metadata management

– Thesaurus is managed with PoolParty´s easy-to-use Thesaurus Manager

– One central metadata repository

• Improved end-user experience

– Semantic information improves search experience26

PoolParty Search API & Standard GUI

27

• Available web services:• Search Service• Suggest Service• Similarity Service

• Supported formats:• JSON• XML• RSS

• Available web services:• Search Service• Suggest Service• Similarity Service

• Supported formats:• JSON• XML• RSS

http://bit.ly/semantic_search

PoolParty Semantic Search Demo – Background Info

2828

Search Services

Search Application

CollectorSemantic Indexer

Document Index

Cartridge

Thesaurus

Projects DB

Web catalogueof actors

Actors DB

Data integration based on Reegle thesaurus

29

<sector> Hydro Power small scale</sector>

Actors DB

<category> Micro Hydro</category>

Web catalogue

PoolParty Semantic Search Demo – Result

30

http://bit.ly/semantic_search

select properfacets

store querieswith search basket

facets supportstructured queries

find similar documents forrelevant results

specify your querywith categorisedauto-complete

Contact

Andreas BlumauerManaging Directorblumauer@punkt.at+43-1-8974122-27

Alexander KreiserSystem Architectkreiser@punkt.at +43-1-8974122-38

31

punkt. netServices GmbH

Lerchenfelder Guertel 43A—1160 Wien / Austria

http://www.punkt.at/ http://poolparty.punkt.at/

http://bit.ly/semantic_searchhttp://lod2.eu/

http://twitter.com/PoolParty_Teamhttp://www.facebook.com/pages/

PoolParty/134189476633941

powered by