Day 2, workshop 3

Semantic Web for Cultural Heritage. Geertje Jacobs (Rijksmuseum), Michiel Hilderbrand (CWI/VU), Maarten Heerlien (Naturalis), Hans Nederbragt (Trezorix). DISH2009, 10-12-2009

Transcript of Day 2, workshop 3

Page 1: Day 2, workshop 3

Semantic Web for Cultural Heritage

Geertje Jacobs - Rijksmuseum
Michiel Hilderbrand - CWI/VU
Maarten Heerlien - Naturalis
Hans Nederbragt - Trezorix

DISH2009, 10-12-2009

Page 2: Day 2, workshop 3

overview

Semantic Web for Cultural Heritage | DISH2009 | 10-12-2009

1. introduction to the use cases
2. what is the semantic web
3. remodelling data for findability
4. reusing web resources for subject annotation
5. an interface to integrated collections
6. organizational aspects
7. lessons learned, discussion and conclusion

Page 3: Day 2, workshop 3

introduction to the use cases | Teylers Universum

Teylers Universum

Page 4: Day 2, workshop 3

Teylers Universum

heterogeneous collections:

• art objects
• books
• numismatic objects
• old instruments
• mineralogical objects
• paleontological objects

Page 5: Day 2, workshop 3

Teylers Universum

flexible and expandable modelling:

• "regular" intrinsic properties of objects
• but also extendible types like: events, roles, statements

Page 6: Day 2, workshop 3

introduction to the use cases | Rijksmuseum, VU/CWI

Rijksmuseum annotation tool

Page 7: Day 2, workshop 3

Subject matter annotation

• semantic technology to improve subject matter annotation
• collaboration between museum specialists and academics
• goals:
  explore the feasibility of reusing existing web sources
  requirements for integration in the museum's workflow
• approach: iterative design of a prototype (11 weeks)

Page 8: Day 2, workshop 3

introduction to the use cases | Sterna

STERNA

Page 9: Day 2, workshop 3

STERNA

Semantic Web-based Thematic European Reference Network Application

Page 10: Day 2, workshop 3

STERNA

• heterogeneous collections
• a distributed environment
• a multilingual environment
• link with Europeana

Page 11: Day 2, workshop 3

what is the semantic web

What is the Semantic Web?

Page 12: Day 2, workshop 3

The Web: “open” documents and links

[Diagram: two URLs connected by a web link]

Page 13: Day 2, workshop 3

The Semantic Web: “open” data and links

[Diagram: two URLs connected by a web link labelled "creator" (Dublin Core)]

Painting: “Green Stripe (Mme Matisse)”, Royal Museum of Fine Arts, Copenhagen

Painter: “Henri Matisse”, Getty ULAN
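The statement above (painting, creator, painter) can be written down as a single subject, predicate, object triple. What follows is a minimal sketch in Python, not the presenters' tooling. The two example.org URIs are hypothetical placeholders, because the slide does not give the real museum or ULAN identifiers; only the Dublin Core creator property URI is an existing one.

```python
# A Semantic Web statement is a (subject, predicate, object) triple of URIs.
DC_CREATOR = "http://purl.org/dc/elements/1.1/creator"  # Dublin Core "creator"

# Hypothetical identifiers, for illustration only (not the real URIs).
PAINTING = "http://example.org/museum/green-stripe"
PAINTER = "http://example.org/ulan/henri-matisse"

triple = (PAINTING, DC_CREATOR, PAINTER)


def to_ntriples(subject, predicate, obj):
    """Serialize one triple of URIs as a single N-Triples line."""
    return f"<{subject}> <{predicate}> <{obj}> ."


line = to_ntriples(*triple)
print(line)
```

Because both ends of the link are URIs, the museum record and the thesaurus record can live on different servers and still be connected.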

Page 14: Day 2, workshop 3

The information is there

Page 15: Day 2, workshop 3

The information is there

Page 16: Day 2, workshop 3

Web 1.0 to Web 3.0

Page 17: Day 2, workshop 3

Web 1.0 to Web 3.0

Page 18: Day 2, workshop 3

Make data accessible for linking and reuse by man and machine

• provide unique identifiers on the Web: HTTP URIs for each term
  http://purl.org/vocabularies/getty/aat/Fauve
• use a commonly agreed upon format, processable by machines (RDF)
• shared vocabularies (Dublin Core, SKOS, ...)

Page 19: Day 2, workshop 3

SKOS

Simple Knowledge Organization System

Page 20: Day 2, workshop 3

SKOS

Simple Knowledge Organization System

<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:skos="http://www.w3.org/2004/02/skos/core#">
  <skos:Concept rdf:about="http://www.example.com/concepts#shrubs">
    <skos:prefLabel xml:lang="en">shrubs</skos:prefLabel>
    <skos:altLabel xml:lang="en">bushes</skos:altLabel>
    <skos:prefLabel xml:lang="fr">arbuste</skos:prefLabel>
    <skos:altLabel xml:lang="fr">buisson</skos:altLabel>
  </skos:Concept>
</rdf:RDF>
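To show that such a SKOS record is processable by machines, the sketch below reads the concept above with Python's standard xml.etree.ElementTree module and groups its labels per language. The function name read_labels and the shape of the returned dictionary are choices made for this illustration, not part of SKOS or of the workshop material.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
SKOS = "http://www.w3.org/2004/02/skos/core#"
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"  # how ElementTree names xml:lang

skos_xml = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:skos="http://www.w3.org/2004/02/skos/core#">
  <skos:Concept rdf:about="http://www.example.com/concepts#shrubs">
    <skos:prefLabel xml:lang="en">shrubs</skos:prefLabel>
    <skos:altLabel xml:lang="en">bushes</skos:altLabel>
    <skos:prefLabel xml:lang="fr">arbuste</skos:prefLabel>
    <skos:altLabel xml:lang="fr">buisson</skos:altLabel>
  </skos:Concept>
</rdf:RDF>"""


def read_labels(xml_text):
    """Return {concept URI: {language: {"pref": label, "alt": [labels]}}}."""
    concepts = {}
    root = ET.fromstring(xml_text)
    for concept in root.findall(f"{{{SKOS}}}Concept"):
        uri = concept.get(f"{{{RDF}}}about")
        per_language = concepts.setdefault(uri, {})
        for child in concept:
            lang = child.get(XML_LANG)
            entry = per_language.setdefault(lang, {"pref": None, "alt": []})
            if child.tag == f"{{{SKOS}}}prefLabel":
                entry["pref"] = child.text
            elif child.tag == f"{{{SKOS}}}altLabel":
                entry["alt"].append(child.text)
    return concepts


labels = read_labels(skos_xml)
print(labels["http://www.example.com/concepts#shrubs"]["fr"])
```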

Page 21: Day 2, workshop 3

remodelling data for findability | Teylers Universum and Sterna

Remodelling data for findability

Teylers Universum and Sterna

flexible and expandable modelling

Page 22: Day 2, workshop 3

title : Travel by box
type : book
creator : Hugo de Groot (1583-1645)
date of creation : 1622
place of creation : Paris
publisher : Lodewijk Elsevier (1540-1617, ca.)
date of publication : 1623
place of publication : Leiden
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

Page 23: Day 2, workshop 3

title : Travel by box
type : book
creator : Hugo de Groot (1583-1645)
role : author
date of creation : 1622
place of creation : Paris
publisher : Lodewijk Elsevier (1540-1617, ca.)
date of publication : 1623
place of publication : Leiden
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

Page 24: Day 2, workshop 3

title : Travel by box
type : book
creator : Hugo de Groot (1583-1645)
creator : Rembrandt van Rijn (1606-1669)
role : author
role : illustrator
date of creation : 1622
place of creation : Paris
publisher : Lodewijk Elsevier (1540-1617, ca.)
date of publication : 1623
place of publication : Leiden
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

Page 25: Day 2, workshop 3

title : Travel by box
type : book
creator : Hugo de Groot (1583-1645)
creator : Rembrandt van Rijn (1606-1669)
role : author
role : illustrator
date of creation : 1622
place of creation : Paris
publisher : Lodewijk Elsevier (1540-1617, ca.)
date of publication : 1623
place of publication : Leiden
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

Page 26: Day 2, workshop 3

title : Travel by box
type : book
creator : Hugo de Groot (1583-1645)
creator : Rembrandt van Rijn (1606-1669)
role : author
role : illustrator
creator year of birth : 1583
creator year of birth : 1606
creator year of death : 1645
creator year of death : 1669
date of creation : 1622
place of creation : Paris
publisher : Lodewijk Elsevier (1540-1617, ca.)
publisher date of birth : 1540
publisher date of death : 1617 (ca.)
date of publication : 1623
place of publication : Leiden
publication spec : 250 pages, illustrated
language : Dutch
…

Page 27: Day 2, workshop 3

title : Travel by box
type : book
creator : Hugo de Groot (1583-1645)
creator : Rembrandt van Rijn (1606-1669)
role : author
role : illustrator
date of creation : 1622
publisher : Lodewijk Elsevier (1540-1617, ca.)
date of publication : 1623
place of publication : Leiden
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

Page 28: Day 2, workshop 3

title : Travel by box
event :
type : book
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

Page 29: Day 2, workshop 3

title : Travel by box
event : Travel by box [creation] 1
type : book
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

event name : Travel by box [creation] 1
type : creation
actor/role : Hugo de Groot [author]
date : 1622
location : Paris
notes :

Page 30: Day 2, workshop 3

title : Travel by box
event : Travel by box [creation] 1
event : Travel by box [creation] 2
type : book
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

event name : Travel by box [creation] 1
type : creation
actor/role : Hugo de Groot [author]
date : 1622
location : Paris
notes :

event name : Travel by box [creation] 2
type : creation
actor/role : Rembrandt van Rijn [illustrator]
date : 1622
location : Amsterdam
notes :

Page 31: Day 2, workshop 3

title : Travel by box
event : Travel by box [creation] 1
event : Travel by box [creation] 2
event : Travel by box [publication]
type : book
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

event name : Travel by box [creation] 1
type : creation
actor/role : Hugo de Groot [author]
date : 1622
location : Paris
notes :

event name : Travel by box [creation] 2
type : creation
actor/role : Rembrandt van Rijn [illustrator]
date : 1622
location : Amsterdam
notes :

event name : Travel by box [publication]
type : publication
actor/role : Lodewijk Elsevier [publisher]
date : 1623
location : Leiden
notes :
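The step from flat fields to linked event records can be sketched in a few lines of Python. This is only an illustration of the modelling idea shown above, assuming simple in-memory records; the class names Event and ObjectRecord are invented for this example and do not come from the Teylers Universum or Sterna systems.

```python
from dataclasses import dataclass, field


@dataclass
class Event:
    name: str
    type: str        # e.g. "creation", "publication"
    actor_role: str  # actor plus role, e.g. "Hugo de Groot [author]"
    date: str
    location: str
    notes: str = ""


@dataclass
class ObjectRecord:
    title: str
    type: str
    events: list = field(default_factory=list)  # any number of events


book = ObjectRecord(title="Travel by box", type="book")
book.events.append(Event("Travel by box [creation] 1", "creation",
                         "Hugo de Groot [author]", "1622", "Paris"))
book.events.append(Event("Travel by box [creation] 2", "creation",
                         "Rembrandt van Rijn [illustrator]", "1622", "Amsterdam"))
book.events.append(Event("Travel by box [publication]", "publication",
                         "Lodewijk Elsevier [publisher]", "1623", "Leiden"))

# Every place connected to the book, whatever the kind of event.
places = [event.location for event in book.events]
print(places)
```

A second creator or a later change of ownership becomes one more Event, so the object record itself never needs extra "creator 2" or "place of ..." fields.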

Page 32: Day 2, workshop 3

title : Travel by box
event : Travel by box [creation] 1
event : Travel by box [creation] 2
event : Travel by box [publication]
type : book
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

event name : Travel by box [creation] 1
type : creation
actor/role : Hugo de Groot [author]
date : 1622
location : Paris
notes :

event types : creation, birth, death, ownership, printing, publication, shipwreck, …

Page 33: Day 2, workshop 3

title : Travel by box
event : Travel by box [creation] 1
event : Travel by box [creation] 2
event : Travel by box [publication]
type : book
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

event name : Travel by box [creation] 1
type : creation
actor/role : Hugo de Groot [author]
date : 1622
location : Paris
notes :

actor/role name : Hugo de Groot [author]
actor : Hugo de Groot
role : author
date : 1601-1645

actor name : Hugo de Groot
event :
notes :

roles : author, illustrator, owner, painter, printer, publisher, …

Page 34: Day 2, workshop 3

actor name : Hugo de Groot
event :
notes :

title : Travel by box
event : Travel by box [creation] 1
event : Travel by box [creation] 2
event : Travel by box [publication]
type : book
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

Page 35: Day 2, workshop 3

actor name : Hugo de Groot
event : Hugo de Groot [birth]
event : Hugo de Groot [death]
notes :

title : Travel by box
event : Travel by box [creation] 1
event : Travel by box [creation] 2
event : Travel by box [publication]
type : book
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

event name : Hugo de Groot [birth]
type : birth
actor/role : Jan de Groot [father]
actor/role : Alida van Overschie [mother]
date : 1583
location : Delft
notes :

event name : Hugo de Groot [death]
type : death
actor/role :
date : 1645
location : Rostock
notes : died after a shipwreck

locations : Amsterdam, Delft, Leiden, Paris, Rostock, …

Page 36: Day 2, workshop 3

title : Travel by box
event : Travel by box [creation] 1
event : Travel by box [creation] 2
event : Travel by box [publication]
type : book
publication spec : 250 pages, illustrated
language : Dutch
size : 29,7 X 21 (cm, h x b)
registration nr : tzx091012
notes : recently discovered

actor/role name : Hugo de Groot [author]
actor : Hugo de Groot
role : author
date : 1601-1645

referrers :
Adamus Exul [creation]
De republica emendanda [creation]
Parallelon rerumpublicarum [creation]
De Indis [creation]
Christus patiens [creation]
Mare Liberum [creation]
De antiquitate reipublicae Batavicae [creation]
Travel by box [creation] 1
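The "referrers" list is a reverse lookup: every event that points to an actor/role is found again from the actor/role side. Below is a minimal sketch in Python, assuming the links are available as simple (event name, actor/role) pairs. The helper name build_referrers is hypothetical, and only a few of the events named on the slides are included.

```python
from collections import defaultdict

# (event name, actor/role) pairs, taken from the records shown above.
event_links = [
    ("Mare Liberum [creation]", "Hugo de Groot [author]"),
    ("De Indis [creation]", "Hugo de Groot [author]"),
    ("Travel by box [creation] 1", "Hugo de Groot [author]"),
    ("Travel by box [creation] 2", "Rembrandt van Rijn [illustrator]"),
    ("Travel by box [publication]", "Lodewijk Elsevier [publisher]"),
]


def build_referrers(links):
    """Map each actor/role to the list of events that refer to it."""
    index = defaultdict(list)
    for event_name, actor_role in links:
        index[actor_role].append(event_name)
    return dict(index)


referrers = build_referrers(event_links)
for event_name in referrers["Hugo de Groot [author]"]:
    print(event_name)
```

Nothing is stored twice: the list of works by an author is derived from the events, which is what makes the newly discovered book show up next to the known works.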

Page 37: Day 2, workshop 3

"legacy systems" versus RDF

• you need an RDF-like infrastructure to handle this type of modelling
• "classic" databases are reliable and fast, but they are not very flexible
• "classic" indexers are reliable and fast, but they are not good at complicated queries
• RDF structures are very flexible, and the RDF format is very useful for complicated queries
• but RDF technology is not yet mature
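What "complicated queries" means here can be illustrated with a toy triple store: a question such as "which objects were created by someone who died in Rostock?" follows several links in a row. The sketch below is plain Python and not an RDF engine. The predicate names ("event", "actor", "type", "location") are simplifications invented for this example, and a real system would use an RDF store with a query language such as SPARQL.

```python
# Each fact is a (subject, predicate, object) triple.
triples = {
    ("Travel by box", "event", "Travel by box [creation] 1"),
    ("Travel by box [creation] 1", "actor", "Hugo de Groot"),
    ("Travel by box", "event", "Travel by box [creation] 2"),
    ("Travel by box [creation] 2", "actor", "Rembrandt van Rijn"),
    ("Hugo de Groot", "event", "Hugo de Groot [death]"),
    ("Hugo de Groot [death]", "type", "death"),
    ("Hugo de Groot [death]", "location", "Rostock"),
}


def match(s=None, p=None, o=None):
    """Return all triples that fit the pattern; None acts as a wildcard."""
    return [t for t in triples
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)]


def objects_by_actor_who_died_in(place):
    """Follow place -> death event -> person -> creation event -> object."""
    results = set()
    for death, _, _ in match(p="location", o=place):
        if not match(s=death, p="type", o="death"):
            continue  # an event at that place, but not a death
        for person, _, _ in match(p="event", o=death):
            for creation, _, _ in match(p="actor", o=person):
                for obj, _, _ in match(p="event", o=creation):
                    results.add(obj)
    return sorted(results)


found = objects_by_actor_who_died_in("Rostock")
print(found)
```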

Page 38: Day 2, workshop 3

reusing web resources for subject annotation | Rijksmuseum annotation tool

Reusing web resources for subject annotation

Rijksmuseum annotation tool

Page 39: Day 2, workshop 3

Print Room online

• registration and digitization project (3 years), 30,000 objects a year
• high quality enclosure, registration by specialists
• basic properties: e.g. object ID, storage location, title, creator and measurements
• properties for better accessibility: high quality digital scans, subject matter annotations

Page 40: Day 2, workshop 3

Subject annotation

1. focus on the main subject (due to required throughput rates)
2. the cataloguers are instructed to only describe the
   WHO (depicted person/organization)
   WHAT (event name)
   WHERE (place)
   WHEN (date)
3. to use the existing thesauri of the Rijksmuseum and the Iconclass classification system

Page 41: Day 2, workshop 3
Page 42: Day 2, workshop 3

Several unsolved problems

• Adding new terms is time consuming: one full month a year of adding thesaurus terms instead of cataloguing objects
• The external thesauri/classification systems are not integrated in Adlib: e.g. IconClass, AAT
• Generic terms are required for subject annotations, but there is no integrated source
• Maintaining a thesaurus is:
  work for specialists
  time consuming
  difficult to achieve and keep at a high quality

Page 43: Day 2, workshop 3

Page 44: Day 2, workshop 3

Page 45: Day 2, workshop 3

http://e-culture.multimedian.nl/pk/annotate?uri=RP-P-OB-77.320

Page 46: Day 2, workshop 3

Advantages

annotation:
• larger coverage
• multiple perspectives
• maintenance by third party

search:
• multilingual (IconClass, AAT, WordNet)
• alternative spellings (ULAN, TGN)
• synonyms (WordNet, AAT, TGN)
• nicknames (ULAN)
• multiple perspectives
• relations between thesaurus terms
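The search advantages listed above come from matching a query against all labels of a term, not only the preferred one. The following is a rough sketch in Python and not the Rijksmuseum annotation tool itself. The thesaurus entries, the example.org URI and the function name suggest are assumptions made for this illustration.

```python
# A tiny SKOS-like thesaurus: one preferred label plus alternative labels.
thesaurus = {
    "http://example.org/person/hugo-de-groot": {  # hypothetical URI
        "pref": "Hugo de Groot",
        "alt": ["Hugo Grotius", "Grotius"],
    },
    "http://www.example.com/concepts#shrubs": {
        "pref": "shrubs",
        "alt": ["bushes"],
    },
}


def suggest(query):
    """Return (URI, preferred label) for every term with a matching label."""
    needle = query.casefold()
    hits = []
    for uri, labels in thesaurus.items():
        all_labels = [labels["pref"], *labels["alt"]]
        if any(needle in label.casefold() for label in all_labels):
            hits.append((uri, labels["pref"]))
    return hits


print(suggest("grotius"))
print(suggest("bush"))
```

A cataloguer who types an alternative spelling or a synonym still ends up annotating with the same URI, so later searches find the object under any of its labels.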

Page 47: Day 2, workshop 3

Requirements

standard representation (SKOS) enables integration

extensive configuration of search algorithm and interface design

collaboration between museum experts and developers

Page 48: Day 2, workshop 3

an interface to integrated collections | Sterna

An interface to integrated collections

Sterna Birdwatchers interface

http://science.naturalis.nl/collections/sterna-birdwatchers-portal

Page 49: Day 2, workshop 3

organizational aspects | Rijksmuseum

Organizational aspects

Rijksmuseum

Page 50: Day 2, workshop 3

Collaboration between Rijksmuseum and Multimedian

Page 51: Day 2, workshop 3

The Rijksmuseum

• has opportunities for experimentation
• provides innovations for digitization of cultural heritage

why the Print Room Online?

• specific scope for subject matter annotation
• strict guidelines and procedures
• necessity for new data sources/thesauri

Page 52: Day 2, workshop 3

Page 53: Day 2, workshop 3

organizational aspects | Naturalis

Organizational aspects

Naturalis

Page 54: Day 2, workshop 3

Organizational aspects

Naturalis

create a peer-to-peer network of content providers based on a common knowledge domain

bigger organisations usually have more facilities and/or resources and should support smaller partner institutions in the network

a suitable technical infrastructure is necessary to support the realization of such a network

the Sterna project is part of the iterative process of realizing this ambition

Page 55: Day 2, workshop 3

organizational aspects | Trezorix

Organizational aspects

Trezorix

Page 56: Day 2, workshop 3

Organizational aspects

Trezorix

Trezorix works with clients who, as knowledge distributors and networking organizations, have a strong ambition to be innovators themselves

Trezorix works closely with academic communities, exchanging innovative ideas and extensive practical experience in building semantic networks

Page 57: Day 2, workshop 3

Our Conclusions

lessons learned

Page 58: Day 2, workshop 3

Our Conclusions

• implementation of semantic networks is an undertaking that takes a lot of effort and energy from cultural heritage institutions:

• the institutions will have to deal with complex questions about content, organization and technical solutions

• there is a strong need for supporting services: should all other tools in the tool chain be adapted?

• it requires dialogue and collaboration with other institutions and organizations about the organizational aspects

Page 59: Day 2, workshop 3

Lessons learned

Page 60: Day 2, workshop 3

Lessons learned

• there is a tendency for cultural heritage institutions to rely too much on the "beneficial" effects and possibilities of technology

• all parties have different goals; define these goals from the beginning

• create commitment by involving all kinds of people (technical staff, curators, cataloguers, users of the database)

• make the collaboration official and make sure there’s time in your organization for innovative projects that won’t immediately result in ready-made products

Page 61: Day 2, workshop 3

Group discussion

Page 62: Day 2, workshop 3

Group discussion

Do you recognize the situations demonstrated in these three cases?

How are these situations dealt with in your institution?

Do you think semantic technology can provide solutions for your institution as well?

Do you think universities, cultural heritage institutions and developers should work more closely together in innovative projects, and how should this be organized?