Tutorial on Semantic Digital Libraries (ESWC'2007)

Post on 06-May-2015

11.093 views 1 download

Tags:

description

Full stack of slides from our tutorial on semantic digital libraries from ESWC'2007

Transcript of Tutorial on Semantic Digital Libraries (ESWC'2007)

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [1]

Tutorial – Semantic Digital Libraries

- Introduction -

Sebastian R. Kruk, Bernhard Haslhofer,

Philipp Nußbaumer, Sandy Payette, Tomasz Woroniecki

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [2]

Tutorial overview

• Who we are– Sebastian R. Kruk, DERI Galway – Ireland

– Bernhard Haslhofer, University of Vienna - Austria

– Phillip Nußbaumer, Research Studios - Austria

– Sandy Payette, Cornell University – USA

– Tomasz Woroniecki, DERI Galway – Ireland

• Today we want to– give you a brief introduction to the Semantic Web, and show how SW is related to

digital libraries

– present existing semantic digital library systems

– discuss the current problems and future directions of semantic digital libraries and get feedback from you

• After this tutorial you will know– what is the semantic digital library system

– existing solutions in various degrees of detail

– how to run semantic digital library solutionson your machine

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [3]

Tutorial Schedule

Time

9:00 - 9:45 Introduction to Semantic Digital Libraries

9:45 – 10:30 Existing solutions - JeromeDL

10:30 – 11:00 Coffee break

11:00 - 12:15 Existing Semantic Digital Libraries solutions

12:15 – 12:30 Comparison and the future of SemDL

12:30 – 14:00 Lunch break

14:00 – 15:30 Hands-on session (part I)

15:30 – 16:00 Coffee break

16:00 – 16:45 Hands-on session (part II)

16:45 - 17:30 Conclusions, discussion

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [4]

Outline

Introduction to Semantic Web

Semantic Digital Libraries

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [5]

The Semantic Web – A Brief Introduction

• Current Web vs. Semantic Web?– An extension of the current Web in which information is given well-defined

meaning, better enabling computers and people to work in cooperation. [Tim Berners-Lee]

– Current Web was designed for humans, and there is little information usable for machines

• Was the Web meant to be more?– Objects with well defined attributes as opposed to untyped hyperlinks

between Internet resources

– A network of relationships amongst named objects, yielding unified information management tasks

• What do you mean by “Semantic”?– the semantics of something is the meaning of something

– Semantic Web is able to describe things in a way that computers can understand

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [6]

The Semantic Web – A Brief Introduction

YouAre

Here!

Where are we in the “Semantic Web layer cake”?

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [7]

The Semantic Web – A Brief Introduction

The challenge for the Semantic Web– The Semantic Web can’t work all by itself– For example, it is not very likely that you will be able to sell

your car just by putting your RDF file on the Web– Need society-scale applications: Semantic Web agents and/or

services, consumers and processors for semantic data, more advanced collaborative applications

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [8]

The Semantic Web – What is RDF ?

Describing things on the Semantic Web– RDF (Resource Description Framework)

• a data format for describing information and resources,

• the fundamental data model for the Semantic Web

– Using RDF, we can describe relationships between things like:

• A is a part of B or

• Y is a member of Z

• and their properties (size, weight, age, price…) in a machine-understandable format where each thing has a

– RDF graph-based model delivers straightforward machine processing

– Putting information into RDF files makes it possible for “scutters” or RDF crawlers to search, discover, pick up, collect, analyse and process information from the Web

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [9]

The Semantic Web – What is RDF ?

A simple RDF example– Statement:

“Stefan Decker is the creator of the resource (web page) http://www.stefandecker.org”

– Structure:

Resource (subject) http://www.stefandecker.org

Property (predicate) http://purl.org/dc/elements/1.1/creator

Value (object) “Stefan Decker”– Directed graph:

http://www.stefandecker.orgdc:creator Stefan Decker

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [10]

The Semantic Web – How RDF can help us?

How RDF can help us?• identify objects

• establish relationships

• express a new relationship just add a new RDF statement

• integrate information from different sources copy all the RDF data together

• RDF allows many points of view

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [11]

• What is an Ontology?„An ontology is a specification of a conceptualization.“

Tom Gruber, 1993

• Ontologies are social contracts– Agreed, explicit semantics

– Understandable to outsiders

– (Often) derived in a community process

• Ontology markup and representation languages:– RDF and RDF Schema

– OWL

– Other: DAML+OIL, EER, UML, Topic Maps, MOF, XML Schemas

The Semantic Web – Ontologies and Schemata

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [12]

• Defines small vocabulary for RDF: – Class, subClassOf, type

– Property, subPropertyOf

– domain, range

• Vocabulary can be used to define other vocabularies for your application domain

The Semantic Web – RDF Schema

Person

Student Researcher

subClassOfsubClassOf

Jeen

type

hasSuperVisordomain range

Frank

type

hasSuperVisor

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [13]

OWL – The Web Ontology Language

Owl took Christopher Robin’s notice from Rabbit and looked at it nervously. He could spell his own name WOL, and he could spell Tuesday so that you knew it wasn’t Wednesday, and he could read quite comfortably when you weren’t looking over his shoulder and saying "Well?" all the time...

• provides a vocabulary for defining classes, their properties and their relationships among classes.

The Semantic Web – OWL

owl:disjointWith

s s

ss

Animal

Herbivore

Carnivore

Omnivore

• Based on Description Logics

• OWL is a W3C Recommendation

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [14]

The Semantic Web – Applications

• Semantic Web cannot be and is not only a set of recommendations

• Semantic Web is becoming reality by applications that support it and are based on it

• Enabling technologies:– RDF Storages: Sesame, Jena, YARS– Reasoners: KAON, Racer – Editors: Protege, SWOOP, MarcOnt Portal

• End-User applications:– Semantic wikis: Makna, SemperWiki– Semantic blogs– Semantic digital libraries

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [15]

Outline

Introduction to Semantic Web

Semantic Digital Libraries

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [16]

What is a Semantic Digital Library?

Semantic digital libraries– integrate information based on different

metadata, e.g.: resources, user profiles, bookmarks, taxonomies – high quality semantics = highly and meaningfully connected information

– provide interoperability with other systems (not only digital libraries) on either metadata or communication level or both – RDF as common denominator between digital libraries and other services

– delivering more robust, user friendly and adaptable search and browsing interfaces empowered by semantics

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [17]

Old days of hard-copy books

• Library:– Archive (storage space)

– Bibliographic cards (metadata)

– Librarian (interface)

• Pros:– Someone to talk to, to understand us, to explain, help in searching

• Cons:– Based on physical location

– Libraries are not connected – we have to visit every place

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [18]

Yesterday of digital books

• Digital library– Database and archive (storage)

– Digital bibliographic descriptions (metadata)

– Full-text search (interface)

• Pros:– Content accessible online

– Federations of libraries – visit less places

• Cons:– Lonely user - no one to talk to, we need to find the right keywords,

what if we do not know them (“man without an ear” paintings example)

– Still many problems with interconnecting (different) libraries

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [19]

Today of interconnected content

• Semantic Digital Libraries– Database and archive (storage)

– Semantic bibliographic description (interconnected metadata)

– Search and browsing on ontologies (interface)

• Pros:– Search and browsing based on semantics can help in substituting

the librarian

– It is easier to interconnect heterogeneous libraries (RDF as common denominator)

• Cons:– Semantics created from legacy formats – still hard to capture by

most of average users

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [20]

Tomorrow of social media

• Social Semantic Digital Libraries– Database and archive (storage)

– Bibliographic descriptions with annotations provided by users (metadata)

– Collaborative search and browsing (interface)

• Pros:– Users contribute to the classification process

– Users can understand community driven annotations

– Users enhance digital content using blogs, wikis on the side

• Cons:– Not everyone is convinced

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [21]

How are Semantic Digital Libraries different?

Semantic digital libraries extend digital libraries by– describing and exposing its resources in a machine

‘understandable’ way– resources can be

• contents, digital artefacts• organization of objects (e.g. collections)• users, user communities• controlled vocabularies, thesauri,

taxonomies– expose the semantics of their metadata

in terms of an ontology• defined using a formal language

– deliver mediation services for communication with other systems

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [22]

Semantic Web Technologies for Digital Libraries?

Metadata is the key concept• the Web does not have metadata

– the idea of a Semantic Web is nice but difficult to implement

• many digital libraries do have metadata in place• we simply must make them available in a machine

understandable format• the Semantic Web provides the format: RDF

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [23]

Semantic Web Technologies for Digital Libraries?

Knowledge in bibliographic records• Digital Libraries already have controlled vocabularies,

taxonomies or even ontologies in place • the challenge is to model this knowledge in a machine

understandable way• the Semantic Web provides ontology languages:

– RDF Schema– OWL– SKOS

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [24]

A Sample Bibliographic Record

Vincent van Gogh; painter: Gogh, Vincent van (Dutch painter, 1853-1890)

Creation-Creator/Role

J. Paul Getty MuseumCurrent Location-Repository Name

irises, nature, soil, etc.Subject-Matter

1889, earliest: 1889, latest: 1889Creation-Date

IrisesTitle

paintingsObject/Work type

PaintingsClassification

Copyright 2000 The J. Paul Getty Trust &

College Art Association, Inc.

Terms taken from Controlled Vocabularies

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [25]

Knowledge Organization Systems

• tools that present the organized interpretation of knowledge structures

• semantic tools - meaning of words and other symbols as well as

(semantic) relations between symbols and concept

• organize information and promote knowledge management

• Examples:

– classification and categorization schemata (organize materials at a general

level)

– subject headings (provide more detailed access)

– authority files (control variant versions of key information such as

geographic names and personal names)

– highly structured vocabularies, such as thesauri

– traditional schemes, such as semantic networks and ontologies

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [26]

Taxonomy of Knowledge Organization Systems

• Term Lists

– Authority files (FOAF)

– Glossaries

– Dictionaries

– Gazetteers

• Classifications and Categories (DMoz)

– Subject headings

– Classification schemes

– Taxonomies

– Categorization Schemes.

• Relationship Lists

– Thesauri (WordNet, MeSH)

– Semantic networks

– Ontologies (Hodge, 2000)

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [27]

Understanding Knowledge Organization Systems

• controlled vocabulary - a list of terms that have been enumerated explicitly

• taxonomy - a collection of controlled vocabulary terms organized into a

hierarchical structure.

• formal ontology – a controlled vocabulary expressed in an ontology

representation language. This language has a grammar for using vocabulary

terms to express something meaningful within a specified domain of interest.

• meta-model - an explicit model of the constructs and rules needed to build

specific models within a domain of interest. A valid meta-model is an ontology,

but not all ontologies are modeled explicitly as meta-models.

– as a set of building blocks and rules used to build models

– as a model of a domain of interest, and

– as an instance of another model.

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [28]

Simple Knowledge Organization Systems (SKOS)

• basic structure and content of concept schemes such as

– thesauri,

– classification schemes,

– subject heading lists,

– taxonomies,

– 'folksonomies',

– other types of controlled vocabulary

• core concepts:

– narrower and broader

– isSubjectOf and subject; isPrimarySubjectOf and primarySubject

– member and Collection; memberList and OrderedCollection

– related and semanticRelation

– note, definition; altLabel and prefLabel; symbol and altSymbol

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [29]

Benefits of Semantic Digital Libraries

Problems of today’s libraries • rapidly growing islands of highly organized information

– How to find things in a growing information space?

• is it enough to have a full-text index (à la Google)?

• typical “end-users” versus “expert users”

• converging digital library systems– e.g. uniform access to Europe’s digital libraries and cultural heritage

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [30]

Benefits of Semantic Digital Libraries

The two main benefits of Semantic Digital Libraries• new search paradigms for the information space

– Ontology-based search / facet search– Community-enabled browsing

• providing interoperability on the data level– integrating metadata from various heterogeneous sources– Interconnecting different digital library systems

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [31]

Searching the Sample Bibliographic Record

Vincent van Gogh; painter: Gogh, Vincent van (Dutch painter, 1853-1890)

Creation-Creator/Role

J. Paul Getty MuseumCurrent Location-Repository Name

irises, nature, soil, etc.Subject-Matter

1889, earliest: 1889, latest: 1889

Creation-Date

IrisesTitle

paintingsObject/Work type

PaintingsClassification

Copyright 2000 The J. Paul Getty Trust &

College Art Association, Inc.

• Full-text search– “Paintings” AND “Van Gogh”

AND “flowers” no result

• Semantic query– if the knowledge that “irises”

are “flowers” is modeled in an ontology (e.g. subclass-hierarchy)

– we can query for all “Paintings” by “Van Gogh” with subject “flowers” and retrieve also the picture with subject “irises”

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [32]

Semantic Digital Libraries and Existing DL Systems

• how to handle the legacy (meta-)data problem • lifting existing (meta-)data to a semantic level

– simple solutions like MARC21 DublinCore– complex ontologies like MarcOnt Ontology for capturing concepts

from different standards

• legacy libraries expose their metadata via well established protocols - the metadata can be imported into semantic DLs

• semantic DLs can play a role of integration champions in the information retrieval process in heterogeneous networks:– OAI-PMH– Z39.50– Dienst

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [33]

Application Areas for Semantic Web Technologies

• Thesauri & Controlled Vocabularies– qualified DublinCore

– DMoz, DDC-based taxonomies

– SKOS, WordNet and other thesauri

• Schema Mappings / Crosswalks– MarcOnt Ontology – aims to cover concepts from MARC21, BibTeX and

DublinCore

– MarcOnt Mediation Services – an open mediation framework between common legacy metadata standards

• Metadata Integration– RDF as a common data model for integrating metadata from various

autonomous and heterogeneous data sources

– OWL for modeling the data source’s semantics

– SPARQL as a common query language

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [34]

Semantic DL as Evolving Knowledge Space

• In state-of-the-art digital libraries users are consumers– Retrieve contents based on available bibliographic records

• Recent trends: user communities– Connetea– Flickr

• In Semantic digital libraries users are contributers as well– Tagging (Web 2.0)– Social Semantic Collaborative Filtering– Annotations

• Semantic Digital libraries enforce the transition from a static information to a dynamic (collaborative) knowledge space

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [35]

Existing Semantic Digital Library Systems

• JeromeDL– a social semantic digital library makes use of Semantic Web

and Social Networking technologies to enhance both interoperability and usability

• BRICKS– aims at establishing the organizational and technological

foundations for a digital library network in order to share knowledge and resources in the cultural heritage domain.

• FEDORA– delivers flexible service-oriented architecture to managing

and delivering content in the form of digital objects

• SIMILE– extends and laverages DSpace, seeking to enhance

interoperability among digital assets, schemata, metadata, and services

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [36]

Tutorial – Semantic Digital Libraries

- Existing Semantic Digital Libraries Solutions -

Sebastian R. Kruk, Bernhard Haslhofer,

Philipp Nußbaumer, Sandy Payette, Tomasz Woroniecki

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [37]

Existing Semantic Digital Library Systems

• JeromeDL– a social semantic digital library makes use of Semantic Web

and Social Networking technologies to enhance both interoperability and usability

• BRICKS– aims at establishing the organizational and technological

foundations for a digital library network in order to share knowledge and resources in the cultural heritage domain.

• FEDORA– delivers flexible service-oriented architecture to managing

and delivering content in the form of digital objects

• SIMILE– extends and laverages DSpace, seeking to enhance

interoperability among digital assets, schemata, metadata, and services

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [38]

Tutorial 7 – Semantic Digital Libraries

- Existing Semantic Digital Libraries Solutions –

JeromeDL

Sebastian R. Kruk, Tomasz Woroniecki

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [39]

Outline

JeromeDL - Motivation and Overview

JeromeDL - Architecture and Ontologies

JeromeDL - Semantic Services

JeromeDL - Social Services

JeromeDL - Semantics in Use

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [40]

JeromeDL - Introduction

• Joint effort of DERI, National University of Ireland, Galway

and Gdansk University of Technology (GUT)

• Distributed under BSD Open Source license

• Digital library build on semantic web technologies to answer

requirements from: librarians, scientists and everyone.

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [41]

Motivation

• How to integrate and search information from different bibliographic sources?

• How to share and interconnect knowledge among people?

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [42]

JeromeDL – Motivations Use Cases

• Librarians:– support for rich metadata (MARC21) in uploading resources,

accessing bibliographic information and searching

– persistent identifiers

• Scientists: – easy publishing (designed as a institute/university digital library)

– creating hierarchical networks of digital libraries

– support for accessing, sharing and searching using bibliography

metadata (BibTeX)

• Everyone:– simple search (incl. natural language queries)

– community-aware information sharing and browsing,

– support for interationalization

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [43]

JeromeDL - Motivations

• Support for different kinds of bibliographic medatata, like:

DublinCore, BibTeX and MARC21 at the same time.

– Making use of existing rich sources of bibliographic descriptions (like

MARC21) created by human.

• Supporting users and communities:

– users have control over their profile information;

– community-aware profiles are integrated with bibliographic

descriptions

– support for community generated knowledge

• Delivering communication between instances:

– P2P mode for searching and users authentication

– Hierarchical mode for browsing

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [44]

Outline

JeromeDL - Motivation and Overview

JeromeDL - Architecture and Ontologies

JeromeDL - Semantic Services

JeromeDL - Social Services

JeromeDL - Semantics in Use

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [45]

JeromeDL – Architecture

• Resources and

annotations repository

• Middleware:– query processing

– community space

– resources management

• User interface agents:

• Communication to the

outside world

• Administrative interface

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [46]

Bibliographic Description in JeromeDL

<?xml version="1.0" encoding="UTF-8" ?><rdf:Description rdf:about="http://...id=828374765"><dc:title>JeromeDL - Adding Semantic Web Technologies to DLs</dc:title><dc:creator>Sebastian Kruk</dc:creator><dc:description>In recent years...</dc:description></rdf:Description>

01450cas 922004331i 450000100...019c19329999gw qr|p| ||||0 |0ger | a0044-2992 9a200412140219bVLOADc200404071525dvkulc200310071018dvbjc200303101205dkopumky200209211341zVLOAD aGD U/MPcGD U/MPdGD U/MFdGD U/KKsdWR O/EJ0 ager1 aZ. Kunstgesch. 0aZeitschrift für Kunstgeschichte00aZeitschrift für Kunstgeschichte.18aZfK aMünchen ;aBerlin :bDeutscher Kunstverlag,c1932-. c26-29 cm. aKwart.0 a1 Bd. (Juni 1932)-. aOpis na podst.: LCC. aW 1932 założycielami czasopisma byli Wilhelm Waetzoldt i Ernst Gall....

These all can be represented in RDFThese all can be represented in RDF@InProceedings { jeromedexa2005, author = "Sebastian Ryszard Kruk and ... ", title = "{JeromeDL - Adding Semantic ...}", booktitle = "{In Proceedings to DEXA 2005}", year = 2005}

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [47]

Structure ontology in JeromeDL

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [48]

Bibliographic (MarcOnt) Ontology in JeromeDL

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [49]

Community-aware (FOAFRealm) ontology

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [50]

Ontologies in JeromeDL

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [51]

Metadata and Services in JeromeDL

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [52]

Outline

JeromeDL - Motivation and Overview

JeromeDL - Architecture and Ontologies

JeromeDL - Semantic Services

JeromeDL - Social Services

JeromeDL - Semantics in Use

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [53]

Semantic Metadata and Services

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [54]

MarcOnt Initiative – Overview

Motivation:

• Provide set of tools for

collaborative ontology

development

MarcOnt Initiative goals:

• Create a framework for collaborative ontology improvement (E-learning)

• Provide domain experts with tools to share their knowledge

• Offer tools for data mediation between different data formats

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [55]

MarcOnt Portal and MarcOnt Ontology

Sugested Poposals

Initial Ontology

Proposal discussion

Proposal anotations

Proposal votingProposal autopromoting

Versioning

Next RevisionMarcOnt Portal

MarcOnt Ontology:

Central point of MarcOnt Initiative

Translation and mediation format

Continuos collaborative ontology

improvement

Knowledge from the domain experts

MarcOnt Portal (source of

knowledge):

• Suggestions

• Annotations

• Versioning

• Ontology editor

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [56]

MarcOnt Mediation Services for Legacy Metadata

MarcOnt OntologyMarcOnt RDF

MARC21 RDF

MARC21 XML

MARC21

Dublin Core RDF

Dublin Core XML

Dublin Core

New format RDF

New format XML

New format

Format translation

RDF Translator

Format co-operation

MarcOnt Mediation Services

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [57]

Browsing the data graph – why?

• The search does not end on a (long) list of results

• The results are not a list (!) but a graph

• „Lost in hyperspace”

• A need for unified UI and services for filter/narrow and

browse/expand services

• Share browsing experience – navigate collaboratively

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [58]

Browsing the data graph – how?

• Defines REST access to services and their composition

• Basic services: access, search, filter, similar, browse,

combine

• Meta services: RDF serialization, subscription channels,

service ID generation,

• Context services: manage contexts, manage service

calls/compositions in the context, lists contexts

• Statistics services: properties, values,

tokens

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [59]

Browsing the data graph

• JeromeDL exploits interconnected data

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [60]

Browsing the data graph

• … to allow browsing

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [61]

Outline

JeromeDL - Motivation and Overview

JeromeDL - Architecture and Ontologies

JeromeDL - Semantic Services

JeromeDL - Social Services

JeromeDL - Semantics in Use

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [62]

Semantic Metadata and Services

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [63]

Social Services in JeromeDL

• Involve users into sharing knowledge

– Blogs – comments and discussions about documents and

resources

– Tagging – collaborative classification

– Wikis – collaboratively edited additional descriptions, such as

summaries and interesting facts

• Preserve knowledge for future use

– Users can learn from experience of others instantly

– Recommend new, interesting resources based on users’ profiles

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [64]

FOAF - Describing Social Networks

• FOAF - Stands for Friend-of-a-Friend

• Defines properties for a person (but it does not have to be a

person, can be an “agent”)

• Does not only have to contain one person per file

• Can build a network of people with foaf:knows links

• FOAF can be easily extended to meet requirements, as in

the case of FOAFRealm for identity management…

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [65]

Identity management with FOAFRealm

• Identity defined with extended FOAF

metadata

• Policies expressed by social networking – Distance between owner and requester

– Friendship level between owner and requester,

calculated across digraph of social network

• Support for single registration and sign on

• Distributed identity management with

HyperCuP (“D-FOAF”)• FOAFRealm is currently implemented as a plugin for Tomcat

(Realm/Valve implementation), with PHP and .NET versions

coming soon

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [66]

Social Semantic Collaborative Filtering

• Why?– The bottom-line of acquiring knowledge: informal communication

(“word of mouth”)

• How?– Everyone classifies (filters) the information in bookmark folders (user-

oriented taxonomy)

– Peers share (collaborate over) the information (community-driven taxonomy)

• Result?– Knowledge “flows“ from the expert

through the social network to the user

– System amass a lot of information on user/community profile (context)

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [67]

Social Semantic Collaborative Filtering

• Problems?– The horizon of a social network (2-3 degrees of separation)

– How to handle fine-grained information (blogs, wikis, etc.)

• Solutions? – Inference engine to suggest knowledge from the outskirts of the

social network

– Support for SIOC metadata:

• SIOC browser in SSCF

• Annotations and evaluations of “local” resources

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [68]

What is Social Semantic Collaborative Filtering?

• Goal: to enhance individual bookmarks with shared knowledge within a community

• Users annotate catalogues of bookmarks with semantic information taken from DMoz or WordNet vocabularies

• Catalogs can include (transclusion) friend's catalogues

• Access to catalogues can be restricted with social networking-based polices

• SSCF delivers:– Community-oriented, semantically-rich taxonomies

– Information about a user's interest

– Flows of expertise from the domain expert

– Recommendations based on users previous actions

– Support for SIOC metadata

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [69]

foaf:knows

xfoaf:include

xfoaf:bookmark

Social Semantic Collaborative Filtering

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [70]

Social Networks in Digital Libraries

Resource

xfoaf:Annotation

user_C

creator_B

foaf:knows

marcont:hasCreator

creator_A

foaf:knows

foaf:knows

xfoaf:Directory

user_D

xfoaf:owns

xfoaf:linksTo

xfoaf:isIn

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [71]

Support for

online communities

in SSCF

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [72]

Support for

online communities

in SSCF

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [73]

Outline

JeromeDL - Motivation and Overview

JeromeDL - Architecture and Ontologies

JeromeDL - Semantic Services

JeromeDL - Social Services

JeromeDL - Semantics in Use

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [74]

JeromeDL – Delivering Semantic Content

• Providing semantic annotations during uploading process:

– open module for handling any taxonomies

– keywords based on WordNet and free tagging

– defining structure of resources in the JeromeDL ontology

• Lifting legacy metadata to MarcOnt ontology

• Community maintained annotations

– social semantic collaborative filtering

– semantic descriptions based on the FOAF metadata

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [75]

Annotating Library Resources

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [76]

JeromeDL – Semantic Information In Use

• Searching:– Keyword-based search with semantic query expansion

– Semantic search:

• Direct RDF quering

• Natural language templates

• Browsing– Exibit

– MultiBeeBrowse

• Sharing:– Social Semantic Collaborative Filtering

– Semantically Interlinked Online Communities

• Heterogeneous communication:– Bibster, A9, OAI-PMH

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [77]

Exposing Semantic Annotations

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [78]

Filtering Resources in JeromeDL

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [79]

Sharing Knowledge with SSCF

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [80]

Information Retrieval in JeromeDL

Fulltext Index

StructureRepository

MarcOntRepository

Resources’Content

FOAFRealmRepository

(typed)keywords

RDF & NLQuery

OpenSearchRSS

collaborativefiltering

localinterface

distributedinterface

types translation

semantic queryexpansion

RDF Repositories Secure Snapshot

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [81]

Networks of Digital Libraries

• ELP (Extensible Library Protocol) implementation

– communication within JeromeDL network

– adapters for communication with other networks

• D-FOAF integration (distributed user profile management)

– single sign on and single registration within D-FOAF network

• HyperCuP integration (scalable P2P network)

• Independent ELP network entry point:

http://search.jeromedl.org/

0 0

11

0

0

11

0

2 2

22

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [82]

Tutorial – Semantic Digital Libraries

- Existing Semantic Digital Libraries Solutions –

BRICKS

Bernhard Haslhofer

University of Vienna

Austria

Philipp Nußbaumer

Research Studios

Austria

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [83]

Outline

BRICKS Overview

BRICKS Components

BRICKS Applications

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [84]

What is BRICKS?

• A software infrastructure for building digital library networks

– Transparent access to distributed resources

– Multilinguality– Easy installation & maintainance

• A set of end-user applications– Network & content management– Web 2.0 tagging/annotations– Domain specific applications

• A business model– Open source, platform independent– Low cost infrastructure– User communities sustainability

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [85]

BRICKS Architecture

• A decentralized P2P network– Avoid central coordination

– Highly Scalable, increased reliability

– Minimized maintainance costs

• Each P2P Node is a set of SOA components– Web Service interface

– Platform independent

– Flexible composition

• Components for– Storing, accessing and protecting digital objects

– (Semantic) search & browsing

– P2P commmunication

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [86]

Workstation

Workstation

Workstation

Workstation

BNodeEMF

User

User

User

User

Workstation

Workstation

Workstation

Workstation

BNodeAustrian Library

User

User

User

User

BNodeStudio Azzuro

WorkstationUser

WorkstationUser

FhG IPSIRequest

RequestRequest

Request

Req

uest

Request

Accessing Data

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [87]

A Look into a BNode

BNode{

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [88]

Outline

BRICKS Overview

BRICKS Components

BRICKS Applications

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [89]

Collection Manager

• Single access point for all content and metadata related operations (local and remote)

• Physical Collection– Similar to folder/directory hierarchy in a file system

– Bound to a single BNode

– Each digital content object belongs to exactly one collection

• Logical Collection– Virtual folder for organizing content items independent of their physical location

– Links to content items from various physical collections on different BNodes

– A content item might belong to many of them

• Stored Query similar to database views

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [90]

Content Manager

• Two ways to handle content in BRICKS

– Stored locally at site of a member party, accessed via URL

– Stored within BRICKS

• Based on Java Content Repository (JCR)

• Provides a meta-content model

– Re-use of existing content models

– Use standard models

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [91]

Metadata Manager

• Metadata descriptions RDF– Suitable for any application scenario– Express relationships between

objects– React to changes without changing

the model

• Schema defintions OWL– No fixed schema– Extensible (e.g. Application profiles)– Semantic concepts instead of

schematic strucutures

• SPARQL– Metadata queries over ontology

concepts– Queries for graph patterns

System Core

Storage (Jena)

Data Transformation

Schema Manager

Metadata Manager

Metadata Record

Metadata Record

Val

idat

or

RDF/XML

Web Service

API (WSDL)

dc.xsl vra.xsl xyz.xsl

XML

OAI-PMH Server

OAI-PMH Harvester

XML

DCVRA xyz

mapping mapping

RDB File-DB

Query Adapter

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [92]

Security Manager

• Transparently invoked by the Framework– any service call is checked

• Context-aware policies based on RBAC (via XACML rules)– supporting Roles, Groups, at DLObject level

• Permission declaration through Javadoc @tags

• Federated identity is managed through an adapted version of OpenSAML

• Reputation-based Trust calculation integrated

• Web-based GUI for security configuration

92

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [93]

Digital Rights Management

• DRM Component– Support for licenses based on MPEG-21 REL license declaration standard

– Generic API for the integration of commercial DRM systems

• Watermarking– Open-source watermarking tool for images

– Other tools can be integrated

• BRICKS Store web application for commercial content

• Creative Commons support for other content in BRICKS

93

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [94]

Outline

BRICKS Overview

BRICKS Components

BRICKS Applications

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [95]

Application: BRICKS Workspace

• What does it demonstrate?– A web application (thin client) accessing BRICKS Foundation services– Web 2.0 image

annotations– Reference application

• Primary customers– General end-users (citizens)– Application developers

• Technology– Struts based interface to

the BCH

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [96]

Application: BRICKS Desktop

• What does it demonstrate?– A rich client application accessing BRICKS foundation services– Direct access to the BCHN

• Primary customers– Expert end-users

(researchers, educators)– Application developers

• Technology– Eclipse based rich

client interface

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [97]

Application: Annotation Tool

• What does it demonstrate?– Tool which allows end-users to annotate images– Creation of annotation threads– Supervised Annotations

• Primary customers– End-users– Institutions with large

image collections

• Technology– Web Application

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [98]

Application: Online Exhibition Authoring Tool

• What does it demonstrate?– Creating and publishing online exhibitions

using contents that is availablein the BRICKS network

• Primary customers?– Expert end-users

(curators)

• Technology– Web Application

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [99]

Application: Archeological Finds Identifier

• What does it demonstrate?– A web application for comparing findings (e.g. ancient coins) with objects in

reference collections – Application of complex

domain ontology (CIDOC-CRM)– Map visualization of GIS-Metadata

• Primary customers?– Museum curators, archaeologists,

students, amateurs,

• Technology– Struts based interface

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [100]

References

• BRICKS Community Web Site– http://www.brickscommunity.org/

– Main Contact: silvia.boi@metaware.it

• Related (de-facto) standards– Resource Description Framework

(RDF)• http://www.w3.org/TR/rdf-primer/

– OWL Web Ontology Language (OWL)

• http://www.w3.org/TR/owl-guide/

– SPARQL• http://www.w3.org/TR/rdf-sparql-query/

– Java Content Repository (JCR)• http://www.jcp.org/en/jsr/detail?id=170

• Tools and Libraries– Jackrabbit

• http://jackrabbit.apache.org/– Jena Semantic Web

Framework• http://jena.sourceforge.net/

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [101]

Tutorial – Semantic Digital Libraries

- Existing Semantic Digital Libraries Solutions –

Fedora

Sandy Payette

Director, Fedora Project

Cornell University

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [102]

Outline

Fedora

Examples: PLoS ONE and National Science Digital Library

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [103]

Scholarly and Scientific Workbenches

“Web 2.0” Collaborative Repositories Museum Exhibits with Lesson Plans

Fedora Semantic Digital Libraries enable …

DataData

Annotation

Article

Linking Data and Publications

blog and wiki

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [104]

The Fedora Project

• Fedora– Flexible– Extensible – Digital – Object– Repository– Architecture

• History– Cornell Research (1997-2002)

– DARPA and NSF-funded research and reference implementations– Distributed, Interoperable Repositories (experiments with CNRI)

– Open Source Project (2002-present)– Andrew W. Mellon Foundation (2002-2009)– Joint development by Cornell University and University of Virginia– Transitioning into non-profit organization (Fedora Commons 501c3)

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [105]

Arts and Humanities

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [106]

Sciences

Education

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [107]

Fedora - Technology Integration

Semantic

Repository

Enterprise Preservation

• Information Networks• Contextualization• Relationships• Query• Inference

• Workflow• Messaging• Transactions• Replication

• Digital Objects• Manage• Access• Versioning• Storage

• Integrity Check• Monitoring• Alerting• Migration

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [108]

Fedora Digital Objects• Flexible object model can support

– Documents, articles, journals

– Electronic Scholarly Texts

– Digital Images

– Complex multimedia publications

– Datasets

– Metadata

– Learning objects

– More…

• Create “networks” of objects using RDF

– Define object relationships and other properties via RDF

– Collection/member; part/whole; etc.

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [109]

RDF in the Fedora Digital Object Model

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [110]

Motivations: Fedora and Semantic Technologies

• A natural model for exposing repository as network of objects– Object-to-object relationships– Relationships to external entities– Query the graph; traversal to discover related stuff

• Indexing based on generalizable data model– Graph-based data model is a common reduction– Avoid fixed schema problems and metadata mud wrestling

• Extensible enrichment of object descriptions– Keep overlaying statements from multiple ontologies– Organic evolution

• Powerful queries and inference for repository management– Transitive relationships among objects– Dependency analysis; – Detection/Extraction of sub-graphs– Provenance of disseminations

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [111]

Digital Objects contain their RDF assertions

• Assert relationships from Fedora base ontology– Collection – member– Whole – part– Equivalence– Description Of– More…

– Assert relationships/properties from community ontologies– isAnnotationOf– isRecommendedBy– isCertifiedBy– More ….

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [112]

Example:Digital Objects with “compositional semantics”

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [113]

Use Case: scholarly objects and annotation in the humanities

S erv ice

hasPartD iagram

hasP artL ette rannotationOf

P ID -11P ID -3

P ID -1P ID -10

providesC ontex t

P ID -2

am azo n e-co m m erce

musuem and library objects

commercial web content

scholarly objects

URI-100

xx:recommends

URI-55 yy:certifies

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [114]

3 Objects – 3 RDF “Relationships” Datastreams

<rdf:Description rdf:about="info:fedora/uva:pid-11> <ais:annotationOf rdf:resource=“info:fedora/uva:pid-3”/></rdf:Description>

</rdf:RDF>

<rdf:Description rdf:about="info:fedora/uva:pid-3"> <uva:hasPartLetter rdf:resource="info:fedora/uva:pid-2"/>

<uva:hasPartDiagram rdf:resource="info:fedora/uva:pid-1"/></rdf:Description>

</rdf:RDF>

<rdf:Description rdf:about="info:fedora/uva:pid-10> <ais:providesContextFor rdf:resource=“info:fedora/uva:pid-3”/></rdf:Description>

</rdf:RDF>

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [115]

• NOT the core object store - RI is a graph-based index of the repository

• Automatic, incremental indexing into triplestore

• Search/query the repository via Fedora RI Query Interface

Fedora RDF-based Resource Index (RI)

RDF Index of Repository

RDF datastream

Fedora object properties

DC datastream

Digital Object Store

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [116]

RI Graph - view 1 (abbreviated) …

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [117]

RI Graph - view 2 (abbreviated) …

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [118]

RI Implementation: The Triplestore Challenge

• Scalability• Few triplestores perform well for 100M+ triples• Kowari – we tested to 180M triples• MPTStore – we tested to 250M triples

• Performance• Jena - easy to get out of memory• Sesame Native - slow for complex queries • Kowari

• Fast queries and full-featured query language (iTQL)• Instability and corruption problems

• MPTStore• Very fast for SPO queries (limited support for complex queries)• Add/modify significantly faster than Kowari

• Mulgara• Fork of Kowari; complex queries; models; inference• Major bug fixes to fix stability and corruption problems• XA2 transactions• Claims support for billions of triples

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [119]

Fedora Repository – Notable Features

• Generic Digital Object Model

• Automatic content versioning and audit trail

• Web Service Interfaces (REST and SOAP)

• Authentication

• Authorization– Flexible fine-grained policy enforcement

– Built-in support for Extensible Access Control Markup Language (XACML)

• RDF– Each object contains its own RDF assertions

– Repository-wide index of all object (RDF triplestore)

• Self-healing – rebuild repository via digital object source files

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [120]

Outline

FEDORA

Examples: PLoS ONE and National Science Digital Library

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [121]

PLoS ONE and Topaz

Open Access Publishing and Collaboration

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [122]

NSDL: Semantic Digital Library Architecture

NDR

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [123]

What is NSDL committed to?

– NSDL 2.0 as a platform for a collaborative,

contributory semantic digital library

– Supporting communities across the full range of

science, technology, engineering and mathematics

research, learning and education

– Supporting the creation of context around library

resources to enhance discovery, use, and

understanding

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [124]

NSDL Semantic Digital Library

repository requirements

• Supports storing both content and

metadata

• Allows arbitrary relationships

among resource and metadata

objects: organization, annotation,

citation

• Accessible through web service

architecture of remixable data

sources and transformations

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [125]

NSDL Data Repository (NDR)

• Implemented in Fedora 2.2 with MPTStore • Moderately large

– 4.7 million digital objects– 250 million RDF triples

• Digital Objects– Resources– Metadata– Agents– Metadata providers– Aggregators

• REST API and authentication• In production at nsdl.org

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [126]

NSDL as Semantic Digital Library:

collaboration, context, and contribution

• Platform: Fedora repository and services

• Applications:– Solution 1: Leverage the existing successful models: blogs,

wikis, bookmarking/tagging

– Solution 2: Leverage the existing software: WordPress,

MediaWiki, Connotea, Sakai

– Solution 3: Engage with partners and the broader community

to build applications to the platform

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [127]

Expert Voices - Blogs on top of Fedora

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [128]

Expert Voices

• NSDL Blogosphere (http://expertvoices.nsdl.org)– Topic-based discussions (e.g. forensics) linked to related library

resources– A way for NSDL community members to become NSDL

contributors of resources, questions, reviews, annotations, metadata

• Technology:– Wordpress-based multi-user multi-blog application (open source,

plug-in architecture)– Owner controls publication of entries as NSDL resources and

visibility of comments (NSDL middleware and Shibboleth)

• Blog Entries:– linked references to NSDL library resources

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [129]

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [130]

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [131]

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [132]

NSDL 2.0 – The Whole Ecosystem

Protocol:OAI-PMHHTTPRESTNDR API

STEMCollections

SearchServiceArchive

Service

Fedora-based NDR

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [133]

NSDL 2.0 and the Semantic Web

• NSDL 2.0 applications situate resources in context, aiding both

discovery and use

• Users become contributors, adding new resources, ratings,

annotations, and organizational structure – frequently as a side

effect of using the library

• Fedora-based semantic web technology organizes resources,

ties context to content, maintains provenance, enables discovery,

empowers the user, and powers the library

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [134]

Fedora Web Site: www.fedora.info

Community Open Source Tools: www.fedora.info/tools

Fedora Wiki: www.fedora.info/wiki

Tutorial:: http://openarchives.org/fedora/ESWC-Fedora.zip

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [135]

Tutorial – Semantic Digital Libraries

- Comparison and the Future -

Sebastian R. Kruk, Bernhard Haslhofer,

Philipp Nußbaumer, Sandy Payette, Tomasz Woroniecki

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [136]

Outline

SIMILE – short overview

Comparison between existing solutions

Digital Libraries and Social Web

Semantic Digital Libraries Scenarios

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [137]

SIMILE – Introduction

• SIMILE - Semantic Interoperability of Metadata and

Information in unLike Environments

– joint project conducted by the W3C, HP, MIT Libraries, and MIT's Lab

for Computer Science.

– extends and laverages DSpace, seeking to enhance interoperability

among digital assets, schemata, metadata, and services

– Goal: Make metadata interoperability easier for digital libraries by

providing useful tools for browsing, searching and mapping

heterogeneous metadata in RDF [MacKenzie Smith, MIT Libraries]

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [138]

SIMILE – Introduction

SIMILE:

• enhances interoperability and provides end-user services:

– for digital assets, arbitrary schemata, metadata and services.

– across distributed individual, community, and institutional stores.

– though the application of RDF and semantic web techniques.

• implements a digital asset dissemination architecture based

upon web standards

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [139]

SIMILE – Delivered Components

• Tools for Metadata Managers– Gadget - XML inspector

– RDFizers - Batch tools to transform existing XML data into RDF

– Solvent - Firefox extension for Javascript screen scraping

– Welkin - Graphical tool to inspect/edit RDF graph

• Tools for End-Users– Longwell - Web-based RDF faceted metadata browser

– Piggy Bank - Firefox extension for personal information management of

metadata in RDF

– Semantic Bank - Web-based server that allows data publishing and sharing

by individuals, groups, or communities

– Exibit - lightweight structured data publishing framework

– Timeline - AJAXy widget for visualizing time-based events

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [140]

RDFizers - Transform XML data into RDF

RDFizers - Transform XML data into RDF:

• tools that allow to transform existing data into an RDF representation

• List of RDFizers in SIMILE:– MARC/MODS RDF

– OAI-PMH RDF

– OCW RDF

– EMail RDF

– BibTEX RDF

– Flat RDF

– Weather RDF

– Java RDF

– Javadoc RDF

– Jira RDF

– Subversion RDF

– Random RDF

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [141]

Solvent - JavaScript screen scraping

Solvent - JavaScript screen scraping:

• a Firefox extension that helps write Javascript

screen scrapers for Piggy Bank.

• Motivation:– Piggy Bank needs web pages to embed information in RDF.

– Unfortunately, not many web pages embed or link to RDF

information.

– Piggy Bank is capable to execute a particular screen scraper on

particular pages in order to "extract" the information it needs.

– turns a regular web page into a semantic web page, freeing the

data from the page/site that contains it.

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [142]

Solvent - JavaScript screen scraping

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [143]

Longwell - RDF faceted metadata browser

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [144]

PiggyBank

• Firefox extension for managing metadata - Loads RDF into

local Longwell server

• Search and faceted browse of local RDF - Views defined by

library, other users

• Users can find, collect, annotate RDF - Can then publish for

access by others

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [145]

PiggyBank

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [146]

SemanticBank

• Semantic Bank use cases:

– persist information remotely on a server

– share information with other people

– lets you publish your information, both in RDF or to regular web pages

• for individuals, groups, communities - e.g. conference

proceedings

– the ability to tag resources creates a powerful serendipitous

categorization

– Longwell facetted browsing view of published information

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [147]

Timeline

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [148]

Exibit

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [149]

Outline

SIMILE – short overview

Comparison between existing solutions

Digital Libraries and Social Web

Semantic Digital Libraries Scenarios

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [150]

System Features Comparison

JeromeDL BRICKS Fedora

OS Support Any Any Any

Hardware Requirements

500MB RAM, min 128MB HD

500MB RAM, min 100MB HD

500MB RAM,min 100MB HD

Software Requirements

Java 1.5, Tomcat 5.5, Sesame

Java 1.4/1.5, Jena

Java 1.5, Tomcat,Kowari/Mulgara or MPTStore

Current Stage Research Stable version 2.0.1

Second Prototype

ProductionVersion 2.2

No. Installations 12+ ~ 8 ~50 monitored; large # of downloads unmonitored

Support Model Open Source Open Source Open Source

General Properties

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [151]

System Features Comparison

JeromeDL BRICKS Fedora

Distribution Distributed searching (P2P), aggregated browsing (hierarchical)

Fully decentralized (P2P)

federation via nameresolver search services; Alvis P2P

Architecture Granularity

Low (main building blocks)

High(many Components)

High (core repository service with configurable modules; loosely coupled services)

DB - Support Any Sesame-compliant backend

Any Jena compliant backend

MySQL, Postgres, Oracle, McKoi;Kowari/Mulgara

Architectural Aspects

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [152]

System Features Comparison

JeromeDL BRICKS Fedora

Content Types All All All

Content Models JeromeDL ontology

Any Any

Metadata Schema MarcOnt + extensions

Any RDF/S & OWL schema

Any XML Schema, RDF/S & OWL schema

Query types Full-text, Filed-Search, Ontology-based, NL Query Templates

Full-text, Field-Search, Ontology-based (sparql)

Field Search,Ontology-based (itql, rdql, sparql, spo),Full-Text (Lucene or Zebra backed service)

Content & Metadata Aspects

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [153]

System Features Comparison

JeromeDL BRICKS Fedora

Security Model FOAFRealm RBAC XACML Policy

Granularity Resource Component, Method, Object

Object, Datastream, Dissemination method

DRM Model Fair use DRM under development

MPEG-21 REL DRM Datastreams

DRM Enabling Tool Support

Watermarking

Security & DRM Aspects

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [154]

System Features Comparison

JeromeDL BRICKS Fedora

Reasoning Recommendation engine based on Prolog

Configurable inference engine

Holding pattern; look to Mulgara;

Tagging Free tagging, Wordnet-based

Annotation middleware component

middleware/apps (e.g., NSDL/NDR; PLoSONE/Topaz)

Taxonomies Any (JOnto) Any Any

Knowledge Sharing

SSCF component via middleware upon BRICKS

via middleware upon Fedora

Communities SIOC and FOAF compliance

Semantic Aspects & Community Features

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [155]

Outline

SIMILE – short overview

Comparison between existing solutions

Digital Libraries and Social Web

Semantic Digital Libraries Scenarios

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [156]

The future - Social Semantic Digital Libraries

• Why current (semantic) digital libraries are not enough?

– digital libraries should not be for librarians only but for average

people

– they concentrate on delivering content/information, not on

knowledge sharing within a community of users

– digital libraries have lost human-part of their predecessors

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [157]

The future - Social Semantic Digital Libraries

• What could be the solution?

– make users/readers involved in the content annotation process

– allow users/readers to share their knowledge within a community

– provide better communication between users in and across

communities

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [158]

The future - Social Semantic Digital Libraries

What is Web 2.0?– The Web where “ordinary” users can meet, collaborate, and share

using whatever is newly popular on the Web (tagged content, social bookmarking, AJAX, etc.)

– The term Web 2.0 was made popular by Tim O’Reilly: http://www.oreillynet.com/pub/a/oreilly/tim/news/2005/09/30/what-is-web-20.html

– Popular examples include: Bebo, del.icio.us, digg, Flickr, Google Maps, Skype, Technorati, Wikipedia…

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [159]

The future - Social Semantic Digital Libraries (3)

• Web 2.0 focuses include:– The Web as a platform for social and collaborative exchange

– Reusable community contributions

– Subscriptions to information, news, data flows, services

– Mass-publishing using web-based social software

• Social software for communication and collaboration:– IM, IRC, Forums, Blogs, Wikis, Social Network Services, Social

Bookmarks, MMOGs…

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [160]

Social Semantic Information Spaces

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [161]

Comparing Web 1.0 / Web 2.0 / Semantic Web 2.0

Semantic Social NetworksOnline Social NetworksBuddy Lists, Address Books

Semantic Social Information Spaces

--

Social Semantic Digital Social Semantic Digital LibrariesLibraries

Google Scholar, Book Search

CiteSeer, Project Gutenberg

Semantic Forums and Community Portals

Community PortalsMessage Boards

Semantic BlogsBlogsPersonal Websites

Semantic SearchGoogle Personalised, DumbFind

Altavista, Google

Semantic WikisWikisContent Management

Systems

Semantic Web 2.0Semantic Web 2.0Web 2.0Web 2.0Web 1.0Web 1.0

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [162]

Outline

SIMILE – short overview

Comparison between existing solutions

Digital Libraries and Social Web

Semantic Digital Libraries Scenarios

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [163]

Geo, Time, and Machine Tagging

• Geo-tagging for resources with a specific geographical

location

• Time-tagging – community driven process of assigning

auxiliary multimedia content

• Machine-tagging – ability to mix structured annotations

into tags

• ROI-tagging:

– Regions of interest

– ERP game

– Asynchonous version with annealing of annotations for less

frequently visited libraries

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [164]

SDL in eLearning

• One of potential sources of future e-Learning systems

• On the verge between formal (libraries) and informal

(communities) learning sources

• Semantic interoperability with Learning Management

Systems

• Improve knowledge creation, delivery and sharing

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [165]

SDL in Future Museums

• Museums have physical objects

• Should bind digital annotations with physical objects

• Real-virtual tours

– Start with real, guided tour

– Ubiquitous browse through context information

– Locate other exhibitions in the vicinity

– Share your knowledge and experience with others, leave bread-

crumbs for others

– Get the most of the exhibition during your visit

Tutorial – Semantic Digital Libraries, June 3, 2007 ESWC 2007 Copyright 2006-2007, DERI NUI Galway, University of Vienna, Fraunhofer IPSI, Cornell University [166]

Discussion – Feedback

The Librarian from Unseen University in Ankh-Morpork (formerly Dr. Horace Worblehat)