ESWC 2009 In-Use Track: SCOVO

16
1 SCOVO: Using Statistics on the Web of Data Michael Hausenblas, Wolfgang Halb, Yves Raimond, Lee Feigenbaum, Danny Ayers ESWC2009 In-Use Track, 2009-05-04, Heraklion, Greece

description

http://www.eswc2009.org/program-menu/accepted-in-use-track-papers/130-michael-hausenblas-wolfgang-halb-yves-raimond-lee-feigenbaum-and-danny-ayers-scovo-using-statistics-on-the-web-of-data

Transcript of ESWC 2009 In-Use Track: SCOVO

Page 1: ESWC 2009 In-Use Track: SCOVO

1

SCOVO:

Using Statistics on the Web of Data

Michael Hausenblas, Wolfgang Halb, Yves Raimond, Lee Feigenbaum, Danny Ayers

ESWC2009 In-Use Track, 2009-05-04, Heraklion, Greece

Page 2: ESWC 2009 In-Use Track: SCOVO

2

Agenda SCOVO Motivation

Requirements and Issues

Statistical Modelling Framework

Comparison

Usage

Page 3: ESWC 2009 In-Use Track: SCOVO

3

Motivation SCOVO: Statistical Core Vocabulary

http://sw.joanneum.at/scovo

Statistical data is present everywhere

Page 4: ESWC 2009 In-Use Track: SCOVO

4

MotivationWeb of Data is for sharing, accessing and using

DATA

SCOVO aims at makingstatistical data easierand better accessibleon the Web of Data

Page 5: ESWC 2009 In-Use Track: SCOVO

5

MotivationBased on 3 distinct efforts riese – ‚RDFizing and Interlinking the EuroStat Data

Set Effort‘ http://riese.joanneum.at Eurostat data (official European statistics)

US Census Bureau‘s annual Statistical Abstract Publishing UN and OECD

http://oecd.dataincubator.org/

Page 6: ESWC 2009 In-Use Track: SCOVO

6

Issues Handling of Multiple Dimensions Reusability and Uptake Structural vs. Domain Semantics Performance and Scalability Issues

Page 7: ESWC 2009 In-Use Track: SCOVO

7

Requirements Usable on the Web of Data (URIs, RDF, etc.) Extensible both on schema level and instance level Light-weight, addressing uptake, and performance

and scalability issues.

Page 8: ESWC 2009 In-Use Track: SCOVO

8

Statistical Modelling Framework

Page 9: ESWC 2009 In-Use Track: SCOVO

9

SCOVO

http://purl.org/NET/scovo

Page 10: ESWC 2009 In-Use Track: SCOVO

10

Comparison

Page 11: ESWC 2009 In-Use Track: SCOVO

11

Example From http://purl.org/NET/scovo

SELECT ?airport_name ?period ?percent_ontime_arrivals

FROM <http://sw.joanneum.at/scovo/otf-example-full.rdf>

WHERE {

?item rdf:type scv:Item ;

scv:dimension ?airport;

scv:dimension ?time_period;

scv:dimension ex:ota ;

rdf:value ?percent_ontime_arrivals .

?airport rdf:type ex:Airport ;

dc:title ?airport_name .

?time_period rdf:type ex:TimePeriod ;

scv:min ?min ;

scv:max ?max ;

dc:title ?period .

FILTER ( ?min > "2006-02-01" && ?max < "2006-08-01")

}

Page 12: ESWC 2009 In-Use Track: SCOVO

12

Usage: riese

RDFizing and Interlinking the Eurostat data Set Efforthttp://riese.joanneum.at

Contributing to Linking Open Data project Offers linked data version of Eurostat data for both humans and machines (first LOD-in-RDFa

dataset in the cloud)

Page 13: ESWC 2009 In-Use Track: SCOVO

13

Usage: voiD voiD – Vocabulary of Interlinked Datasets

http://semanticweb.org/wiki/VoiD Formal description of linked datasets Uses SCOVO to express stats about triples,

interlinking, resources, etc.

Page 14: ESWC 2009 In-Use Track: SCOVO

14

Usage: RDFStats

http://semwiq.faw.uni- linz.ac.at/node/9

Page 15: ESWC 2009 In-Use Track: SCOVO

15

Conclusion Modeling statistics is a non-trivial task (wide range of

requirements, etc.) SCOVO is usable, generic, simple However, there are issue:

Aggregation Domain semantics

Page 16: ESWC 2009 In-Use Track: SCOVO

16

Let‘s discuss!