★★★★★
Miroslav Líška, Marek ŠurekDatalan (Bratislava, Slovakia)
l
Five Star Open Data in SR, 16.9.2015
Toward Government Linked Data : A Slovak Case
data.gov.sk-semanticweb
I. Introduction1. Five Star Open Data2. History of Semantic Web in Slovakia/Datalan
II. Method3. Main Principles4. data.gov.sk Resource General* URI Pattern5. Supported ontologies (ODP Ontology + SEMIC Recommendations)
III. Process6. URI Registration – process7. URI Registration – use case model
IV. Searching for Business Cases8. Slovpedia (Tripleskop)9. Slovpedia (PharmaGuard)
*Annext A: data.gov.sk Resource URI Patterns – detail specification
Agenda
I. Introduction
1) Five Star Open Data● Slovak Government Data ? Sad story.
But this can change! Semantic Web !
2) History of Semantic Web in SK, Datalan
Datalan
Slovakia
● 2006 … – 1st Workshop on Intelligent and Knowledge oriented Technologies. SAV, FIIT STU, FEI TUKE
● 2009 … – start of Sestate, Susan, Tripleskop, Slovpedia, Pharmaguard, SemTelcoSearch
● 2013 … – DTLN became a member of Data Standardization Process in SK /Ministry of Finance SR/ (as ITAS Deputy) ● 1. formal proposal of sk semantic standards [too soon]
● 2015
● 2. formal proposal of data.gov.sk-semanticweb_1.0 (we believe for approval until end of 2015)
202X
Miroslav Líška
We fought for the Semantic Web
II. An Approach
3) Method Overwiev
URI Pattern Rules +
Simple / Extendable Government URI System
data.gov.sk Semantic Standards
data.gov.sk general URI pattern
Catalog URIOntology URI
Class URI
Individuals Template URI
UR
I V
ers
ion
ing
ru
les
URI
IndividualURI
Dataset URI
Dataset Item URI
Object Property URI
DataType Property URI
Su
pp
lem
en
tary
UR
I ru
les
Supported Ontologies+ URI Registration Process
methodprocess)(
4) data.gov.sk Resource URI Patterns
TYPE● id = concrete individual („Lukas Liska“, „Datalan“, „Bratislava“ ...) ● def = ontology entity definition● doc = document, file, electronic form ...● set = catalog, dataset (codelist), distribution
CLASS - resource classification
IDENTITY – standard relationalDB-like ID (0000001, 0000002 … )
VERSION - resource version/distribution (2015-09-17, 1.0, A, B …)
General URI Pattern for data.gov.sk Resource
http://data.gov.sk/[TYPE]/[CLASS]/[IDENTITY]/{VERSION}
§1
General URI Pattern for data.gov.sk Resource
http://data.gov.sk/[TYPE]/[CLASS]/[IDENTITY]/{VERSION}
§1
Example – Legal Form Class (ODP Ontology)
http://data.gov.sk/def/ontology/odp/LegalForm
Example – Legal Form 121 represents a joint stock form of company
http://data.gov.sk/def/legalform/121
Example – Legal Forms Codelist
http://data.gov.sk/set/codelist/legalform
Example – Distribution of the Legal Forms codelist
http://data.gov.sk/set/codelist/legalform/2015-09-16
examples
See Annext A for full specification
4) data.gov.sk Resource General URI Patterns
5) Supported Ontologies (1/2)A. ODP Ontology: Knowledge Kernel
Mapping to the actual KDP element
Mapping to actual codelist
Mapping to SEMIC recommended ontology
= OntologizationOf (ElementsOf(KDP + MetaIS)) + mapping to SEMIC Ontologies
5) Supported Ontologies (2/2) B. SEMIC Recommended ontologies
DCAT Data Catalog Vocabulary ADMS Asset Description Metadata Schema ADMS.SW ADMS for Software CPSV Core Public Service Vocabulary ROV Registered Organization Vocabulary LOCN Location Core Vocabulary PERSON Person Core Vocabulary
RDF Resource Description Framework RDFS Resource Description Framework Scheme OWL Web Ontology Language SKOS Simple Knowledbe Organizational System
C. Semantic Core Ontologies
III. Process
6) URI Registration – process model
§3
7) URI Registration – UC model
III. Searching for Business Models
8) Slovpedia/Tripleskop
MetaIS
data.gov.sk URIresourcesdefinition
Slovpedia
Enriched dataAddional datasets
data.gov.sk
LinkedDataDatasets Governent data
source1
source2
data
definitionssourceN
future status
9) Slovpedia/Tripleskop
Governent data
source1
source2
sourceN
Tripleskop
Slovpedia
Linked Data
actual statusactual status
SestateCity in Mobile 2
PharmaGuard
8) Slovpedia/Tripleskop
8) Slovpedia/Tripleskop
8) Slovpedia/Tripleskop
8) Slovpedia/Tripleskop
9) Slovpedia/PharmaGuard (1/2)
Líška, M., Šurek, M.: An Approach to NLP based Drug Interactions with Inferencing. Unpublished yet.
PharmaGuard.EU (1.0)A Drug & Medication Mobile Application based on government drug data (sk data + drugbank.ca)
uses
9) Slovpedia/PharmaGuard (2/2)
1.
2.
3.4.
Referenceshttp://www.w3.org/standards/semanticweb/https://joinup.ec.europa.eu/community/semic/descriptionhttp://www.openrdf.org/http://www.w3.org/OWL/http://www.w3.org/RDF/
http://sk.linkedin.com/in/miroslavliska/http://sk.linkedin.com/in/mareksurekhttp://www.datalan.skhttp://www.slovpedia.comhttp://www.tripleskop.com
Thanks for your attention
Annext A: data.gov.sk Resource URI Patterns >>
TYPE● id = concrete individual („Lukas Liska“, „Datalan“, „Bratislava“ ...) ● def = ontology entity definition● doc = document, file, electronic form ...● set = catalog, dataset (codelist), distribution
CLASS - resource classification
IDENTITY – standard relationalDB-like ID (0000001, 0000002 … )
VERSION - resource version/distribution (2015-09-17, 1.0, A, B …)
General URI Pattern for data.gov.sk Resource
http://data.gov.sk/[TYPE]/[CLASS]/[IDENTITY]/{VERSION}
§1
Annext A: data.gov.sk Resource URI Patterns
Individual URI http://data.gov.sk/id/[class]/[code]
Example – Bratislava Self-Governing Region
http://data.gov.sk/id/nuts3/SK01
Example - Datalan http://data.gov.sk/id/corporatebody/35810734
Example – Drug Concor 30x5mg
http://data.gov.sk/id/drug/94164
Example – Andrej Kiska (Slovak President)
http://data.gov.sk/id/president/andrej-kiska
Example – this document
http://data.gov.sk/doc/pdf/method/uri-for-slovak-public-data/201509-09-16
§1.3
Document URI http://data.gov.sk/doc/[docType/filename]/[version]
§1.2
Example – Andrej Kiska (Slovak President)
http://data.gov.sk/id/president/andrej-kiska
The Public Procurement Information Systemversion
http://data.gov.sk/id/isvs/5854
Annext A: data.gov.sk Resource URI Patterns
Dataset (codelist) http://data.gov.sk/setset/[datasetType]/[dataset]
[datasetType]● codelist = a set that contains codelist elements● data = a set that contains data „records“
[dataset] = english name of actual dataset
Example – Legal Forms codelist
http://data.gov.sk/set/codelist/legalform
Example – Approved and Categorized Drugs Datasets
http://data.gov.sk/set/data/categorizeddrug
§1.4
Annext A: data.gov.sk Resource URI Patterns
Dataset item http://data.gov.sk/[type]/[class]/[identity]
[type]● def = an item represents ontology entity definition (§1.1)● id = an item represents individual (§1.4.3)
[class] = type of the item
[identity] = present item code
Example – Joint Stock Company as the item of the Legal Forms codelist
http://data.gov.sk/def/legalform/121
Example – Bratislava Region as the item of the NUT3 codelist
http://data.gov.sk/id/nuts3/SK01
Extended example
legalform:121 rdf:type odp:LegalForm .legalform:121 rdfs:label “Joint Stock Company“@en .legalform:121 rdfs:label “Akciová spoločnosť“@sk .legalform:121 rdfs:label “Aktiengesellschaft“@de .
§1.4.1
Annext A: data.gov.sk Resource URI Patterns
Catalog (set of datasets) http://data.gov.sk/set/cat/[catalog]
Example – drug related datasets gropu http://data.gov.sk/set/cat/registered-drugs
§1.4.2
Annext A: data.gov.sk Resource URI Patterns
● Versionable resource is a resource which versions can exists in parallel, such as● an information systems, a service ...● an ontology● dataset distribution …
● Otherwise a resource is unversionable, such as● a person● geo entity● ...
Example – The Public Procurement Information Systemversion 1.0
http://data.gov.sk/id/isvs/5854/1.0
Example – A second version of
http://data.gov.sk/set/codelist/legalform/2015-09-04
Example – The Legal Forms Dataset published 2015-09-04
http://data.gov.sk/set/codelist/legalform/2015-09-04
§1.5 Resources versioning
Annext A: data.gov.sk Resource URI Patterns
URI – identify contentURL - navigate to content
Example – an eform
<http://data.gov.sk/doc/eform/DCOM_eDemokracia_StaznostFO_sk/1.0>
Example – eforms XSD
<http://data.gov.sk/doc/xsdschema/DCOM_eDemokracia_StaznostFO_sk/1.0>
Example – NOT this
<http://data.gov.sk/doc/eform/DCOM_eDemokracia_StaznostFO_sk/1.0/share/files/schema.xsd>
Annext A: data.gov.sk Resource URI Patterns§1.6 URI is not URL
Top Related