Triplificating and linking XBRL financial data

Triplificating and Linking XBRL Financial Data. Roberto García, Rosa Gil. HCI & Data Integration Research Group, Universitat de Lleida, Spain. 6th International Conference on Semantic Systems, I-SEMANTICS'10, 1-3 September 2010, Graz, Austria.

description

Financial reports based on the XML standard XBRL are mapped to RDF together with the associated XBRL taxonomies, which are mapped to OWL ontologies.

Transcript of Triplificating and linking XBRL financial data

Page 1: Triplificating and linking XBRL financial data

Triplificating and Linking XBRL Financial Data

Roberto García, Rosa Gil

HCI & Data Integration Research Group
Universitat de Lleida, Spain

6th International Conference on Semantic Systems, I-SEMANTICS'10, 1-3 September 2010, Graz, Austria

Page 2: Triplificating and linking XBRL financial data

Introduction

• Explore Semantic Web technologies for financial information
• Most public financial data available as XBRL
• Reuse this data, map XBRL to RDF
  – Apply generic mapping: ReDeFer
    • XSD2OWL, XML2RDF
• Publish as Linked Data using Rhizomer

Page 3: Triplificating and linking XBRL financial data

Approach

• Reuse XBRL data and schemas
• Facilitate roundtrip:
  – XBRL → SW → XBRL
• Showcase Semantic Web benefits
• Test data: SEC’s EDGAR voluntary program¹

¹ From 2009, 3-year phased-in schedule; by end of 2011, all U.S. public companies plus 8,000 mutual funds

Page 4: Triplificating and linking XBRL financial data

ReDeFer

• Part of the Rhizomik initiative
• ReDeFer (in and out Semantic Web)
  – RDF2HTML+RDFa
  – RDF2SVG
  – XML2RDF
  – XSD2OWL

Use cases: MPEG-7, MPEG-21,…

Page 5: Triplificating and linking XBRL financial data

XBRL XML to RDF

• ReDeFer XML2RDF, model XML tree using triples
  – xsd:element and xsd:attribute → rdf:Property
  – xbrli:id and xbrli:identifier → rdf:Resource ID
  – Other resources, anonymous

Rendered by RDF2SVG
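The idea of modelling the XML tree with triples can be illustrated with a minimal sketch. This is not ReDeFer's implementation: the function name `xml_to_triples`, the `http://example.org/filing#` base URI, the blank-node labels, the `rdf:value` predicate for text content and the sample fragment (with a placeholder identifier) are all assumptions made for illustration. As a simplification, only the `id` attribute is used to name resources.

```python
import itertools
import xml.etree.ElementTree as ET

# Hypothetical, heavily simplified XBRL-like instance (placeholder identifier).
SAMPLE = """
<xbrl xmlns="http://www.xbrl.org/2003/instance">
  <context id="ctx1">
    <entity><identifier scheme="http://www.sec.gov/CIK">0000000000</identifier></entity>
  </context>
</xbrl>
"""

def xml_to_triples(xml_text, base="http://example.org/filing#"):
    """Walk the XML tree and return (subject, predicate, object) tuples."""
    counter = itertools.count(1)
    triples = []

    def subject_for(elem):
        # Elements with an id become named resources; all others are anonymous.
        if "id" in elem.attrib:
            return base + elem.attrib["id"]
        return "_:b%d" % next(counter)

    def prop(tag):
        # ElementTree writes qualified names as "{namespace}local";
        # join both parts into a single property URI.
        if tag.startswith("{"):
            ns, local = tag[1:].rsplit("}", 1)
            sep = "" if ns.endswith(("#", "/")) else "#"
            return ns + sep + local
        return tag

    def visit(elem, subj):
        for name, value in elem.attrib.items():
            if name != "id":
                triples.append((subj, prop(name), value))
        for child in elem:
            text = (child.text or "").strip()
            if len(child) == 0 and not child.attrib:
                # Leaf element without attributes: plain literal value.
                triples.append((subj, prop(child.tag), text))
            else:
                obj = subject_for(child)
                triples.append((subj, prop(child.tag), obj))
                if text:
                    triples.append((obj, "rdf:value", text))
                visit(child, obj)

    root = ET.fromstring(xml_text)
    visit(root, subject_for(root))
    return triples

for triple in xml_to_triples(SAMPLE):
    print(triple)
```

In this sketch every element and attribute name becomes a property, the context carrying an `id` becomes a named resource, and the remaining nodes stay anonymous, mirroring the three rules listed on the slide.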

Page 6: Triplificating and linking XBRL financial data

XBRL XSD to OWL

• XBRL Schemas: XBRL 2.1, US GAAP¹,…
• ReDeFer XSD2OWL

XSD2OWL mappings (XML Schema → OWL):

– element | attribute → rdf:Property, owl:DatatypeProperty, owl:ObjectProperty
– element@substitutionGroup → rdfs:subPropertyOf
– element@type → rdfs:range
– complexType → owl:Class
– complexType//element → owl:Restriction
– extension@base | restriction@base → rdfs:subClassOf
– @maxOccurs, @minOccurs → owl:maxCardinality, owl:minCardinality
– sequence, choice → owl:intersectionOf, owl:unionOf

Figure: Core classes and properties for XBRL Instance

¹ Ontologies available from http://rhizomik.net/ontologies/bizontos
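A subset of the XSD2OWL mappings on this slide can be sketched in a few lines. This is an illustrative approximation, not the ReDeFer code: the function name `xsd_to_owl`, the schema fragment and the use of unprefixed names as identifiers are assumptions, and only top-level elements and complexTypes are handled.

```python
import xml.etree.ElementTree as ET

XS = "{http://www.w3.org/2001/XMLSchema}"

# Hypothetical schema fragment, loosely modelled on an XBRL taxonomy.
SCHEMA = """
<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:element name="Assets" type="monetaryItemType" substitutionGroup="item"/>
  <xs:complexType name="monetaryItemType">
    <xs:simpleContent>
      <xs:extension base="monetary"/>
    </xs:simpleContent>
  </xs:complexType>
</xs:schema>
"""

def xsd_to_owl(xsd_text):
    """Map top-level XSD declarations to (subject, predicate, object) tuples."""
    triples = []
    root = ET.fromstring(xsd_text)
    for elem in root.findall(XS + "element"):
        name = elem.get("name")
        # element -> rdf:Property
        triples.append((name, "rdf:type", "rdf:Property"))
        # element@substitutionGroup -> rdfs:subPropertyOf
        if elem.get("substitutionGroup"):
            triples.append((name, "rdfs:subPropertyOf", elem.get("substitutionGroup")))
        # element@type -> rdfs:range
        if elem.get("type"):
            triples.append((name, "rdfs:range", elem.get("type")))
    for ctype in root.findall(XS + "complexType"):
        name = ctype.get("name")
        # complexType -> owl:Class
        triples.append((name, "rdf:type", "owl:Class"))
        # extension@base | restriction@base -> rdfs:subClassOf
        for tag in ("extension", "restriction"):
            for derived in ctype.iter(XS + tag):
                triples.append((name, "rdfs:subClassOf", derived.get("base")))
    return triples

for triple in xsd_to_owl(SCHEMA):
    print(triple)
```

Cardinality restrictions and the sequence/choice constructors are left out to keep the sketch short.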

Page 7: Triplificating and linking XBRL financial data

XML2RDF plus XSD2OWL

• Enrich RDF with links to classes for the corresponding XSD complexTypes
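One possible reading of this enrichment step is sketched here: once the schema says which complexType an element has, each resource reached through that element can be typed with the corresponding class. The function name `add_types`, the prefixed names and the sample triple are assumptions for illustration only.

```python
def add_types(triples, element_types):
    """Return the triples plus an rdf:type link for every resource that is
    the object of a property whose XSD element has a known complexType."""
    enriched = list(triples)
    for subj, pred, obj in triples:
        cls = element_types.get(pred)
        if cls is not None:
            enriched.append((obj, "rdf:type", cls))
    return enriched

# Hypothetical input: one triple from XML2RDF, one element-to-type
# association taken from the XSD2OWL output.
data = [("ex:ctx1", "xbrli:entity", "_:b2")]
types = {"xbrli:entity": "xbrli:contextEntityType"}

for triple in add_types(data, types):
    print(triple)
```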

Page 8: Triplificating and linking XBRL financial data

XBRL Sources

• SEC’s EDGAR filings data files:
  – http://www.sec.gov/Archives/edgar/xbrlrss.xml
  – http://www.sec.gov/Archives/edgar/usgaap.rss.xml

Figure labels: XML2RDF, XSD2OWL; Calculations, Labels, Presentation, Definition

Page 9: Triplificating and linking XBRL financial data

XBRL Ontologies

Figure labels: filing data … adbe-20080916.rdf …, adbe-20090529.rdf, adbe-20090828.rdf; filing ontologies adbe-20080829.owl, adbe-20090529.owl, adbe-20090828.owl; ontology layers XBRL 2.1, US-GAAP, Non-GAAP

Page 10: Triplificating and linking XBRL financial data

Ontology Alignment

• Integrate ontologies for different filings
  – E.g.: adbe-20080530.owl:InvestmentLeaseReceivable = adbe-20080829.owl:InvestmentInLeaseReceivable
  – Ontology alignment tools (edit distance)
  – Perform queries across filings
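The edit-distance idea can be sketched with a plain Levenshtein distance over concept names. The slides do not say which alignment tool or threshold was used, so the function names and the `max_distance=2` cut-off are assumptions, and "Goodwill" is only a distractor name added for the example.

```python
def levenshtein(a, b):
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]

def align(names_a, names_b, max_distance=2):
    """Pair each name in names_a with its closest name in names_b,
    keeping the pair only when the distance is within the threshold."""
    pairs = []
    for a in names_a:
        best = min(names_b, key=lambda b: levenshtein(a, b))
        if levenshtein(a, best) <= max_distance:
            pairs.append((a, best))
    return pairs

older = ["InvestmentLeaseReceivable"]
newer = ["InvestmentInLeaseReceivable", "Goodwill"]
print(align(older, newer))
```

The two names from the slide differ only by the inserted "In", so their distance is 2 and they are paired under this threshold.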

Page 11: Triplificating and linking XBRL financial data

Semantic XBRL

• Dataset size
  – Sept 2010: 9.19 million triples from 2,974 XBRL filings
  – Available from http://rhizomik.net/semanticxbrl/

Page 12: Triplificating and linking XBRL financial data

Semantic XBRL

• Linked Data:
  – Links to DBpedia:
    • Companies: Central Index Key (CIK), name or ticker through dbpprop:redirect
      – dbpedia:Microsoft is dbpprop:redirect of dbpedia:MSFT
    • Units: e.g. USD → dbpedia:United_States_dollar
  – Links to Corporate Ownership RDF Data¹ (individual ownership and subsidiaries):
    • Companies: CIK

¹ http://www.rdfabout.com/demo/sec/
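The DBpedia links described above might be resolved along these lines. This is only a sketch: the helper names are hypothetical, the namespace URIs are the usual expansions of the `dbpedia:` and `dbpprop:` prefixes and are assumed here, and current DBpedia releases may expose redirects under a different property than the one named on the slide.

```python
DBPEDIA = "http://dbpedia.org/resource/"
DBPPROP = "http://dbpedia.org/property/"

def redirect_query(ticker):
    """Build a SPARQL query that follows the redirect from a ticker-named
    DBpedia resource (e.g. MSFT) to the company resource."""
    return (
        "SELECT ?company WHERE { "
        "<%s%s> <%sredirect> ?company . }" % (DBPEDIA, ticker, DBPPROP)
    )

# Unit links: USD is the only mapping given in the slides.
UNIT_LINKS = {"USD": DBPEDIA + "United_States_dollar"}

def unit_link(code):
    """Return the DBpedia resource for a unit code, or None if unknown."""
    return UNIT_LINKS.get(code)

print(redirect_query("MSFT"))
print(unit_link("USD"))
```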

Page 13: Triplificating and linking XBRL financial data

Architecture

Figure (architecture diagram), labels: EDGAR RSS Feed (download); XBRL filings (.xml); EDGAR Standard Taxonomies and XBRL Schemas (.xsd); ReDeFer (XML2RDF, XSD2OWL); XBRL Ontologies; Rhizomer (+1M triples; SPARQL; RDF2HTML, XHTML+RDFa; RDF2Form; Rhizomer AJAX: get, edit, post); Linking to LOD

Page 14: Triplificating and linking XBRL financial data

Prototype Demo

Rhizomer:
• publish
• query
• browse
• edit
• mashup (map, timeline,…)
• etc.

Page 15: Triplificating and linking XBRL financial data

Get a context description from its URL

http://rhizomik.net/semanticxbrl/adbe-20081216/From20071201-To20081128_None_None_StatementOperatingActivitiesSegmentAxis_KnowledgeWorkerSolutions

Page 16: Triplificating and linking XBRL financial data

Reuse Wikipedia data (DBpedia)

http://rhizomik.net/semanticxbrl/?query=DESCRIBE <http://dbpedia.org/resource/Adobe_Systems>

Page 17: Triplificating and linking XBRL financial data

Get all Investment[In]LeaseReceivable

http://rhizomik.net/semanticxbrl/?query=DESCRIBE ?o WHERE { ?s ?p ?o . ?p <http://www.w3.org/2000/01/rdf-schema#subPropertyOf> <http://rhizomik.net/ontologies/bizontos/xbrl-edgar/adbe-20080530.owl#InvestmentLeaseReceivable> . }
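The query URL on this slide is shown unencoded. A small sketch of how such a URL could be built with proper percent-encoding follows; the endpoint and the `query` parameter come from the slide, while the function name is hypothetical and nothing is assumed about the service still being online.

```python
from urllib.parse import urlencode, urlparse, parse_qs

ENDPOINT = "http://rhizomik.net/semanticxbrl/"
RDFS_SUBPROPERTY = "http://www.w3.org/2000/01/rdf-schema#subPropertyOf"

def describe_subproperty_values(property_uri):
    """Return the request URL for a DESCRIBE query over all values of
    properties declared as sub-properties of property_uri."""
    query = (
        "DESCRIBE ?o WHERE { ?s ?p ?o . "
        "?p <%s> <%s> . }" % (RDFS_SUBPROPERTY, property_uri)
    )
    return ENDPOINT + "?" + urlencode({"query": query})

url = describe_subproperty_values(
    "http://rhizomik.net/ontologies/bizontos/xbrl-edgar/"
    "adbe-20080530.owl#InvestmentLeaseReceivable"
)
print(url)
```

Because the aligned concept names are linked through rdfs:subPropertyOf, a single query of this shape can reach values reported under either spelling.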

Page 18: Triplificating and linking XBRL financial data

Integrate different CIK variants

Page 19: Triplificating and linking XBRL financial data

Conclusions

• XBRL tools: for financial information editing and maintenance (formulae,…)
• Semantic Web tools: data integration (linked data), cross querying,…
  – Across filings, companies, accounting principles,…

Figure labels: XSD2OWL, XML2RDF

Page 20: Triplificating and linking XBRL financial data

Future Work

• Systematic mappings across filings for the same company
• Explore mappings:
  – Across companies
  – Across accounting principles…
• Develop demo web apps:
  – E.g.: show accounting data for ownership/subsidiary networks of companies…

Page 21: Triplificating and linking XBRL financial data

Thank you for your attention

Roberto García

http://rhizomik.net/~roberto

HCI & Data Integration Research Group
Universitat de Lleida, Spain