Sharing Data on the Web

Post on 12-Sep-2014

425 views 2 download

Tags:

description

A presentation on Linked Data to the US EPA Office of Pollution Prevention and Toxics

Transcript of Sharing Data on the Web

Sharing Data on the Web

5-Mar-2013Linked Data Overview for US EPA

Office of Pollution Prevention & ToxicsBy Bernadette Hyland & Luke Ruth

Tuesday, March 5, 13

Agenda• Intros ...• Trends in data management• Government data publication• Update on EPA Linked Data Service• EPA OPPT sharing data on the Web• Review Next steps ...

Tuesday, March 5, 13

3 Round Stones produces the leading platform for the publication of reusable data on the Web. Our commercially supported Open Source platform is used by the Fortune 2000 and US Government agencies to collect, publish and reuse data, both on the public Internet and behind institutional firewalls.

Tuesday, March 5, 13

US EPA Linked Data

• Cloud-based Linked Data provision of 3 core programs:

• 2.9M Facilities• 100K substances• 25 years of toxic pollution reports• FISMA compliant• 16 Callimachus templates• Official launch April 2013

Tuesday, March 5, 13

Tuesday, March 5, 13

Guidance for developers

Tuesday, March 5, 13

US GPO• Cloud-based Linked Data provision of persistent URLs for US Government documents:

• 100k+ documents• Used by 1,240 Federal Depository Libraries and public

• In 3rd year of operation• Deemed an “Essential service” supporting US Congress

Tuesday, March 5, 13

Tuesday, March 5, 13

Trends in government data management

Tuesday, March 5, 13

Tuesday, March 5, 13

Open Government Data

Tuesday, March 5, 13

“We’re moving from managing documents to managing discrete pieces of open data and content which can be tagged, shared, secured, mashed up and presented in the way that is most useful for the consumer of that information.”

-- Report on Digital Government: Building a 21st Century Platform to Better Serve the American People

Growing chorus ...

Tuesday, March 5, 13

Tuesday, March 5, 13

15Photo credit: http://www.flickr.com/photos/glennharper/4452247708/Tuesday, March 5, 13

Big DataSimple dataComplex dataLegacy data

Tuesday, March 5, 13

GovernmentsGoals: Governmental transparency and/or improved

internal efficiencies (data warehouses)

Tuesday, March 5, 13

Tuesday, March 5, 13

Tuesday, March 5, 13

Tuesday, March 5, 13

HELPING DEFINE THE PROCESS

PublishConvertDescribeNameModelIdentify

Maintain

Tuesday, March 5, 13

• Start easy

• Well curated datasets with relevant data

• Reach out to developers

• Get others involved early

• Ensure internal benefit

• Integrate related datasets

• Address data quality ...

• Multiple approaches including crowed sourcing

Path to Success

Tuesday, March 5, 13

Put it on the Web• Upload & share it

• Document what is available

• Document how to use it

• Solve a customer need

• Encourage feedback

• Continuous improvement

Tuesday, March 5, 13

Use a non-proprietary format

• Open Web data exchange formats that improve access and re-use

• RDF instead of CSV

• Benefits

• Accessibility & Interoperability

• Reduce risk of

• Confidential info

• Software viruses

Tuesday, March 5, 13

Open data + open standards + open platforms

Highly scalable computing & hosting via the

Cloud

International Data Exchange Standards

5 Star Data (Linked Data)

Leverage Open Source tools

Tuesday, March 5, 13

Its the Web of Data

• Universal unidirectional links using URLs

• “Cooperation without coordination

• It’s simple ... nodes and links

Tuesday, March 5, 13

Universal Identifiers• It’s the foundation of the

Web

• Others can reference things

• Two references with the same URI are the same thing

• Quick, easy and scaleable

• People keep coming back for more!!

Tuesday, March 5, 13

Social Responsibility

• Responsibility to maintain published data

• Publish frequency of data updates

• Have a persistence strategy

• Ensure data is accurate as possible

• Respond to reports of problematic data

Tuesday, March 5, 13

29

Clinical Trials + enterprise linked

data

US Legislation + enterprise data

DBpedia + enterprise datasets

Data driven Web apps using Callimachus

Tuesday, March 5, 13

Tuesday, March 5, 13

User

NOAA US EPA AirNow

DBpediaNational Library of Medicine

US EPA SunWise

Tuesday, March 5, 13

Tuesday, March 5, 13

Tuesday, March 5, 13

From WikipediaFrom EPA

Open Street Map

Tuesday, March 5, 13

Tuesday, March 5, 13

We’ve Seen This Before

Tuesday, March 5, 13

HOW IT IS DONE TODAY ...

Tuesday, March 5, 13

Audience for EPA Data

• Middle school student doing a science project

• Concerned citizen worried about local pollution

• Environmental Science PhD from EPA

• Doctor from NIH writing a research paper

Tuesday, March 5, 13

How much mercury did Hanson Permanente Cement

release in 2004?

Tuesday, March 5, 13

Tuesday, March 5, 13

Web Portals

Tuesday, March 5, 13

Tuesday, March 5, 13

Tuesday, March 5, 13

Tuesday, March 5, 13

Tuesday, March 5, 13

Finding Hanson Permanente

Tuesday, March 5, 13

Finding Mercury Released in 2004

Tuesday, March 5, 13

Compliance Report

Tuesday, March 5, 13

Potential Audience

• Middle school student doing a science project

• Concerned citizen worried about local pollution

• Environmental Science PhD from EPA

• Doctor from NIH writing a research paper

XX

X

Tuesday, March 5, 13

Linked Data Approach

Tuesday, March 5, 13

Finding Hanson Permanente

Tuesday, March 5, 13

Finding Mercury Released in 20041

2

Tuesday, March 5, 13

TRI Report

Tuesday, March 5, 13

Data Reuse

Tuesday, March 5, 13

Potential Audience

• Middle school student doing a science project

• Concerned citizen worried about local pollution

• Environmental Science PhD from EPA

• Doctor from NIH writing a research paper

Tuesday, March 5, 13

Tuesday, March 5, 13

Tuesday, March 5, 13

Credits

David NewmanGartner: “Innovation Insight: Linked Data Drives Innovation Through Information-Sharing Network Effects” Published: 15 December 2011

David Wood, ed. Linking Government Data, Springer (2011) http://3roundstones.com/linking-government-data/

US Executive Branch

Digital Government Strategy: Building a 21st Century Platform to Better Serve the American People, http://www.whitehouse.gov/sites/default/files/omb/egov/digital-government/digital-government.html

W3C Linked Data Cookbook http://www.w3.org/2011/gld/wiki/Linked_Data_Cookbook

All other photos and images © 2010-2012 3 Round Stones, Inc. and released under a CC-by-sa licenseAll other photos and images © 2010-2012 3 Round Stones, Inc. and released under a CC-by-sa license

Tuesday, March 5, 13

This work is Copyright © 2011-2012 3 Round Stones Inc.It is licensed under the Creative Commons Attribution 3.0 Unported LicenseFull details at: http://creativecommons.org/licenses/by/3.0/

You are free:

to Share — to copy, distribute and transmit the work

to Remix — to adapt the work

Under the following conditions:Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).

Share Alike. If you alter, transform, or build upon this work, you may distribute the resulting work only under the same or similar license to this one.

Tuesday, March 5, 13