Download - Open Source, Open Data

Transcript
Page 1: Open Source, Open Data

Open Source, Open DataKirrily RobertFlorida Linux Show, 2009

Page 2: Open Source, Open Data
Page 3: Open Source, Open Data

From Open Source to Open Data

Page 4: Open Source, Open Data

1993

Me in 1993 My Linux desktop looked like this

Page 5: Open Source, Open Data

1993

• I started using Linux in 1993

• I was very excited by it, even though it was quite primitive at the time

• Other people thought I was a little crazy

Page 6: Open Source, Open Data

Image: Wikipedia Image: Engadget

1999

Google’s servers in 1999Jar Jar in 1999

Page 7: Open Source, Open Data

1999

• By 1999 Linux + open source was starting to take off

• Companies using and building services on Linux etc.

• We were calling it “Open Source” - a more marketable term for Free Software

Page 8: Open Source, Open Data

Four Software Freedomshttp://www.gnu.org/philosophy/free-sw.html

• Freedom to run the program

• Freedom to study the program and modify it for your own use

• Freedom to redistribute verbatim copies

• Freedom to improve the program, and release your improvements

Page 9: Open Source, Open Data

Free Culture

• A similar movement

• Make cultural works freely available

• Mostly over the Internet

Page 10: Open Source, Open Data

Free Culture

Page 11: Open Source, Open Data

Free Culture

Page 12: Open Source, Open Data

Free Culture

Page 13: Open Source, Open Data

Free Culturehttp://wiki.freeculture.org/Free_Culture_Definition

• Freedom to use the work

• Freedom to study the work and to apply knowledge acquired from it

• Freedom to make and redistribute copies

• Freedom to make changes and improvements, and to distribute derivative works

Page 14: Open Source, Open Data
Page 15: Open Source, Open Data

Image: masternewmedia.org

Page 16: Open Source, Open Data

What is Open Data?

Page 17: Open Source, Open Data

Data

Page 18: Open Source, Open Data

Image: himmelskratzer @ Flickr

Page 19: Open Source, Open Data

What is data?

• Ones and zeroes (obviously)

• But also filing cabinets, research archives, and other offline resources

• It’s not OPEN data unless you can get at it

Page 20: Open Source, Open Data

Open Data Freedoms

• Freedom to use the data

• Freedom to study the data and modify it for your own use

• Freedom to make and share verbatim copies

• Freedom to improve the data and redistribute the results

Page 21: Open Source, Open Data

Data availability

• Digital

• Online

• Well formatted

Page 22: Open Source, Open Data

Open Data Projects

Page 23: Open Source, Open Data

public.resource.org

• Created 2007 by Carl Malamud

• “Making Government Information More Accessible”

Page 24: Open Source, Open Data

public.resource.org

• SEC EDGAR records

• Patents database

• Copyright database

• Congressional records

• Legal decisions

• Fedflix

Page 25: Open Source, Open Data

Data.gov

• Founded 2008

• “Increase public access to high value, machine readable datasets generated by the Executive Branch of the Federal Government.”

Page 26: Open Source, Open Data

OpenStreetMap

Page 27: Open Source, Open Data

Compare...

Page 28: Open Source, Open Data

OpenStreetMap

Page 29: Open Source, Open Data

Open Library Project

Page 30: Open Source, Open Data

• CD data

• Tracks, artists, releases...

• CC license

Page 31: Open Source, Open Data

Flickr

• Images

• Metadata• tags, timestamps, geolocations, etc.

• Range of CC licenses and permissive TOS

Page 32: Open Source, Open Data

Infochimps

• Large data sets

• Various licenses

• Tools for transformation

Page 33: Open Source, Open Data

• Open data about “everything”

• 8.5m concepts

• CC-BY license

• API and data dumps

Page 34: Open Source, Open Data

2,416,683 books

Page 35: Open Source, Open Data

16,608 ships

Page 36: Open Source, Open Data

488 cheeses

Page 37: Open Source, Open Data

Structured data { "name": "Asiago cheese" "id": "/en/asiago_cheese", "region": [{ "id": "/en/asiago", "name": "Asiago", "type" : "/location/location"

}], "source_of_milk": [{ "id": "/en/cattle", "name": "Cow", "type" : "/biology/organism_classification" }] }

Page 38: Open Source, Open Data

Open Data Apps

Page 39: Open Source, Open Data
Page 40: Open Source, Open Data
Page 41: Open Source, Open Data

• Apps for America competition

• Open source and open data

• Round 1: various data sources

• Round 2: Data.gov

Page 42: Open Source, Open Data

Legistalker

Page 43: Open Source, Open Data

Filibusted

Page 44: Open Source, Open Data

Where the money goes

Page 45: Open Source, Open Data

Open Source for Open Data

Page 46: Open Source, Open Data

What can open source do?

Input

Processing

Output

Scrape

Munge

Visualise

Page 47: Open Source, Open Data

Scraping data

• APIs• XML, RSS, JSON...

• Downloadable data sets• XML, Excel, CSV, triple dumps...

• Beautiful Soup (Python)• http://www.crummy.com/software/

Page 48: Open Source, Open Data

Munging data

• Perl• http://perl.org/

• R (statistical analysis)• http://r-project.org/

• Hadoop (parallel data processing)• http://hadoop.apache.org/

Page 49: Open Source, Open Data

Visualisations

• MIT Simile• http://simile.mit.edu/

• Processing• http://processing.org/

Page 50: Open Source, Open Data
Page 51: Open Source, Open Data

http://itoworld.com

Page 52: Open Source, Open Data

Semantic Web

• Describe meaning, not markup

• Triples: subject, predicate, object

• Expression: RDF

Page 53: Open Source, Open Data

Linked Open Data

Page 54: Open Source, Open Data

Semantic web tools

• Triple stores• Sesame, BigData, Virtuoso...

• Libraries• RDFLib (Python), Redland RDf (librdf)...

Page 55: Open Source, Open Data

Freebase Acre

Page 56: Open Source, Open Data

Open source for open data

• Low barrier to entry

• Hooks in to Freebase data

• Share and clone apps

• Apps are BSD licensed

Page 57: Open Source, Open Data

FMDB

Page 58: Open Source, Open Data

Gendered names app

Page 59: Open Source, Open Data

Query editor

Page 60: Open Source, Open Data

Clone!

Page 61: Open Source, Open Data

http://freebase.com/developer

Page 62: Open Source, Open Data

Where next?

Page 63: Open Source, Open Data

Open Data: Issues

• License clarity

• Govt + Corporate acceptance

• Developer literacy

• What do we DO with it?

Page 64: Open Source, Open Data

What do we do with it?

Page 65: Open Source, Open Data

What do we do with it?

• 10 years ago we were asking the same questions of Open Source

• With Open Data, we are just starting to realise its potential

• Please join us!

Page 66: Open Source, Open Data

Keep in touch

• Email• [email protected]

• Freebase blog• http://blog.freebase.com/

• Twitter• @fbase