20130204 graph to-pacer-xml

16
GraphTO February 2013, Mozilla Toronto David Colebatch & Darrick Wiebe [email protected]

description

Intro slides; Detailed talk from Darrick at https://gist.github.com/4710452

Transcript of 20130204 graph to-pacer-xml

Page 1: 20130204 graph to-pacer-xml

GraphTOFebruary 2013, Mozilla Toronto

David Colebatch & Darrick Wiebe [email protected]

Page 2: 20130204 graph to-pacer-xml

•Who We Are

•Intro to GraphDB

•Intro to Patent-Grant Data

•Graph Concepts

•Pacer::Xml

Agenda

Sponsored By:

Page 3: 20130204 graph to-pacer-xml

¿por qué?

•Data Set Size

•Connectivity of Data

•Semi-structure

•Evolution of SOA and REST

Page 4: 20130204 graph to-pacer-xml

The Zone of SQL Adequacy

Data complexity

Perf

orm

an

ce

SQL database

Requirement of application

Salary List

ERP

CRM

Network / Cloud

Management

Social

MDM

Geo

Page 5: 20130204 graph to-pacer-xml

How?•Nodes / Vertices

•Relationships / Edges

Page 6: 20130204 graph to-pacer-xml

Relational Model vs. Graph

Each of these modelsexpresses the same thing

Person* Friend*Person-Friend

Page 7: 20130204 graph to-pacer-xml

Database # persons query time

MySQL 1,000 2,000 ms

Neo4j 1,000 2 ms

Neo4j 1,000,000 2 ms

Graph db performance

๏ a sample social graph

•with ~1,000 persons

๏ average 50 friends per person

๏ pathExists(a,b) limited to depth 4

๏ caches warmed up to eliminate disk I/O

Page 8: 20130204 graph to-pacer-xml

Different Visualization

Page 9: 20130204 graph to-pacer-xml
Page 10: 20130204 graph to-pacer-xml
Page 11: 20130204 graph to-pacer-xml
Page 12: 20130204 graph to-pacer-xml
Page 13: 20130204 graph to-pacer-xml

Query Languages

•Pacer - gem install pacer

•Cypher

•SPARQL - if you grok RDF already

Page 14: 20130204 graph to-pacer-xml

US PTO Data

•Patent Grant Data in XML

•bi-weekly chunks

•Pacer::Xml has handy loader as an example:

jruby-1.7.0 > g = PacerXml::Sample.load_100Downloading a sample xml file from...

Page 15: 20130204 graph to-pacer-xml

001> PacerXml

Importing XML into a graph? What do you do next?

Page 16: 20130204 graph to-pacer-xml

Resources

https://github.com/xnlogic/pacer-xml

https://github.com/pangloss/pacer

http://neo4j.org/

http://tinkerpop.com/