Webs of People, Webs of Data

Simon Price, University of Bristol Webs of People, Webs of Data Web 2.0 Live, Taunton, Nov 2006


Page 1: Webs of People, Webs of Data

Simon Price, University of Bristol

Webs of People, Webs of Data

Web 2.0 Live, Taunton, Nov 2006

Page 2: Webs of People, Webs of Data

Web 2.0

Page 3: Webs of People, Webs of Data

Web Applications (Web 1.5?)

Page 4: Webs of People, Webs of Data

Hybrid Web-Desktop (Web 1.6?)

Page 5: Webs of People, Webs of Data

Canonical Web 2.0

• Amazon
  – Customer Reviews
  – Amazon Recommends

• Google
  – PageRank™
  – Making money out of links
  – Google Mail, Maps, APIs, Desktop Search, ...

Page 6: Webs of People, Webs of Data

Web 2.0 Technology (nothing new)

• Minimum
  – CGI (e.g. Perl, PHP, Python, C/C++)
  – Database (e.g. MySQL, Postgres, Oracle)

• More recent additions
  – Java
  – XML
  – Web Services
  – AJAX
  – Ruby on Rails

Page 7: Webs of People, Webs of Data

Social Networks

A key ingredient in the Web 2.0 melting pot

Page 8: Webs of People, Webs of Data

Google PageRank™

• Sergey Brin and Lawrence Page (Stanford, 1995)

• Intuition behind PageRank:
  – Web is a network (graph) connected by links
  – A link is a "vote" for the destination page
  – Strength of vote is a fraction of the PageRank of the page casting the vote
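
The voting intuition above can be sketched as a short iterative calculation. This is a minimal illustration, not Google's implementation: the four-page graph is invented, and the damping factor of 0.85 and the even spreading of rank from pages with no outgoing links are assumptions that the slide does not mention.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Iteratively share each page's rank among the pages it links to.

    links maps a page name to the list of pages it links to.
    damping and iterations are assumed values, not taken from the slides.
    """
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}  # start with equal rank everywhere

    for _ in range(iterations):
        # Every page keeps a small baseline share of rank.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Each link is a "vote" worth a fraction of the voter's rank.
                vote = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += vote
            else:
                # Assumption: a page with no links spreads its rank evenly.
                vote = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += vote
        rank = new_rank
    return rank


# Hypothetical four-page web: D links to C, but nothing links to D.
toy_web = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
ranks = pagerank(toy_web)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

In this toy graph, page C collects votes from three pages and ends up with the highest rank, while D, which receives no votes, keeps only the baseline share.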

Page 9: Webs of People, Webs of Data

PageRank of a page is the probability of a random surfer arriving at that page after many clicks.

(By Markov Theory)
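
The random-surfer reading can be checked empirically with a small simulation. This is a sketch under stated assumptions: the graph is invented, the surfer jumps to a random page with an assumed probability of 0.15 (or whenever the current page has no links), and the seed is fixed only to make the run repeatable.

```python
import random


def simulate_surfer(links, clicks=100_000, teleport=0.15, seed=0):
    """Estimate how often a random surfer lands on each page.

    links maps every page to the list of pages it links to.
    teleport is an assumed probability of jumping to a random page.
    """
    rng = random.Random(seed)
    pages = sorted(links)
    visits = {page: 0 for page in pages}
    current = pages[0]

    for _ in range(clicks):
        targets = links[current]
        if not targets or rng.random() < teleport:
            current = rng.choice(pages)    # get bored, jump anywhere
        else:
            current = rng.choice(targets)  # follow one outgoing link
        visits[current] += 1

    # Visit frequency approximates the long-run probability of being on a page.
    return {page: visits[page] / clicks for page in pages}


# Hypothetical four-page web, with every page listed as a key.
toy_web = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["C"]}
freq = simulate_surfer(toy_web)
for page, share in sorted(freq.items()):
    print(page, round(share, 3))
```

With enough clicks the visit frequencies settle down, which is the practical meaning of the Markov-chain argument: the long-run distribution does not depend on where the surfer started.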

Page 10: Webs of People, Webs of Data

Newsgroup Mining

Work by Jonathan Roberts

Page 11: Webs of People, Webs of Data

Web Mining

www.theyrule.net

Page 12: Webs of People, Webs of Data

Link Discovery

www.theyrule.net

Page 13: Webs of People, Webs of Data
Page 14: Webs of People, Webs of Data

The Web of Data

Page 15: Webs of People, Webs of Data

Semantic Web

The Semantic Web is a graph-based knowledge representation of data, spanning the Web, traditional databases, the desktop and mobile devices.
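
To make "graph-based knowledge representation" concrete, data can be held as subject-predicate-object triples, where each triple is one labelled edge in the graph. The sketch below is illustrative only: every identifier (the `ex:` names and the `match` helper) is hypothetical, and real Semantic Web data would use full URIs and an RDF library.

```python
# Hypothetical example data: every name below is invented for illustration.
TRIPLES = {
    ("ex:alice", "ex:worksAt", "ex:exampleUniversity"),
    ("ex:alice", "ex:interestedIn", "ex:performance"),
    ("ex:bob", "ex:worksAt", "ex:exampleUniversity"),
    ("ex:bob", "ex:created", "ex:shortFilm"),
    ("ex:shortFilm", "ex:about", "ex:performance"),
}


def match(triples, subject=None, predicate=None, obj=None):
    """Return the triples that fit a pattern; None acts as a wildcard."""
    pattern = (subject, predicate, obj)
    return sorted(
        triple
        for triple in triples
        if all(want is None or want == got for want, got in zip(pattern, triple))
    )


# Who works at the example university?
colleagues = [s for s, _, _ in match(TRIPLES, predicate="ex:worksAt",
                                     obj="ex:exampleUniversity")]
print(colleagues)

# Everything connected to the concept "performance", whatever the source.
about_performance = match(TRIPLES, obj="ex:performance")
print(about_performance)
```

Because people, works and concepts all live in one graph, a single pattern can pull together facts that would otherwise sit in separate databases.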

Page 16: Webs of People, Webs of Data

Friend of a Friend (FOAF)

"The FOAF project is about creating a Web of machine-readable homepages describing people, the links between them and the things they create and do."

http://www.foaf-project.org/
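
A machine-readable homepage of this kind might look like the small RDF/XML document below, read here with Python's standard XML parser. The people and URIs are invented; the `foaf:Person`, `foaf:name` and `foaf:knows` terms come from the FOAF vocabulary. The parsing handles only this one simple layout, so it is a sketch rather than a general RDF reader.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
FOAF = "http://xmlns.com/foaf/0.1/"

# Hypothetical FOAF document: the people and example.org URIs are made up.
FOAF_DOC = """<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:foaf="http://xmlns.com/foaf/0.1/">
  <foaf:Person rdf:about="http://example.org/people#alice">
    <foaf:name>Alice Example</foaf:name>
    <foaf:knows rdf:resource="http://example.org/people#bob"/>
  </foaf:Person>
  <foaf:Person rdf:about="http://example.org/people#bob">
    <foaf:name>Bob Example</foaf:name>
  </foaf:Person>
</rdf:RDF>"""

root = ET.fromstring(FOAF_DOC)

people = {}  # person URI -> name
knows = []   # (person URI, URI of the person they know)

# ElementTree addresses namespaced tags and attributes as "{namespace}name".
for person in root.findall(f"{{{FOAF}}}Person"):
    uri = person.get(f"{{{RDF}}}about")
    people[uri] = person.findtext(f"{{{FOAF}}}name")
    for link in person.findall(f"{{{FOAF}}}knows"):
        knows.append((uri, link.get(f"{{{RDF}}}resource")))

for source, target in knows:
    print(people[source], "knows", people.get(target, target))
```

The `foaf:knows` links are what turn a set of separate homepages into a web of people that software can traverse.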

Page 17: Webs of People, Webs of Data

FOAF and Co-depiction

Page 18: Webs of People, Webs of Data

PARIP

• PARIP = Practice As Research In Performance
  – 5 year national project
  – Led by University of Bristol's Department of Drama: Theatre, Film, Television
  – Professor Baz Kershaw and Dr Angela Piccini

• PARIP Explorer
  – Innovative contacts and research database
  – Developed by ILRT
  – Semantic Web technology

Page 19: Webs of People, Webs of Data

PARIP - Data Fusion

• contact details

• research interests

• images

• interviews

• concepts

• questionnaire responses

• institutions

• projects

• …

Page 20: Webs of People, Webs of Data

PARIP - User Perspective

• Dual interface:
  – Text View: cross-database search engine
  – Map View: visual link discovery and browsing

Page 21: Webs of People, Webs of Data

PARIP - Technical Perspective

• Semantic Web: RDF/XML and FOAF

• Prolog running as a Web Service (WSDL+SOAP)

• SPARQL query interface for programmatic access

• XHTML AJAX client

• Visualisation via Flash
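
As an illustration of what programmatic access through a SPARQL interface can look like, the sketch below builds a query over FOAF data and encodes it as an HTTP GET request in the style of the SPARQL Protocol, which passes the query text in a `query` parameter. The endpoint URL and the query itself are hypothetical; they are not PARIP Explorer's actual address or schema, and no request is sent.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Hypothetical endpoint: a placeholder, not the real PARIP Explorer service.
ENDPOINT = "http://example.org/sparql"

# Illustrative query: list people and their interests using FOAF terms.
QUERY = """
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
SELECT ?name ?interest
WHERE {
  ?person a foaf:Person ;
          foaf:name ?name ;
          foaf:interest ?interest .
}
ORDER BY ?name
"""


def build_request_url(endpoint, query):
    """Encode the query text into the 'query' parameter of a GET URL."""
    return endpoint + "?" + urlencode({"query": query.strip()})


url = build_request_url(ENDPOINT, QUERY)
print(url)

# Decoding the URL again recovers the original query text unchanged.
decoded = parse_qs(urlsplit(url).query)["query"][0]
print(decoded == QUERY.strip())
```

A client such as an AJAX page or a script would send this URL to the endpoint and receive the matching rows back in a structured result format.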

Page 22: Webs of People, Webs of Data

Research Directions

Page 23: Webs of People, Webs of Data

Automated Data Fusion

Page 24: Webs of People, Webs of Data

Exabyte Scale Informatics

• 1 Exabyte = 10^18 bytes, i.e. 1,000,000,000,000,000,000 bytes

• 1 Exabyte is approximately everything ever:
  • written,
  • composed,
  • filmed,
  • painted
  • or in any other way 'recorded' by humans.

• Manual classification and retrieval is inadequate; machine learning and data mining are essential.
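
A little arithmetic gives a feel for the scale. The figures below other than the exabyte itself are assumptions chosen for illustration: a 1 terabyte disk (10^12 bytes) and a sustained read rate of 1 gigabyte per second.

```python
EXABYTE = 10**18    # bytes, as defined on the slide
TERABYTE = 10**12   # bytes; assumed disk size for illustration
GIGABYTE = 10**9    # bytes

# How many 1 TB disks are needed to hold one exabyte?
disks_needed = EXABYTE // TERABYTE
print(f"{disks_needed:,} disks of 1 TB each")

# How long would one machine take just to read it once?
read_rate = GIGABYTE                  # assumed: 1 GB per second, sustained
seconds = EXABYTE / read_rate
seconds_per_year = 365 * 24 * 60 * 60
years = seconds / seconds_per_year
print(f"about {years:.1f} years to read at 1 GB/s")
```

Under these assumptions a single pass over the data takes decades for one machine, before any classification is attempted, which is why automated methods are the only realistic option.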

Page 25: Webs of People, Webs of Data

Google on "Simon Price Bristol"

Contact details