Facilitating Web Science Collaboration through Semantic Markup

24
The Web Observatory Extension: Facilitating Web Science Collaboration through Semantic Markup Dominic DiFranzo, John S. Erickson, Marie Joan Kristine T. Gloria, Joanne S. Luciano, Deborah McGuinness, James Hendler The Tetherless World Constellation & Institute for Data Exploration and Applications Rensselaer Polytechnic Institute, Troy, NY

description

These are the slides that accompanied the paper "Dominic DiFranzo, John S. Erickson, Marie Joan Kristine T. Gloria, Joanne S. Luciano, Deborah McGuinness, & James Hendler, The Web Observatory Extension: Facilitating Web Science Collaboration through Semantic Markup, Proc. WWW 2014 (Web Science Track), Seoul, Korea, 2014." They describe an extension to schema.org that can be used for sharing Web-related datasets and projects.

Transcript of Facilitating Web Science Collaboration through Semantic Markup

Page 1: Facilitating Web Science Collaboration through Semantic Markup

The Web Observatory Extension: Facilitating Web Science

Collaboration through Semantic Markup"

Dominic DiFranzo, John S. Erickson, Marie Joan Kristine T. Gloria, Joanne S. Luciano, Deborah McGuinness, James Hendler

The Tetherless World Constellation & Institute for Data Exploration and Applications

Rensselaer Polytechnic Institute, Troy, NY

Page 2: Facilitating Web Science Collaboration through Semantic Markup

Introduction

6

•  Web Science involves using and producing large amounts of heterogeneous data about and from the web"

"•  As we (Web Science researchers) strive to collaborate and work

together, we must find ways to share, link and reuse each other’s data and tools."

"•  To do this, we are striving to build “Web Observatories” – a

common infrastructure for enhancing this sharing, and to extend it to also include tools, research project results(papers & experiments), etc."

Tiropanis, T., Hall, W., Shadbolt, N., DeRoure, D., Contractor, N. and Hendler, J., The Web Science Observatory, IEEE Intelligent Systems, March/April, 2013.

Page 3: Facilitating Web Science Collaboration through Semantic Markup

Web Observatory Concept

WO Portal

Engaging communities with analytics Publication of catalogues (schema.org)

Access with/without credentials Searching and Indexing

Distributed Queries Plugged in Datastores and App Servers

Harvesting Dataset enrichment/curation

Dataset management Provenance

Optimisation

WO Datastores

Hosting of analytic apps Hosting of visualisation apps Monitoring dependency on

datasets Monitoring dependency on tools

Explicit links between tools & datasets used

WO Apps WO Portal

WO Apps WO Datastores

WO Portal

WO Apps WO Datastores

Links to resources in other Web Observatories

Thanassis Tiropanis – University of Southampton

Page 4: Facilitating Web Science Collaboration through Semantic Markup

RPI Observatory Themes

Science Data Observatory Health & Life Sciences Observatory

Open Government Observatory Social Spaces Observatory

Example: Indian Election Twitter Dataset

Example: Deep Carbon Obs. Datasets

Example: Cancer Treatment Datasets

Example: Int’l Open Govt Metadata

Page 5: Facilitating Web Science Collaboration through Semantic Markup

Data use (Social Spaces)

6

Page 6: Facilitating Web Science Collaboration through Semantic Markup

Data use (Open Govt Data)

6

Page 7: Facilitating Web Science Collaboration through Semantic Markup

Problem: putting these together across laboratories (and fields)

6

Page 8: Facilitating Web Science Collaboration through Semantic Markup

Schema.org

6

•  An initiative launched by the leading search engine providers to create and support a common set of schemas for structured data markup on Web pages.

•  These vocabularies enable the metadata to be more machine readable, allowing for better search, discover and display this information

Page 9: Facilitating Web Science Collaboration through Semantic Markup

Example RDFA Lite

6

<div http://schema.org/ > <h1 >Avatar</h1> <span>Director: <span ">James Cameron</span> </span> <span >Science fiction</span> <a href="../movies/avatar-theatrical-trailer.html"

>Trailer</a> </div>

Page 10: Facilitating Web Science Collaboration through Semantic Markup

Schema.org in action

6

Page 11: Facilitating Web Science Collaboration through Semantic Markup

Schema.org in action

6 http://datasets.schema-labs.appspot.com/

Page 12: Facilitating Web Science Collaboration through Semantic Markup

Goals

6

•  Describe Web Observatories •  Interconnect Web Observatories •  Facilitate discovery of tools, datasets,

and projects for researchers

Page 13: Facilitating Web Science Collaboration through Semantic Markup

Overview

6

Web Observatory

Project

Dataset Tool

Page 14: Facilitating Web Science Collaboration through Semantic Markup

Without Schema.org:

Search

Page 15: Facilitating Web Science Collaboration through Semantic Markup

6

Web Observatory

Project

Dataset Tool

Web Observatory

Project

Dataset Tool

Web Observatory

Project

Dataset Tool

Search

With Schema.org:

Page 16: Facilitating Web Science Collaboration through Semantic Markup

Schema.org vocabulary extension

Web Observatory Class"

Page 17: Facilitating Web Science Collaboration through Semantic Markup

Schema.org vocabulary extension

Web Observatory Project"

Page 18: Facilitating Web Science Collaboration through Semantic Markup

Schema.org vocabulary extension

Web Observatory Dataset"

Page 19: Facilitating Web Science Collaboration through Semantic Markup

Schema.org vocabulary extension

Web Observatory Tool"

Page 20: Facilitating Web Science Collaboration through Semantic Markup

Schema.org vocabulary demo

Page 21: Facilitating Web Science Collaboration through Semantic Markup

Schema.org vocabulary demo

Social Spaces WO

WO Project: Cosmic

WO Project:

First Responder

Page 22: Facilitating Web Science Collaboration through Semantic Markup

Schema.org vocabulary demo

Health/Life Science WO

WO Project: Mobile Health

WO Project: Health Data

Challenge

WO Dataset: Health Data Challenge

Page 23: Facilitating Web Science Collaboration through Semantic Markup

Conclusions

Science Data Observatory

Social Spaces Observatory

•  Integrating data on the Web, in general, is growing •  Schema.org is a data embedding model

showing great success •  Schema.org/Dataset became official April

2013 •  Search Engine tools are increasingly making

used of embedded markup •  Web Observatory extension aimed at use in

(Web) scientific community •  Also being used by AGU and DCO scientific

Page 24: Facilitating Web Science Collaboration through Semantic Markup

Future Work

Science Data Observatory

Social Spaces Observatory

•  Further extend the vocabulary to fit more web observatories •  Subcommunities can extend terminologies

•  Build better tools to use and embed schema.org vocabulary into web observatories •  Integrate into “telescope” toolbox

•  Build tools to make use of schema.org WO metadata (search engines, crawlers, etc) •  Google Domain Search underway