The Future of the Journal

46
The Future of the Journal Anita de Waard , [email protected] Disruptive Technologies Director, Elsevier Labs June 3, 2010

description

Updated version of slides ESWC 2010 AI Mashup session

Transcript of The Future of the Journal

Page 1: The Future of the Journal

The Future of the Journal

Anita de Waard , [email protected] Disruptive Technologies Director, Elsevier Labs

June 3, 2010

Page 2: The Future of the Journal

2

Science is made of information...

Page 3: The Future of the Journal

2

Science is made of information...

...that gets created...

Page 4: The Future of the Journal

2

Science is made of information...

...that gets created... ... and destroyed.

Page 5: The Future of the Journal

3

What is the problem?

Page 6: The Future of the Journal

3

What is the problem?

1. Researchers can’t keep track of their data.

Page 7: The Future of the Journal

3

What is the problem?

1. Researchers can’t keep track of their data.

2. Data is not stored in a way that is easy for authors.

Page 8: The Future of the Journal

3

What is the problem?

1. Researchers can’t keep track of their data.

2. Data is not stored in a way that is easy for authors.

3. For readers, article text is not linked to the underlying data.

Page 9: The Future of the Journal

The Vision

4

Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan

Page 10: The Future of the Journal

The Vision

4

Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan

1. Research: Each item in the system has metadata (including provenance) and relations to other data items added to it.

metadata

metadata

metadata

metadata

metadata

Page 11: The Future of the Journal

The Vision

4

Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan

1. Research: Each item in the system has metadata (including provenance) and relations to other data items added to it.

metadata

metadata

metadata

metadata

metadata

2. Workflow: All data items created in the lab are added to a (lab-owned) workflow system.

Page 12: The Future of the Journal

The Vision

4

Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan

1. Research: Each item in the system has metadata (including provenance) and relations to other data items added to it.

metadata

metadata

metadata

metadata

metadata

2. Workflow: All data items created in the lab are added to a (lab-owned) workflow system.

Rats were subjected to two grueling tests(click on fig 2 to see underlying data). These results suggest that the neurological pain pro-

3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document.

Page 13: The Future of the Journal

The Vision

4

Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan

1. Research: Each item in the system has metadata (including provenance) and relations to other data items added to it.

metadata

metadata

metadata

metadata

metadata

2. Workflow: All data items created in the lab are added to a (lab-owned) workflow system.

4. Editing and review: Once the co-authors agree, the paper is ‘exposed’ to the editors, who in turn expose it to reviewers. Reports are stored in the authoring/editing system, the paper gets updated, until it is validated.

Review

EditRevise

Rats were subjected to two grueling tests(click on fig 2 to see underlying data). These results suggest that the neurological pain pro-

3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document.

Page 14: The Future of the Journal

The Vision

4

Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan

1. Research: Each item in the system has metadata (including provenance) and relations to other data items added to it.

metadata

metadata

metadata

metadata

metadata

5. Publishing and distribution: When a paper is published, a collection of validated information is exposed to the world. It remains connected to its related data item, and its heritage can be traced.

2. Workflow: All data items created in the lab are added to a (lab-owned) workflow system.

4. Editing and review: Once the co-authors agree, the paper is ‘exposed’ to the editors, who in turn expose it to reviewers. Reports are stored in the authoring/editing system, the paper gets updated, until it is validated.

Review

EditRevise

Rats were subjected to two grueling tests(click on fig 2 to see underlying data). These results suggest that the neurological pain pro-

3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document.

Page 15: The Future of the Journal

Some other publisher

6. User applications: distributed applications run on this ‘exposed data’ universe.

The Vision

4

Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan

1. Research: Each item in the system has metadata (including provenance) and relations to other data items added to it.

metadata

metadata

metadata

metadata

metadata

5. Publishing and distribution: When a paper is published, a collection of validated information is exposed to the world. It remains connected to its related data item, and its heritage can be traced.

2. Workflow: All data items created in the lab are added to a (lab-owned) workflow system.

4. Editing and review: Once the co-authors agree, the paper is ‘exposed’ to the editors, who in turn expose it to reviewers. Reports are stored in the authoring/editing system, the paper gets updated, until it is validated.

Review

EditRevise

Rats were subjected to two grueling tests(click on fig 2 to see underlying data). These results suggest that the neurological pain pro-

3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document.

Page 16: The Future of the Journal

5

What is needed to get there?

tool builders

standards bodies

institutes, funding bodies, individuals

Page 17: The Future of the Journal

5

What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all

science, are scalable, safe, and user-friendly tool builders

standards bodies

institutes, funding bodies, individuals

Page 18: The Future of the Journal

5

What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all

science, are scalable, safe, and user-friendlyB. Authoring and reviewing tools: which enable use of rich

and provenance-tracked elements

tool builders

standards bodies

institutes, funding bodies, individuals

Page 19: The Future of the Journal

5

What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all

science, are scalable, safe, and user-friendlyB. Authoring and reviewing tools: which enable use of rich

and provenance-tracked elementsC. Metadata standards: Standards that allow interoperable

exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights

tool builders

standards bodies

institutes, funding bodies, individuals

Page 20: The Future of the Journal

5

What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all

science, are scalable, safe, and user-friendlyB. Authoring and reviewing tools: which enable use of rich

and provenance-tracked elementsC. Metadata standards: Standards that allow interoperable

exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights

D. Social change: Scientists need to realize they should annotate their work

tool builders

standards bodies

institutes, funding bodies, individuals

Page 21: The Future of the Journal

5

What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all

science, are scalable, safe, and user-friendlyB. Authoring and reviewing tools: which enable use of rich

and provenance-tracked elementsC. Metadata standards: Standards that allow interoperable

exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights

D. Social change: Scientists need to realize they should annotate their work

E. Semantic/Linked Data XML repositories.

tool builders

standards bodies

institutes, funding bodies, individuals

Page 22: The Future of the Journal

5

What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all

science, are scalable, safe, and user-friendlyB. Authoring and reviewing tools: which enable use of rich

and provenance-tracked elementsC. Metadata standards: Standards that allow interoperable

exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights

D. Social change: Scientists need to realize they should annotate their work

E. Semantic/Linked Data XML repositories. F. Publishing systems that run application servers.

tool builders

standards bodies

institutes, funding bodies, individuals

Page 23: The Future of the Journal

5

What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all

science, are scalable, safe, and user-friendlyB. Authoring and reviewing tools: which enable use of rich

and provenance-tracked elementsC. Metadata standards: Standards that allow interoperable

exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights

D. Social change: Scientists need to realize they should annotate their work

E. Semantic/Linked Data XML repositories. F. Publishing systems that run application servers.

tool builders

standards bodies

institutes, funding bodies, individualspublishers

Page 24: The Future of the Journal

5

What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all

science, are scalable, safe, and user-friendlyB. Authoring and reviewing tools: which enable use of rich

and provenance-tracked elementsC. Metadata standards: Standards that allow interoperable

exchange of information on any knowledge item created in a lab, including provenance and privacy/IPR rights

D. Social change: Scientists need to realize they should annotate their work

E. Semantic/Linked Data XML repositories. F. Publishing systems that run application servers.

tool builders

standards bodies

institutes, funding bodies, individualspublisherspublishers

Page 25: The Future of the Journal

6

Workflow tools are scaling up!

Page 26: The Future of the Journal

6

Workflow tools are scaling up!

http://MyExperiment.org

Page 27: The Future of the Journal

6

Workflow tools are scaling up!

http://MyExperiment.org

http://VisTrails.org

Page 28: The Future of the Journal

6

Workflow tools are scaling up!

http://wings.isi.edu/

http://MyExperiment.org

http://VisTrails.org

Page 29: The Future of the Journal

7

Linked Data for Elsevier

10

Page 30: The Future of the Journal

7

Linked Data for Elsevier

<ce:section id=#123>

10

Page 31: The Future of the Journal

7

Linked Data for Elsevier

<ce:section id=#123> mice like cheesethis says

10

Page 32: The Future of the Journal

7

Linked Data for Elsevier

<ce:section id=#123>

said @anita on May 31 2010

mice like cheesethis says

10

Page 33: The Future of the Journal

7

but we all know she was jetlagged then

Linked Data for Elsevier

<ce:section id=#123>

said @anita on May 31 2010

mice like cheesethis says

10

Page 34: The Future of the Journal

7

but we all know she was jetlagged then

Linked Data for Elsevier

<ce:section id=#123>

said @anita on May 31 2010

immutable, $$, proprietary

mice like cheesethis says

10

Page 35: The Future of the Journal

7

dynamic, personal, task-driven, - open?

but we all know she was jetlagged then

Linked Data for Elsevier

<ce:section id=#123>

said @anita on May 31 2010

immutable, $$, proprietary

mice like cheesethis says

10

Page 36: The Future of the Journal

8

Semantic annotation grid

11

Page 37: The Future of the Journal

8

Semantic annotation grid

11

Page 38: The Future of the Journal

8

Semantic annotation grid

document

claim

triple

entity

collectionGranularity

11

Page 39: The Future of the Journal

8

Semantic annotation grid

document

claim

triple

entity

collectionGranularity

reader/data miningtypesetter/productionauthor/editorMoment

measure

11

Page 40: The Future of the Journal

8

Semantic annotation grid

automated

manual

semi-automated

Means

document

claim

triple

entity

collectionGranularity

reader/data miningtypesetter/productionauthor/editorMoment

measure

11

Page 41: The Future of the Journal

8

Semantic annotation grid

Automated Copy Editing

automated

manual

semi-automated

Means

document

claim

triple

entity

collectionGranularity

reader/data miningtypesetter/productionauthor/editorMoment

measure

11

Page 42: The Future of the Journal

8

Semantic annotation grid

Automated Copy Editing

Reflect

automated

manual

semi-automated

Means

document

claim

triple

entity

collectionGranularity

reader/data miningtypesetter/productionauthor/editorMoment

measure

11

Page 43: The Future of the Journal

9

.XMP RDF in all our PDFs: DC + PRISM

12

Page 44: The Future of the Journal

Application server: an ecosystem open to accelerate science

Page 45: The Future of the Journal

Application server: an app-driven ecosystem

Page 46: The Future of the Journal

Application server: an example