©John Womersley/Keith Jeffery/STFC PARSAlliance Conference 20071115 1 Developing Tomorrow’s Infrastructure for Science John Womersley Director, Science Strategy Science and Technology Facilities Council presented by: Keith Jeffery Director, IT & International Strategy Science and Technology Facilities Council

Page 1

Developing Tomorrow’s Infrastructure for Science

John Womersley
Director, Science Strategy
Science and Technology Facilities Council

presented by: Keith Jeffery
Director, IT & International Strategy
Science and Technology Facilities Council

Page 2

Overview

1. Some STFC Science

2. Tomorrow’s Digital Infrastructure for Science

3. Supporting the Research Lifecycle

4. Some Policy Frameworks

5. Conclusion

Page 3

What is STFC?

The Science and Technology Facilities Council (UK)

Created on April 1, 2007

It is responsible for

– fundamental research in particle physics, nuclear physics, astronomy, space

– major UK facilities for the physical and life sciences: synchrotrons, light sources, lasers, neutrons

– national laboratories at RAL, Daresbury, UKATC

– international science projects: CERN, ESO, ESA, ILL, ESRF…

Over 2000 staff and an annual budget of over £700M

Page 4

The Science we Address

Some examples

– Why is there a universe?
– What is the origin of mass?
– Was there ever life on Mars?
– How are the chemical elements created?
– How can we design better treatments for cancer?
– How do cells work?
– How can we create new materials to store energy?

Page 5

STFC Facilities

[Diagram labels: beam, sample, imaging detector, data, computing/analysis/modelling, knowledge]

Neutrons and photons provide complementary views of matter:

Photons “see” electric charge – high atomic number nuclei

Neutrons “see” nucleons – especially hydrogen atoms

Page 6

Some STFC Projects

ESA centre

ISIS TS2 phase 3

Diamond phase 3

Sapphire

Materials Innovation Institute

Page 7

Some STFC Projects

ESRF upgrade

4GLS

Hartree Centre: Computational Science

Page 8

Some STFC Projects

HIPER

Future neutron sources: ESS/MW neutron source, ILL 20/20 upgrade

ELI

DIPOLE laser

Page 9

Some STFC Projects

European ELT

SKA

Next generation gravitational wave observatory

FAIR

Page 10

Some STFC Projects

Neutrino factory

International Linear Collider

LHC upgrades

Underground science: neutrinos, dark matter

Page 11

XFEL

Project “launched” on 5th June
– This means DESY is now authorised to spend

XFEL GmbH to be set up by end of year

Our goal is to maximise our in-kind contributions within the £30M already allocated in LFCF
– Pixel detector, streak camera…

Page 12

Part 2

Tomorrow’s Digital Infrastructure for Science

Page 13

The 7 C’s

– Creation
– Collection
– Capacity
– Computation
– Curation
– Collaboration
– Communication

Page 14

It’s all about scale

Creation: Examining the detector arrays on the MAPS spectrometer at ISIS

Page 15

It’s all about scale

Collection: An ATSR image of Sicily showing the Mount Etna eruption, taken 24 July 2001

Page 16

It’s all about scale

Capacity: e.g. at CCLRC, 20 PB by 2010

[Chart: Estimated data storage at CCLRC to 2010. X axis: year (2003-04, 2005-06, 2007-8, 2009-10). Y axis: volume (TB), 0 to 20,000. Series: CSE, BADC, E-SCI, Diamond, PP, External, Total (TB), Cum Total (TB)]

1 PB = 10^15 bytes

– Billions of floppies
– Millions of CDs
– Thousands of PCs (today’s)
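
The order-of-magnitude comparisons on this slide can be checked with a little arithmetic. The sketch below is illustrative only: the petabyte definition and the 20 PB figure come from the slide, while the media capacities (a nominal 1.44 MB floppy, a 700 MB CD, and a hypothetical 250 GB PC disk) are assumptions chosen for the example.

```python
PB = 10**15  # bytes per petabyte, as defined on the slide

# Assumed media capacities in bytes (not stated on the slide).
FLOPPY_BYTES = 1_440_000       # nominal "1.44 MB" floppy disk
CD_BYTES = 700 * 10**6         # 700 MB CD
PC_DISK_BYTES = 250 * 10**9    # hypothetical 2007-era PC hard disk


def units_needed(total_bytes, unit_bytes):
    """Number of whole units required to hold total_bytes (ceiling division)."""
    return -(-total_bytes // unit_bytes)


def equivalents(petabytes):
    """Express a volume in petabytes as counts of floppies, CDs and PC disks."""
    total = petabytes * PB
    return {
        "floppies": units_needed(total, FLOPPY_BYTES),
        "cds": units_needed(total, CD_BYTES),
        "pcs": units_needed(total, PC_DISK_BYTES),
    }


if __name__ == "__main__":
    for name, count in equivalents(20).items():
        print(f"20 PB is roughly {count:,} {name}")
```

Under these assumptions, 20 PB comes to about 13.9 billion floppies and 28.6 million CDs, in line with the "billions" and "millions" on the slide. The PC count (80,000 here) depends entirely on the assumed disk size.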

Page 17

It’s all about scale

Computation: 3-D rabbit heart MRI rendered at 512 x 512 x 1400 using 12 GPUs

Data needs interpretation and analysis

Picture of heart
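
To give a feel for the size of the volume quoted above, the following sketch counts the voxels and shows one way the slices could be divided among the GPUs. The dimensions and the GPU count come from the slide. The 16-bit sample size and the even slab split are assumptions for illustration, since the slide does not say how the data was stored or how the rendering work was distributed.

```python
WIDTH, HEIGHT, SLICES = 512, 512, 1400  # volume dimensions from the slide
GPUS = 12                               # GPU count from the slide
BYTES_PER_VOXEL = 2                     # assumption: 16-bit samples

voxels = WIDTH * HEIGHT * SLICES
volume_bytes = voxels * BYTES_PER_VOXEL


def split_slices(total_slices, workers):
    """Divide slices into near-equal contiguous slabs, one per worker.

    Hypothetical partitioning scheme: the first (total_slices % workers)
    workers receive one extra slice each.
    """
    base, extra = divmod(total_slices, workers)
    return [base + 1 if i < extra else base for i in range(workers)]


slabs = split_slices(SLICES, GPUS)

if __name__ == "__main__":
    print(f"voxels: {voxels:,}")
    print(f"size at {BYTES_PER_VOXEL} bytes/voxel: {volume_bytes:,} bytes")
    print(f"slices per GPU: {slabs}")
```

The volume holds about 367 million voxels. With the assumed split, eight GPUs would take 117 slices each and four would take 116.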

Page 18

It’s all about scale

Curation: Some CCLRC based Repositories

– The Atlas Datastore

– The British Atmospheric Data Centre

– The CCLRC Data Portal

– The CCLRC Publications Archive

– The CCPs (Collaborative Computational Projects)

– The Chemical Database Service

– The Digital Curation Centre

– The EUROPRACTICE Software service

– The HPCx Supercomputer

– The JISCmail service

– The NERC Datagrid

– The NERC Earth Observation Data Centre

– The Starlink Software suite

– The UK Grid Support Centre

– The UK Grid for Particle Physics Tier 1A

– The World Data Centre for Solar-Terrestrial Physics

Atlas Datastore Tape Robot

Page 19

It’s all about scale

Collaboration: Barrel toroid magnet and detector module from ATLAS at CERN

ATLAS: 2000 scientists, 150 universities, 30 countries

Page 20

It’s all about scale

Communication: “The web has changed everything...”

Technology enables:
– access to everything: distributed, searchable information sources

Interlinking enables:
– revalidation of results: ‘repeat experiment’

Discovery enables:
– new knowledge from old

Archiving enables:
– recording unique events: Antarctic environmental data

CCLRC’s “e-pubs” Institutional Repository has records of 20,000 publications spanning 20 years

Page 21

Part 3

Supporting the Research Lifecycle

Page 22

The Body of Knowledge

The Government Process

The Research Process

Aggregation of Knowledge lies at the heart of the innovation lifecycle

Enabling Knowledge Creation

Enabling Wealth Creation

Quality Assessment

Strategic Direction

Improved Quality of Life

Improved Understanding

The Innovation Lifecycle

Page 23

The Body of Knowledge

The Information Infrastructure

Creation

Archival

Access

Storage, Compute, Network

Services

Curation

the researcher acts through ingest and access

Virtual Research Environment

the researcher shouldn’t have to worry about the information infrastructure

Information Infrastructure

Page 24

Current View

Facility 1: Raw Data, Data Analysis, Analysed Data, Publication Data, Publications

Facility 2: Raw Data, Data Analysis, Analysed Data, Publication Data, Publications

Facility 3: Raw Data, Data Analysis, Analysed Data, Publication Data, Publications

Distinct Infrastructures / Distinct User Experiences

Page 25

Future View

Facility 1: Raw Data, Data Analysis, Analysed Data, Publication Data, Publications

Facility 2: Raw Data, Data Analysis, Analysed Data, Publication Data, Publications

Facility 3: Raw Data, Data Analysis, Analysed Data, Publication Data, Publications

Common infrastructure components:

– Capacity Storage
– Publications Repositories
– Standards/Converters
– Data Repositories
– Raw Data Catalogue
– Data Analysis
– Analysed Data Catalogue
– Publication Data Catalogue
– Publications Catalogue

Common Infrastructure / Common User Experience
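
One way to picture the shift from per-facility pipelines to shared catalogues is a single index that every facility registers into and every user queries in the same way. The sketch below is purely illustrative: the class names, fields and identifiers are hypothetical and do not describe any actual STFC system.

```python
from dataclasses import dataclass, field

# Lifecycle stages named on the slide, used here as catalogue categories.
STAGES = ("raw_data", "analysed_data", "publication_data", "publications")


@dataclass(frozen=True)
class CatalogueEntry:
    """A hypothetical catalogue record; all fields are illustrative."""
    facility: str
    stage: str
    identifier: str
    title: str


@dataclass
class CommonCatalogue:
    """A single catalogue shared by all facilities (sketch only)."""
    entries: list = field(default_factory=list)

    def register(self, entry):
        # Reject stages that are not part of the agreed common vocabulary.
        if entry.stage not in STAGES:
            raise ValueError(f"unknown stage: {entry.stage}")
        self.entries.append(entry)

    def search(self, stage=None, facility=None):
        # The same query interface applies whichever facility holds the data.
        return [
            e for e in self.entries
            if (stage is None or e.stage == stage)
            and (facility is None or e.facility == facility)
        ]


catalogue = CommonCatalogue()
catalogue.register(CatalogueEntry("Facility 1", "raw_data", "f1-0001", "Example run A"))
catalogue.register(CatalogueEntry("Facility 2", "raw_data", "f2-0001", "Example run B"))
catalogue.register(CatalogueEntry("Facility 2", "publications", "f2-p001", "Example paper"))

# One query returns raw data from every facility.
raw_everywhere = catalogue.search(stage="raw_data")
```

The point of the design is that the stage vocabulary and the query interface are agreed once, so a user sees the same experience across facilities.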

Page 26

Part 4

Some Policy Frameworks

Page 27

Some Policy Frameworks

– UK Research Councils’ initiative on access to research outputs: 2005 and 2006 statements of principles

– OECD Guidelines on Access to Research Data: 2004 Declaration, 2007 Guidelines

– UK Office of Science and Innovation Report (2006): Developing the UK’s e-infrastructure for science and innovation

Page 28

RCUK Policy (2005, 2006)

Four principles:

Ideas and knowledge derived from publicly-funded research are made available and accessible for public use, interrogation, and scrutiny, as widely, rapidly, and effectively as practicable

Effective mechanisms are in place to ensure that published research outputs are subject to rigorous quality assurance, through peer review

The models and mechanisms for publication and access to research results are both efficient and cost-effective in the use of public funds

The outputs from current and future research can be preserved and remain accessible not only for the next few years but for future generations

Page 29

OECD Recommendation (2006)

OECD Recommendation on Access to research data from public funding

13 principles:

A – Openness: Openness means access on equal terms for the international research community at the lowest possible cost, ....

B – Flexibility, C – Transparency, D – Legal conformity, E – Protection of intellectual property, F – Formal responsibility, G – Professionalism

H – Interoperability: Technological and semantic interoperability is a key consideration in enabling and promoting international and interdisciplinary access to and use of research data. ...

I – Quality, J – Security, K – Efficiency, L – Accountability

M – Sustainability: ... taking administrative responsibility for the measures to guarantee permanent access to data that have been determined to require long-term retention.

http://webdomino1.oecd.org/horizontal/oecdacts.nsf/Display/3A5FB1397B5ADFB7C12572980053C9D3?OpenDocument

Page 30

OSI e-Infrastructure Steering Group

“Developing the UK’s e-infrastructure for science and innovation”

– Cross-departmental view

– 6 working groups:

1. Data and Information creation
2. Preservation and curation
3. Search and navigation
4. Virtual research communities
5. Networks, compute power and storage hardware
6. Middleware, AAA and digital rights management

– Reports available on UK National eScience Centre Website

http://www.nesc.ac.uk/documents/OSI/index.html

– Note here report on Data and Information Creation

Page 31

OSI e-Infrastructure Steering Group: Data and Information Creation

Key findings:

1. The future e-infrastructure should directly support the management of data throughout its lifecycle ‘from cradle to grave’

2. The future e-infrastructure should reduce the cycle time from conducting research, through analysis, publication and feedback into new research

3. There should be a much greater use of simulation-based research and its much closer integration with physical research

4. The future e-infrastructure should support the use for research purposes of data collected for other purposes

5. The future e-infrastructure should be based upon standards which support uniform classification, integration, certification and citation of data across all sources

Page 32

Conclusion

STFC has massive holdings of data and information

The benefits of ready, online, open access to research are self-evident
– wealth creation, improvement in quality of life

The data and information require:
– Preservation: making it available indefinitely
– Curation: making it understandable indefinitely

This implies use of metadata
– Needs to be ‘more intelligent’ (semantics on syntax)
– Needs standards (for interoperation)

This is what the PARS Alliance is all about

Page 33

The End