The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information...
-
Upload
patricia-richard -
Category
Documents
-
view
218 -
download
2
Transcript of The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information...
![Page 1: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/1.jpg)
The Faster Research Cycle Interoperability for better science
Brian Matthews, Leader, Information Management Group,E-Science Centre,STFC Rutherford Appleton [email protected]
![Page 2: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/2.jpg)
![Page 3: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/3.jpg)
The Research Lifecycle
E-Science: providing theinfrastructure for the research lifecycle
![Page 4: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/4.jpg)
How do we speed up this cycle?
By speeding up the cycle we can increase the volume of good science
– Make a better return from the investment in science
– Make breakthroughs in science earlier
Do this via:– Integration
• Support the whole lifecycle• See Kerstin’s talk
– Interoperability• Support across lifecycles
![Page 5: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/5.jpg)
Interoperability
Sharing across boundaries – Across different research lifecycles– Across institutions– Across information objects– Across disciplines– Across time
Characteristics– Loosely coupled– Across different authorities– Different internal models
![Page 6: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/6.jpg)
Enabling better scienceNeutron diffraction X-ray diffraction NMR
}High-qualitystructure refinement
}
SCIENCE
MASHUPS
![Page 7: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/7.jpg)
Vision
Infrastructure to support science across disciplines, scientific institutions and
research groups
![Page 8: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/8.jpg)
EDNS
European Data Infrastructure for Neutron and Synchrotron Sources
Combining European Neutron and Synchrotron Facilities
Already a common user community
Across many disciplines– Materials, chemistry, proteomics,
pharmaceuticals, nuclear physics, archaeology …
![Page 9: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/9.jpg)
![Page 10: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/10.jpg)
![Page 11: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/11.jpg)
Interoperability Across Facilities
ISIS ILL
Diamond ESRF
e-Science
Synchrotron X-Rays
Neutrons
UK France
![Page 12: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/12.jpg)
Integration and interoperation across facilitiesSingle Infrastructure Single User Experience
Different Infrastructures Different User Experiences
Facility 1
Raw Data
Data Analysis
Analysed Data
Published Data
PublicationsUser Data
Facility 3
Raw Data
Data Analysis
Analysed Data
Published Data
PublicationsUser Data
Raw Data Catalogue
Data Analysis
Analysed Data Catalogue
Published Data Catalogue
Publications Catalogue
User Catalogue
Facility 2
Raw Data
Data Analysis
Analysed Data
Published Data
PublicationsUser Data
Publications Repositories
Data Repositories
Software Repositories
UserRegistries
CapacityStorage
Common CRIS
![Page 13: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/13.jpg)
Potential Impact Most of Research Lifecycle
– User Management, Data Collection, Analysis, Publication
• Establish a Production service
– benefit to users – usability, findability: user info, data, pubs, software
– benefit to facilities – manageability: users, data, pubs, software
•Outreach and expansion
– Linking with other facilities in Europe and the wider world
• USA, Canada, Australia
– Linking with User communities
But at the moment, we are still in the planning and discussion phase
![Page 14: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/14.jpg)
Sharing UsersSharing knowledge of users
– Enhancing level of support for users
– Can correlate similar applications put into different facilities
– Facilities can provide a continuity of service
– Facilities can increase accuracy
Common Authentication– Common UID ?– Shibboleth– Grid Certificates– SSO at STFC, ShibGrid– Virtual Organisation Support
Policy Issues– Data protection– Institutional Security policy
FedID
Facility User
DN
Shibboleth ID
SRB System UID
SSHPK
Facility UserID
![Page 15: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/15.jpg)
Sharing Data
Sharing data is hard:– Different data formats– Different access rights– Complex objects– Maintaining context
Metadata is key– Structural Metadata (CSMD)– Conceptual structures (Ontologies) –
maintain meaning– Metadata is hard to collect
Consistent data policies are needed
![Page 16: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/16.jpg)
Aggregator services
Institutional data repositories
Deposit , Validation
Publication
ValidationData analysis
Search, harvest
Presentation services / portals
Data discovery, linking, citation
Laboratory repository
Deposit
eCrystals ‘Data Federation’ Model
Publishers: peer-review journals, conference proceedings, etc
Curation
Preservation
Subject Repository
Institution Library & Information Services
Data creation & capture in “Smart lab”
Data discovery, linking, citation
Search, harvest
Search, harvest
Deposit
Deposit
Deposit
![Page 17: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/17.jpg)
Data Policy
Data policy– Retention– Quality– Access
Learning how to manage policy as part of the SOA infrastructure
– E.g GridTrust– Consequence – looking
at Data Policy
Remains as a very large Business question
Goals & Requirements
Self-* …
Dynamic VO
PoliciesVO Mngt
…
…
Trust and Security for NGGs
Usage control
Resources
![Page 18: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/18.jpg)
Sharing Publications
Institutional Repository s/w now very well established– ePrints, DSpace, Fedora, ePubs– Large body of expertise available– Standard metadata models and protocols:
• DC-APs, FRBR, OAI-PMH, OAI-ORE
– Not yet embedded in science practise • except HEP!
Linking science data and publications– Not yet well established– Needs data citation– Needs peer review of data– Can (and should) be done on a P2P basis
![Page 19: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/19.jpg)
STFC
![Page 20: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/20.jpg)
Sharing Software
Analysis software tends to be specialised– Dependent on specific data formats– Dependent of nature of data– Dependent on the particular result to
demonstratedNevertheless common s/w repositories exist
– GAMS, StarLink, NAG, CCPForge etcAdvantages in sharing it
– Saves programmer effort– Verification of results– Common algorithms– Visualisation tools
Little work on systematic preservation of s/w– Signficant properties of s/w
![Page 21: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/21.jpg)
Common Representation and Transport of Information
To support the infrastructure we need a means to share information
– Lightweight– Minimal impact on internal systems– Keeps control at the source– Easy to share and merge– Can share conceptual information
The Semantic Web (still) provides the best current option
![Page 22: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/22.jpg)
DataWebsDataWeb concept– David Shotton,
Oxford– Biological
images– Publishing
metadata locally– With different
conceptual description
– Mapped to core Ontology
– Search and aggregator service
Integration comes for free
![Page 23: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/23.jpg)
SKOS: Simple conceptual relationships
![Page 24: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/24.jpg)
A Reality Check: the SUPER ReportDo the users really want all this?
Study of User Priorities fore-Infrastructure for e-Research (SUPER)
Survey commissioned by the UK NeSC– Steven Newhouse, Jennifer Schopf,
Andrew Richards, Malcolm AtkinsonCovered 45 people from over 30 e-Science projects
– Small survey– Selected from the already converted!
Available: http://www.nesc.ac.uk
![Page 25: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/25.jpg)
SUPER Results
Some Concerns:• How to share data with colleagues
– Large-scale data sets (files)– Metadata standards seen as key– Automatic capture of provenance
•Long-term data curation– Help with best practice to curate data
•Authentication– Simpler authentication mechanisms– Easier use of Virtual Organisations
•Training and outreach
We seem to be hitting the right points!
![Page 26: The Faster Research Cycle Interoperability for better science Brian Matthews, Leader, Information Management Group, E-Science Centre, STFC Rutherford Appleton.](https://reader035.fdocuments.net/reader035/viewer/2022062720/56649f125503460f94c256ab/html5/thumbnails/26.jpg)
Summary
Leverage to speed up the science lifecycle from interoperability
Access to resources across institutions and disciplines
Metadata KeyPolicy Key
Need to use semantic description to share meaning
Loose coupling of resources via Semantic Web