Research Data Management Introduction: EUDAT/Open AIRE Webinar
-
Upload
eudat -
Category
Data & Analytics
-
view
3.737 -
download
0
Transcript of Research Data Management Introduction: EUDAT/Open AIRE Webinar
![Page 1: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/1.jpg)
Research Data Management- an introductory webinar
Tony Ross-Hellauer, OpenAIRESarah Jones, EUDAT
This work is licensed under the Creative Commons CC-BY 4.0 licence
![Page 2: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/2.jpg)
Open Access Infrastructure for Research in Europe
www.openaire.eu
Who we are
Research Data Services, Expertise & Technology https://www.eudat.eu
![Page 3: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/3.jpg)
• Why manage data?
• RDM in Horizon 2020 (+ recent changes)
• How to manage and share research data?
• EUDAT and OpenAIRE services
Overview
![Page 4: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/4.jpg)
WHY MANAGE DATA?Image CC-BY-NC-SA by Leo Reynolds www.flickr.com/photos/lwr/13442910354
![Page 5: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/5.jpg)
Data explosion• More and more data is
being created• Issue is not creating
data, but being able to navigate and use it
• Data management is critical to make sure data are well-organised, understandable and reusable
![Page 6: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/6.jpg)
Digital data are fragile and susceptible to loss for a wide variety of reasons• Natural disaster• Facilities infrastructure failure• Storage failure• Server hardware/software failure• Application software failure• Format obsolescence• Legal encumbrance• Human error• Malicious attack• Loss of staffing competencies• Loss of institutional commitment• Loss of financial stability• Changes in user expectations
Data loss
Image CC BY-NC-SA 2.0 by Dave Hill https://www.flickr.com/photos/dmh650/4031607067
![Page 7: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/7.jpg)
A reproducibility crisis
![Page 8: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/8.jpg)
Why manage data?
• Make your research easier• Stop yourself drowning in irrelevant stuff• Save data for later• Avoid accusations of fraud or bad science• Share your data for re-use• Get credit for it• Meet funder/institution requirements
Because well-managed data opens up opportunities for re-use, sharing and makes for better science!
![Page 9: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/9.jpg)
RDM IN HORIZON 2020 Image “Open Data” CC BY 2.0 by http://www.descrier.co.uk
![Page 10: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/10.jpg)
EC Open Research Data Pilot, Jan 2015 -
• A limited, voluntary pilot (initially 8 programme areas) with opt-out and safeguards
• Participating projects must:• Keep a data management plan, to be updated at regular intervals• Deposit in an open access repository:
1. the data, including associated metadata, needed to validate the results presented in scientific publications as soon as possible;
2. other data, including associated metadata, as specified and within the deadlines laid down in the data management plan
![Page 11: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/11.jpg)
EC Open Research Data Pilot Opt-out Reasons
https://open-data.europa.eu/data/dataset/open-research-data-the-uptake-of-the-pilot-in-the-first-calls-of-horizon-2020
![Page 12: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/12.jpg)
Just announced!
H2020 - Open Data by Default from 2017
![Page 13: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/13.jpg)
MANAGING & SHARING DATA
![Page 14: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/14.jpg)
CREATING DATA
PROCESSING DATA
ANALYSING DATA
PRESERVING DATA
GIVING ACCESS TO
DATA
RE-USING DATA
Research data lifecycleCREATING DATA: designing research, DMPs, planning consent, locate existing data, data collection and management, capturing and creating metadata
RE-USING DATA: follow-up research, new research, undertake research reviews, scrutinising findings, teaching & learning
ACCESS TO DATA: distributing data, sharing data, controlling access, establishing copyright, promoting data PRESERVING DATA: data storage, back-
up & archiving, migrating to best format & medium, creating metadata and documentation
ANALYSING DATA: interpreting, & deriving data, producing outputs, authoring publications, preparing for sharing
PROCESSING DATA: entering, transcribing, checking, validating and cleaning data, anonymising data, describing data, manage and store data
Ref: UK Data Archive: http://www.data-archive.ac.uk/create-manage/life-cycle
![Page 15: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/15.jpg)
• Findable– assign persistent IDs, provide rich metadata, register in a searchable
resource...
• Accessible– Retrievable by their ID using a standard protocol, metadata remain
accessible even if data aren’t...
• Interoperable– Use formal, broadly applicable languages, use standard vocabularies,
qualified references...
• Reusable– Rich, accurate metadata, clear licences, provenance, use of community
standards...
www.force11.org/group/fairgroup/fairprinciples
FAIR data
![Page 16: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/16.jpg)
A DMP is a brief plan to define:• how the data will be created?• how it will be documented?• who will access it?• where it will be stored?• who will back it up?• whether (and how) it will be shared & preserved?
DMPs are often submitted as part of grant applications, but are useful whenever researchers are creating data.
Data Management Plans
![Page 17: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/17.jpg)
DMPonlineA web-based tool to help researchers write DMPs
Includes a template for Horizon 2020Guidance from EUDAT and OpenAIRE being added
https://dmponline.dcc.ac.uk
![Page 18: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/18.jpg)
• Metadata and documentation is needed to locate and understand research data
• Think about what others would need in order to find, evaluate, understand, and reuse your data.
• Get others to check the metadata to improve quality
• Use standards to enable interoperability
Metadata & documentation
![Page 19: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/19.jpg)
Metadata standardsUse relevant standards for interoperability
http://rd-alliance.github.io/metadata-directory
![Page 20: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/20.jpg)
Where to store data?
• Your own drive (PC, server, flash drive, etc.)– And if you lose it? Or it breaks?
• Somebody else’s drive / departmental drive
• “Cloud” drive– Do they care as much about your data as you
do?
• Large scale infrastructure services like EUDAT
![Page 21: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/21.jpg)
How to backup?• 3... 2... 1... backup!
– at least 3 copies of a file– on at least 2 different media– with at least 1 offsite
• Use managed services where possible e.g. University filestores or infrastructure services like EUDAT rather than local or external hard drives
• Ask IT teams for advice
![Page 22: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/22.jpg)
Backup and preservation – not the same thing!
• Backups– Used to take periodic snapshots of data in case the current
version is destroyed or lost– Backups are copies of files stored for short or near-long-
term– Often performed on a somewhat frequent schedule
• Archiving– Used to preserve data for historical reference or potentially
during disasters– Archives are usually the final version, stored for long-term,
and generally not copied over– Often performed at the end of a project or during major
milestones
![Page 23: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/23.jpg)
Data repositories
http://databib.org
http://service.re3data.org/search
• Does your publisher or funder suggest a repository?• Are there data centres or databases for your discipline?• Does your university offer support for long-term preservation?
![Page 24: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/24.jpg)
A mistake in a spreadsheet led to dramatically different results from those published.
These results were cited by the International Monetary Fund and the UK Treasury to justify austerity programmes.
Had the data been shared, this could have been picked up earlier.
The importance of sharing data
![Page 25: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/25.jpg)
Concerns about data sharing
Concern Solution
inappropriate use due to misunderstanding of research purpose or parameters
security and confidentiality of sensitive data
lack of acknowledgement / credit
loss of advantage when competing for research funding
![Page 26: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/26.jpg)
Concerns about data sharing
Concern Solution
inappropriate use due to misunderstanding of research purpose or parameters
security and confidentiality of sensitive data
lack of acknowledgement / credit
loss of advantage when competing for research funding
metadata
metadata
metadata
metadata
![Page 27: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/27.jpg)
Concerns about data sharing
Concern Solutioninappropriate use due to misunderstanding of research purpose or parameters
provide rich Abstract, Purpose, Use Constraints and Supplemental Information where needed
security and confidentiality of sensitive data
• the metadata does NOT contain the data
• Use Constraints specify who may access the data and how
lack of acknowledgement / credit
specify a required data citation within the Use Constraints
loss of data insight and competitive advantage when vying for research funding
create second, public version with generalised Data Processing Description
![Page 28: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/28.jpg)
Make data shareable• Create robust metadata that has been checked
• Include reference information in metadata e.g. unique IDs & properly formatted data citations
• Publish your metadata so it’s discoverable. Use portals, clearing houses, online resources…
• Package up the data and associated metadata to deposit in repositories
• License the data clearly
![Page 29: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/29.jpg)
www.dcc.ac.uk/resources/how-guides/license-research-data
Licensing research data
This DCC guide outlines the pros and cons of each approach and gives practical advice on how to implement your licence
CREATIVE COMMONS LIMITATIONSNC Non-
CommercialWhat counts as
commercial?
ND No DerivativesSeverely restricts use
These clauses are not open licenses
Horizon 2020 Open Access guidelines point to:
or
![Page 30: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/30.jpg)
EUDAT licensing toolAnswer questions to determine which licence(s) are appropriate to use
http://ufal.github.io/public-license-selector
![Page 31: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/31.jpg)
What to preserve & share
It’s not possible to keep everything. Select based on:– What has to be kept e.g. data underlying publications– What can’t be recreated e.g. environmental recordings – What is potentially useful to others– What has scientific, cultural or historical value– What legally must be destroyed
How to select and appraise research data:www.dcc.ac.uk/resources/how-guides/appraise-select-research-data
![Page 32: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/32.jpg)
EUDAT & OPENAIRE SERVICESImage CC-BY-NC ‘Data centre’ by Bob Mical www.flickr.com/photos/small_realm/15995555571
![Page 33: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/33.jpg)
EUDAT servicesEUDAT offers a pan-European solution, providing a generic set of services to ensure minimum level of interoperability
Building common data services in close collaboration with 25+ communities
![Page 34: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/34.jpg)
EUDAT B2 service suite
Covering both access and deposit, from
informal data sharing to long-term archiving,
and addressing identification,
discoverability and computability of both long-tail and big data, EUDAT’s services will
address the full lifecycle of research
data
![Page 35: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/35.jpg)
CREATING DATA
PROCESSING DATA
ANALYSING DATA
PRESERVING DATA
GIVING ACCESS TO
DATA
RE-USING DATA
PIDs Referencing data:Finding data and making data findable
Data Transfer from public data servers
Store mutable data
Accessing services
Move data to HPC
![Page 36: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/36.jpg)
OpenAIRE services:zenodo.org
For all content types!
With GitHub integration!
Upload Describe Publish
Create communities!
![Page 37: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/37.jpg)
https://www.openaire.eu/search
Link data to publications
![Page 38: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/38.jpg)
OpenAIRE training and support materials
• Briefing papers, factsheets, Webinars, workshops, FAQs
• Information on:• Open Research Data Pilot• Creating a data
management plan• Selecting a data
repository
https://www.openaire.eu/opendatapilothttps://www.openaire.eu/support
![Page 39: Research Data Management Introduction: EUDAT/Open AIRE Webinar](https://reader033.fdocuments.net/reader033/viewer/2022061306/586f8f711a28ab54768b75a5/html5/thumbnails/39.jpg)
www.eudat.eu www.openaire.eu
Thanks – any questions?Contact us:
Tony Ross-Hellauer, OpenAIRE: [email protected] Jones, EUDAT: [email protected]
Acknowledgements:
Thanks to EUDAT colleagues Mark van de Sanden and Christine Staiger for slides.
Content has also been repurposed from the DataONE Educational modules, ‘Data Management’ and ‘Data Sharing’ Retrieved from https://www.dataone.org/education-modules