Introduction to data management planning · Levels of open data ⭐ make your stuff available on...
Transcript of Introduction to data management planning · Levels of open data ⭐ make your stuff available on...
![Page 1: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/1.jpg)
Introductiontodatamanagementplanning
JoyDavidsonDigitalCurationCentre
Acknowledgements:contentcontributedbySarahJones,JonathanRans Funded by:
![Page 2: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/2.jpg)
Definitionofresearchdata
‘Researchdata’referstoinformation,inparticularfactsornumbers,collectedtobeexaminedandconsideredasabasisforreasoning,discussionorcalculation.
Inaresearchcontext,examplesofdataincludestatistics,resultsofexperiments,measurements,observationsresultingfromfieldwork,surveyresults,interviewrecordingsandimages.Thefocusisonresearchdatathatisavailableindigitalform.
GuidelinesonOpenAccesstoScientificPublicationsandResearchDatainHorizon2020v.1.0,11December2013,Footnote5,p3
![Page 3: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/3.jpg)
Howdoesresearchdatafitinwiththethemeofopenscience?
“sciencecarriedoutandcommunicatedinamannerwhichallowsotherstocontribute,collaborateandaddtothe
researcheffort,withallkindsofdata,resultsandprotocolsmadefreelyavailableatdifferentstagesoftheresearch
process.”
Research InformationNetwork,OpenSciencecasestudieswww.rin.ac.uk/our-work/data-management-and-curation/
open-science-case-studies
![Page 4: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/4.jpg)
Levelsofopendata
⭐ makeyourstuffavailableontheWeb(whatever format)underanopen licence
⭐⭐ makeitavailableasstructureddata(e.g.Excelinsteadofascanofatable)
⭐⭐⭐ usenon-proprietary formats(e.g.CSVinsteadofExcel)
⭐⭐⭐⭐ useURIstodenotethings,sothatpeoplecanpointatyour stuff
⭐⭐⭐⭐⭐ linkyourdatatootherdatatoprovidecontext
Tim Berners-Lee’s proposal for five star open data - http://5stardata.info
“Open data and content can be freely used, modified and shared by anyone for any purpose”
http://opendefinition.org
![Page 5: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/5.jpg)
Howdoesresearchdatamanagementfitintothepicture?
Create
Document
Use
Store
Share
Preserve
• DataManagementPlanning• Creatingdata• Documentingdata• Accessing/usingdata• Storageandbackup• Selectingwhattokeep• Sharingdata• Datalicensingandcitation• Preservingdata
Create
Document
Use
Store
Share
Preserve
Create
Document
Use
Store
Share
Preserve
![Page 6: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/6.jpg)
Funders haveexpectationsaboutdatasharing…
“TheEuropeanCommission’svisionisthatinformationalreadypaidforbythepublicpurseshouldnotbepaidforagaineachtimeitisaccessedorused,andthatitshouldbenefitEuropeancompanies
andcitizenstothefull.”
http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-pilot-guide_en.pdf
Data management plans requested for those participating in Open Data pilot.
![Page 7: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/7.jpg)
“Datasetsarebecomingthenewinstrumentsof
science”
DanAtkins,UniversityofMichigan
…butRDMispartofgoodresearchpractice!
![Page 8: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/8.jpg)
DMPscanhelp
ProjectsparticipatinginthepilotwillberequiredtodevelopaDataManagementplan(DMP),inwhichtheywillspecifywhatdatawillbeopen.
Note that the Commission does NOT require applicants to submit a DMP at the proposal stage.
A DMP is therefore NOT part of the evaluation.
DMPs are a deliverable for those participating in the pilot.
![Page 9: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/9.jpg)
WhataspectsofRDMshouldbe inaDMP?§ What data will be created (format, types, volume...)
§ Standards and methodologies to be used (incl. metadata)
§ How ethics and Intellectual Property will be addressed
§ Plans for data sharing and access
§ Strategy for long-term preservation Create
Document
Use
Store
Share
Preserve
A DMP is a plan to share!
![Page 10: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/10.jpg)
Howwillyounameyourfiles?• Keepitsimple!• Agreemethodswithpartners• Includedates• Avoidnon-alphanumericcharacters• Usehyphensorunderscoresnotspacese.g.day-sheet,day_sheet• Ordertheelements logically
Example from ARM Climate Research Facility www.arm.gov/data/docs/plan
www.jiscdigitalmedia.ac.uk/guide/choosing-a-file-name
![Page 11: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/11.jpg)
Whatismetadata?
![Page 12: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/12.jpg)
Whatisthedifference?
• Metadata– Standardised– Structured– Machineandhumanreadable Metadata
Documentation
![Page 13: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/13.jpg)
Howshouldyoudescribeyourdata?
http://www.dcc.ac.uk/resources/metadata-standards
![Page 14: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/14.jpg)
Whatistheminimumrequired?
• DataCite metadatausedbyOpenAIRE• Citation/disambiguation
– Identifiere.g.DOI– Creator– Title– Publisher– PublicationYear
• Licencing/accessconditions
![Page 15: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/15.jpg)
Wherewillyoustorethedataduringyourresearch?
• Yourownlaptop• Universitysystems• Cloudstorage• Combination
Your decision will be based on how sensitive your data are, how robust you need the storage to be, who needs access to the data,
and when they need access to the data!
![Page 16: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/16.jpg)
Whichdatamustbekept?
• Data,includingassociatedmetadata,neededtovalidatetheresultsinscientificpublications
• Othercuratedand/orrawdata,includingassociatedmetadata,asspecifiedintheDMP
Doesn’t apply to all data (researchers to define as appropriate)
Don’t have to share data if inappropriate – exemptions apply
![Page 17: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/17.jpg)
Exemptions– reasonsforoptingout
• Ifresultsareexpectedtobecommerciallyorindustriallyexploited
• Ifparticipation isincompatiblewiththeneed forconfidentiality inconnectionwithsecurityissues
• Incompatiblewithexistingrulesontheprotectionofpersonaldata
• Would jeopardise theachievementofthemainaimoftheaction
• Iftheprojectwillnotgenerate/collectanyresearchdata•
• IfthereareotherlegitimatereasonstonottakepartinthePilot
CanoptoutatproposalstageORduringlifetimeofprojectShoulddescribeissuesintheprojectDataManagementPlan
![Page 18: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/18.jpg)
Which additional datamightbekeptaftertheprojectends?
Fivestepstofollow
① Could thisdatabere-used② Must itbekeptasevidenceorforlegalreasons③ Shoulditbekeptforitspotentialvalue④ Considercosts– dobenefitsoutweighcost?⑤ Evaluatecriteriatodecidewhattokeep
5stepstodecidewhatdatatokeepwww.dcc.ac.uk/resources/how-guides/five-steps-decide-what-data-keep
![Page 19: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/19.jpg)
Assignpersistentidentifiers• Theyareanalphanumericcodeidentifyingaresource,organisationorindividual
• Theymustbe– Unique– Persistent
• Ideallytheyshouldbeactionabletoo
![Page 20: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/20.jpg)
https://ssi-dev.epcc.ed.ac.uk/
Remembertoconsiderphysicaldata,software
andmodels
http://spatialinformationdesignlab.org/project_sites/library/catalog.html
http://www.ukcrcexpmed.org.uk/Coventry_Warwick_CRF/PublishingImages/Tissue%20Bank%201.jpg
![Page 21: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/21.jpg)
Canyourdatabesharedwithothers?
• PI/researcher
• Datarepositoryandsupportstaff
• Researchparticipants
• Commercialpartners
• Secondarydatauser
![Page 22: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/22.jpg)
Howwillitbeshared?
http://service.re3data.org/search
Zenodo
• Joint effort by OpenAIRE-CERN
• Multidisciplinary repository
• Multiple data types
• Citable data (DOI)
• Links funding, publications, data & software
www.zenodo.org
• Does your publisher or funder suggest a repository?
• Are there data centres or community databases for your discipline?
• Does your university offer support for long-term preservation?
![Page 23: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/23.jpg)
www.dcc.ac.uk/resources/how-guides/license-research-data
Licensing research data
This DCC guide outlines the pros and cons of each approach and gives practical advice on how to implement your licence
CREATIVE COMMONS LIMITATIONS
NC Non-CommercialWhat counts as commercial?
ND No DerivativesSeverely restricts use
These clauses are not open licenses
Horizon 2020 Open Access guidelines point to:
or
![Page 24: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/24.jpg)
EUDATlicensingtool
Answerquestionstodeterminewhichlicence(s)areappropriatetouse
http://ufal.github.io/lindat-license-selector
![Page 25: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/25.jpg)
Optionsforopendata
• Domainrepository• Generalrepository– Figshare,Zenodo,Dryad• Institutionalrepository• Journalsupplementarymaterial• Departmentalwebpage
![Page 26: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/26.jpg)
Ø GeneraldirectoriesRe3data.org
Ø Domainspecificdirectoriese.g.lifesciences– Biosharing.org
Ø DatajournalrecommendationsEdinburgh researchdatablog:Sourcesofdatasetpeerreview
Ø FundingbodyrecommendationsE.g.WellcomeTrustDatarepositories anddatabasesources
FindingexternalrepositoriesGo
![Page 27: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/27.jpg)
Considerations• Theremaybeanacceptedrepositoryusedbypeersorrequiredbyfunders
• Multidisciplinarystudiesmaynothaveanobvioushome
• Datatypesandvolumeswillimpactondecision
![Page 28: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/28.jpg)
Howwillyoumakeyourdatadiscoverable?
http://ckan.data.alpha.jisc.ac.uk/datasethttps://www.researchfish.com/
http://gtr.rcuk.ac.uk/
http://researchdata.gla.ac.uk/
![Page 29: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/29.jpg)
Optionsforcloseddata
• Institutionaldataarchive/vault• Safehavens– (e.g.securepatientdata)• 3rd partydataarchiving• Cloudstorage• Institutionalservers– the‘donothing’option
![Page 30: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/30.jpg)
Approach:asopenaspossible,
asclosedasnecessary
Image: ‘Balancing rocks’ by Viewminder CC-BY-SA-ND www.flickr.com/photos/light_seeker/7780857224
![Page 31: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/31.jpg)
Refertofreeguidesandbriefingpapers
www.dcc.ac.uk/resources/
![Page 32: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/32.jpg)
GuidelinesfromtheCommission
• FactsheetonOpenAccess– https://ec.europa.eu/programmes/horizon2020/sites/horizon2020/files/FactSheet_Ope
n_Access.pdf
• GuidelinesonOpenAccesstoScientificPublicationsandResearchDatainHorizon2020– http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_
pilot/h2020-hi-oa-pilot-guide_en.pdf
• GuidelinesonDataManagementinHorizon2020– http://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_
pilot/h2020-hi-oa-data-mgt_en.pdf
![Page 33: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/33.jpg)
https://dmponline.dcc.ac.uk
Makeuseoffreetools
![Page 34: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/34.jpg)
WhatisDMPonline?
• Aweb-basedtooltohelpresearcherswriteDataManagementandSharingPlans
• Includesrequirementsandguidancefromfunders,universitiesandothergroups
• DevelopedbytheDigitalCurationCentre
![Page 35: Introduction to data management planning · Levels of open data ⭐ make your stuff available on the Web (whateverformat) under an open licence ⭐⭐ make it available as structured](https://reader036.fdocuments.net/reader036/viewer/2022081323/601a2f90b05b7a45084ef3d2/html5/thumbnails/35.jpg)
• Morevisibleresearchoutputsandincreasedimpact - evenfornegativeresults
• Easieroutputsreporting• Betterandmorereproducibleresearch!
GoodRDMhelpsyoucomplywithmandatesbutalsoleadsto…