Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University...

40
Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries

Transcript of Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University...

Page 1: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Data Management Planning

Ron NakaoStanford University

Libraries

Jake CarlsonPurdue University

Libraries

Page 2: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

What will be Covered

• An introduction to terms and concepts.

• An understanding of the purpose of data management planning.

• Coverage of some of the elements of data management planning and how they may relate to each other.

• Case studies from Purdue and Stanford.

Page 3: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

What is Data Management?

“In the context of research and scholarship, "Data Management" refers to the storage, access and preservation of data produced from a given investigation. Data management is practices through the entire lifecycle of the data…”

-Texas A&M, Research Data Management Lib Guidehttp://guides.library.tamu.edu/DataManagement

Page 4: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

What is a DMP?

• A formal document.• Describes:

– what data will be produced– how each type of data will be managed– how each type of data will be shared– how each type of data will be archived– who will take responsibility for these actions

• DMP Resources and Examples:http

://www.icpsr.umich.edu/icpsrweb/content/datamanagement/dmp/resources.html

Page 5: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

DMP Requirement (NSF)• Data - samples, physical collections, software, curriculum

materials, and other materials;

• Standards - for data and metadata formats and content;

• Policies for access and sharing – incl. IP, protection of privacy/confidentiality, security, etc.;

• Policies for re-use – including provisions for re-distribution, and the production of derivatives;

• Archiving - data, samples, and other research products, and for preservation of access.

http://www.nsf.gov/bfa/dias/policy/dmp.jsp

Page 6: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

DMP Tool

https://dmp.cdlib.org/

Page 7: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

https://dmponline.dcc.ac.uk/

Page 8: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

DMP Consulting

Page 9: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Why Manage Data?• Because you have to:

– Meet grant requirements

• Because you want to:– Increase the visibility of your research– Simplify your life / Save time– Protect yourself

http://libraries.mit.edu/guides/subjects/data-management/why.html

Page 10: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Effective Data Management Planning

• Is a process, not an event• Probably requires more thought than it is given

in developing the grant• Probably requires more than 2 pages• Should be informed by disciplinary and local

cultures and environments• Should be driven by goals and objectives• Must be implemented to be successful

Page 11: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Other DMP Elements (ICPSR)• Responsibility - who does what, when?• Audience – identifying the potential secondary users of the

data • Selection and retention periods –

– what criteria will be used?– how long will data be retained and/or archived?– when will data be transferred to a 3rd party for curation?

• Quality Assurance• Ethics & Legal Requirements• Budget & Financial Aspects

http://www.icpsr.umich.edu/icpsrweb/content/datamanagement/dmp/elements.html

Page 12: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Preparing Data for Sharing

Data Collection & File Creation

Depositing Data

Data Analysis

Project Start-Up

DMP Purpose

Proposal Development

& DMPs

Page 13: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Guidance Across the Lifecycle

Preparing Data for Sharing

> Address disclosure risk limitation> Determine file formats to deposit> Contact archive for advice

Page 14: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Case Study on Data Management Planning

Page 15: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

• Libraries sponsored research center.

• Established in 2006 to focus on issues associated with curating data sets for present and future research use.

• Working in partnership with domain scientists and IT personnel to address the real world data needs of a research community.

Page 16: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Background Research

“Unpacking” the NSF requirements

Review of the content of existing data management plans

Review of existing guides on creating a DMP

Review of the information gathered from our Data Curation Profiles work, and other faculty-librarian collaborations

Page 17: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

IASSIST 2011

The Data Curation Profile is not designed to produce a Data Management Plan, however it could be used as a foundation to develop a more specific tool

Page 18: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Carlson IASSIST 2011

Interviews

Working with OVPR, four proposals were selected: Engineering Education Agronomy Physics / Electrical & Computer Engineering Pharmacy

Interviews are conducted: Multiple faculty / Multiple interviews Sponsored Programs personnel and Subject Librarians

also attend interviews

Page 19: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Challenges

• Metadata & Preservation– Hard for researchers to define, or their

understanding may not be fully accurate. • Archive = an old copy and/or a back-up copy

• Generally outside researcher’s current practices.

• Disciplinary standards or solutions may not be known, or may not exist.

Page 20: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

DMP Self-Assessment Questionnaire

http://purl.lib.purdue.edu/d2c2/dmp_saq

Page 21: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

IASSIST 2011

Guides

Page 22: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

PURR

http://purr.purdue.edu

Page 23: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Nano HUB

http://nanohub.org/

Page 24: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

PURR - Planning

Page 25: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

PURR – Active Data

Page 26: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Publishing & Curation

• Abstract• Cite this Work• Tags• Citations

• Supporting Docs• Versions• Reviews• Questions

Page 27: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Stanford Case Study

• Stanford Data Management Services• Faculty collaboration example (HCMST)• Stanford Digital Repository (SDR)

Page 28: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Data Management Services

Page 29: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Plan

Page 30: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Manage

Page 31: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Preserve

Page 32: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Case Study• Collaborating with Professor Michael

Rosenfeld on Data Management Plan & Its Implementation

DMP (later in Exercise)“Painless” creation of MetadataQuick turnaround for public data sharing

<data.stanford.edu>Long-term Preservation

ICPSRStanford Digital Repository (SDR)

Page 33: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

<data.stanford.edu> MetadataTitleCitation Abstract, Principal Investigator, Funding Agency, Bibliographic Citation, Contact EmailDescription Introduction, AcknowledgementsMethodology Universe, Unit of Analysis, Type of data collection, Time span,

Time of data collection, Geographic coverage, Smallest geographic unit, Sample description, Sample response rate, Weights

Documentation Document file(s), Web site or document download link(s)Data Download Link(s) Data file(s)Notes Errata, Data NotesNews News Coverage

Page 34: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

data.stanford.edu

Page 35: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Data entry form

Page 36: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Lessons from Case Study• Quick development, enhancement, and data

availability (Drupal)• Active PI involvement & metadata creation• Ownership & “freshness” of PI’s data page• Easy referral by PI (customized URL), usage

stats, and contact lists provided ongoing value for PI

Page 37: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Archiving HCMST: ICPSR

Page 38: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Stanford Digital Repository (SDR) • The SDR is a service supporting long-term management of

scholarly information resources at Stanford.

• Deposit in the SDR enables faculty, students, researchers to promote and protect the products of their work.

• Librarians use the SDR to preserve and share scholarly collections of enduring value to the larger Stanford community.

• Through robust preservation and security measures, the repository maintains appropriate access to deposited content from persistent web links while protecting against data loss and corruption.

Page 39: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Stanford’s Digital Library Infrastructure

Diagram courtesy of Hannah Frost, Services Manager, Stanford Digital Repository

Page 40: Data Management Planning Ron Nakao Stanford University Libraries Jake Carlson Purdue University Libraries.

Thanks!

Any Questions?