Introduc)onto* Data*Management** - University Library · 2017-04-18 ·...

30
Introduc)on to Data Management Chris’e Wiley Sarah C. Williams Heidi Imker

Transcript of Introduc)onto* Data*Management** - University Library · 2017-04-18 ·...

Page 1: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Introduc)on  to    Data  Management    

 Chris'e  Wiley  

Sarah  C.  Williams  Heidi  Imker  

Page 2: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM
Page 3: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Data  Management  Workshop  Series  

•  Introduc)on  to  Data  Management  –  Feb  10th    4PM  –  5PM  –  Apr  6th  1PM  –  2PM  

•  Documenta)on  and  Organiza)on  for  Data  and  Processes  –  Feb  17th    4PM  –  5PM  –  Apr  13th    1PM  –  2PM  

•  Making  Research  Data  Public:  Why,  What,  and  How  –  Feb  24th  4PM  –  5PM  –  Apr  20th    1PM  –  2PM  

Page 4: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Intro  to  Data  Management  Objec'ves  

1.  Learn  the  benefits  of  data  management  2.  Learn  the  components  of  data  management  3.  Iden'fy  gaps  in  your  current  prac'ces  4.  Know  where  to  find  help  

Page 5: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Data  Management  Prac'ces  •     complete  records    •     find    

•     understand    

•     back-­‐up    

•     migrate  

Page 6: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Data  Management  Prac'ces  •  I’ve  lost  two  patent  claims  because  we  didn’t  have  complete  records  in  lab  notebooks.  

•  I  panic  every  'me  I  try  to  find  my  students’  data.  

•  I  know  I  won’t  be  able  to  publish  anything  aSer  my  grad  student  leave  because  it’s  too  hard  to  understand  what  he/she  did.  

•  We  don’t  have  the  assay  results  with  substrate  X,  because  I  didn’t  back-­‐up  my  computer.    And  we’re  out  of  substrate  X.    

•  The  only  reason  I  was  able  to  do  the  analysis  was  because  on  a  whim  I  decided  to  migrate  some  of  my  data  off  of  floppy  disks  to  CDs  back  in  the  90s.    I  wish  I  had  done  all  of  it.      

Page 7: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM
Page 8: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM
Page 9: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Benefits  of  Good  Data  Management  

-­‐  Be^er  understand  your  data  now  and  in  the  future  -­‐  Access  your  data  in  the  future    -­‐  Save  'me  and  effort  -­‐  Meet  funder  and  publisher  requirements  

Page 10: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

What  are  the  elements  of  good    data  management?  

   

Page 11: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Plan  

Page 12: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Data  Management  Planning  Basics  

•     Data  Management  Plans  (DMPs)  •  What  data  produced,  how  will  you  make  it  accessible,  how  will  you  preserve  it,    what  restric'ons  are  there  on  access  

•   NSF  -­‐  2011  •   NIH  -­‐  2003  (>$500K)  •   DOE  -­‐  2014  

Page 13: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

DMPTool  –  h^p://www.dmptool.org/  

Page 14: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Organize  

Page 15: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Organiza'on  

•  Dis'nguish  Raw  Data  

•  Be  Consistent  in  File  Naming  and  Directory  Structures  

•  Define  and  Exercise  Version  Control  

Page 16: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Date  Tip  

BAM  Co-­‐Exp  Run  01  20140904.txt  BAM  Co-­‐Exp  Run  02  20140904.txt  BAM  Co-­‐Exp  Run  03  20140904.txt  

Run  1  B  anth  meth  Sept  4  .txt  BAM  Rxn  2  2014_09_04.txt  20140904_meth_3.txt  

vs.

Page 17: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Document  

Page 18: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Describing  Data  •  Enables  someone  unfamiliar  with  your  data  to  find,  evaluate,  understand,  and  reuse  your  data  

•  Helps  YOU  to  remember  details  about  your  data  aSer  'me  has  passed  

 Examples      ReadMe  File:    h^p://goo.gl/eOzQOO  Metadata:    h^p://dublincore.org/documents/dc-­‐xml-­‐guidelines/  Data  Dic'onaries:    Page  4-­‐1  of  this  User  Guide    h^p://goo.gl/Ugqiqo  Codebook:    h^p://www.icpsr.umich.edu/icpsrweb/ICPSR/help/cb9721.jsp    

Page 19: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Basic  Documenta'on  

•  Generic  Informa'on:    •  'tle,  key  dates,  creators,  subjects,  rights,  files,  formats,  versions,  project  &  funder  

 •  Addi'onal  Informa'on:  •  source  of  the  data,  how  analyzed,  soSware  and  hardware  requirements,  resul'ng  publica'ons  

Page 20: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Storage  &  Backup  

Page 21: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Storage  &  Back-­‐Up  Best  Prac'ces  

•  Security  •  Robustness  •  Number  of  copies  •  Loca'on  of  copies  •  Scheduling  •  Efficacy  

Page 22: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

protected  human  informa'on  (PHI)   con'ngent  on  compliance  with  FERPA/HIPAA/IRB  regula'ons  and  guidelines  

personally  iden'fiable   con'ngent  on  ability  to  de-­‐iden'fy  

proprietary  data   con'ngent  on  campus  IP  regula'ons  

classified  data   con'ngent  on  campus  export  control  

local  sensi'vity   con'ngent  on  lab/collabora'on  culture  

Considera'ons  for  Sensi've  Data  

Page 23: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Archive  

Page 24: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

An'cipate  the  Long  Term  •  What  happens  to  your  data  in…  •  2  years?    •  5  years?    •  20  years?  

•  Will  the  formats  be  valid?      •  Will  the  hardware  be  defunct?  •  Will  anyone  know  where  it  is?  

Page 25: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

An'cipate  the  Long  Term  

More  Control  More  Responsibility  

Less  Control  Higher  Poten'al  Impact  

Keep  Private   Make  Public  

! h^p://databib.org/    

Page 26: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Intro  to  Data  Management  Objec'ves  

1.  Learn  the  benefits  of  data  management  2.  Learn  the  components  of  data  management  3.  Iden'fy  gaps  in  your  current  prac'ces  4.  Know  where  to  find  help  

Page 27: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Contact  Chris)e  Wiley  Engineering  Data  Services  Librarian  [email protected]    Sarah  C.  Williams  Life  Sciences  Data  Services  Librarian  [email protected]    Heidi  Imker  Director,  Research  Data  Service  [email protected]    

visit:      310-­‐312  Main  Library  

 call:      

(217)  300-­‐3513    

website:    researchdataservice.illinois.edu  

Page 28: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Data  Management  Workshop  Series  

•  Introduc)on  to  Data  Management  –  Feb  10th    4PM  –  5PM  –  Apr  6th  1PM  –  2PM  

•  Documenta)on  and  Organiza)on  for  Data  and  Processes  –  Feb  17th    4PM  –  5PM  –  Apr  13th    1PM  –  2PM  

•  Making  Research  Data  Public:  Why,  What,  and  How  –  Feb  24th  4PM  –  5PM  –  Apr  20th    1PM  –  2PM  

Page 29: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Scholarly  Commons  h^p://www.library.illinois.edu/sc/index.html  

Library  Home  Page  h^p://www.library.illinois.edu/  

 

Paste  DOI  here.  No  VPN  needed  for  access  to  full  text  (just  ne'd/password)!  

Bonus  Tips!  

Page 30: Introduc)onto* Data*Management** - University Library · 2017-04-18 · DataManagementWorkshop!Series! • Introduc)on*to*Data*Management! – Feb10th!!4PM–5PM – thApr6 !1PM–2PM

Useful?  Tell  your  friends  and  collogues!  

 Not  Useful?  Tell  us!