Informatics Transform : Re-engineering Libraries for the Data Decade
description
Transcript of Informatics Transform : Re-engineering Libraries for the Data Decade
![Page 1: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/1.jpg)
A centre of expertise in digital information management
www.ukoln.ac.uk
UKOLN is supported by:
The Informatics Transform:
Re-engineering Libraries for the Data Decade
Dr Liz Lyon, Associate Director, UK Digital Curation Centre Director, UKOLN, University of Bath, UK
VALA2012, Melbourne, Australia
This work is licensed under a Creative Commons LicenceAttribution-ShareAlike 2.0
![Page 2: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/2.jpg)
“Data is the new oil.”
Andreas Weigend, Stanford (ex Amazon)
“The future belongs to companies and people that turn data into products”
Mike Loukides, O’Reilly Media
![Page 3: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/3.jpg)
http://www.flickr.com/photos/thinkmulejunk/352387473/
http://www.google.co.uk/imgres?q=illumina+bgi&hl=en&client=firefox-a&hs=Jl2&rls=org.mozilla:en-GB:official&biw=1366&bih
http://www.flickr.com/photos/wasp_barcode/4793484478/http://www.flickr.com/photos/charleswelch/3597432481//
http://www.flickr.com/photos/usfsregion5/4546851916//
Data...
![Page 4: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/4.jpg)
Oceans:last unmapped frontier?
http://bohemianadventures.blogspot.com.au/2010/06/bering-sea-day-1-dutch-harbor.html
http://www.wired.com/wiredscience/2011/09/ocean-sensor-network/
![Page 5: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/5.jpg)
..using personal data for research
![Page 6: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/6.jpg)
Share your genome data? • Buy a DTC kit• Join a project
![Page 7: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/7.jpg)
Would if given the opportunity
54%
15%
13%
18%
Have not/would not
Have had genome analysis
Not sure
In a recent 2011 survey, Nature asked its readers whether they had, or would consider, a genome analysis (n=1588)
![Page 8: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/8.jpg)
Consumer data…
![Page 9: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/9.jpg)
One in every nine people on Earth is on Facebook
30billion pieces of content are shared on Facebook each month
People upload 3000images to Flickr every minute
Google+ has > 25million users
From 20 Social Media Statistics (Jeffbullas)
![Page 10: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/10.jpg)
…and conversations
http://www.touchagency.com/free-twitter-infographic/
![Page 11: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/11.jpg)
“Data is the new oil.”
Andreas Weigend, Stanford (ex Amazon)
Data is more like soup – its messy and you don’t know what’s in it….
![Page 12: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/12.jpg)
“DIY”
http://www.technologyreview.com/biomedicine/37784/
Kyle Machulis
Human physiology data
![Page 13: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/13.jpg)
Particle physics data
“Herculean” and“Heroic”
![Page 14: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/14.jpg)
“Crowd-sourced” astronomy
![Page 15: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/15.jpg)
Researchers need help to manage their data.
This is a really exciting opportunity for libraries…..
![Page 16: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/16.jpg)
http://www.flickr.com/photos/49397559@N02/5899381202/
with
a bit o
f re-eng
ineerin
g
![Page 17: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/17.jpg)
1. Leadership
(Getting attention…)
![Page 18: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/18.jpg)
Six reasons why you should care about managing your research data
![Page 19: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/19.jpg)
Photo credits: Harvey Rutt http://www.ecs.soton.ac.uk/regenesis/pictures/
1. Risk: where is your data?
![Page 20: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/20.jpg)
2. Reputation : data access, FOI
![Page 21: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/21.jpg)
http://www.sciencemag.org/content/334/6060/1226.full.html
3. Quality: data gold standard
![Page 22: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/22.jpg)
4. Scale: an explosion of data
http://www.phgfoundation.org/reports/10364/
“A single sequencer can now generate in a day what it took 10 years to collect for the Human Genome Project”
![Page 23: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/23.jpg)
Alzheimer’s Disease Neuroimaging Initiative: a unique (open) $60M partnership between
NIH, FDA, universities and drug companies.
“It was unbelievable. Its not science the way most of us have practiced in our careers.
But we all realised that we would never get biomarkers unless all of us parked our egos
and intellectual property noses outside the door and agreed that all of our data would
be public immediately.” Dr John Trojanowski, University of Pennsylvania
5.Partnerships
![Page 24: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/24.jpg)
http://www.epsrc.ac.uk/about/standards/researchdata/Pages/expectations.aspx
6. Funding
EPSRC expects all those institutions it funds
•to develop a roadmap that aligns their policies and processes with EPSRC’s expectations by 1st May 2012;
•to be fully compliant with these expectations by 1st May 2015.
![Page 25: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/25.jpg)
• Awareness of regulatory environment
• Data access statement
• Policies and processes
• Data storage
• Structured metadata descriptions
• DOIs for data
• Securely preserved for a minimum of 10 years
![Page 26: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/26.jpg)
http://www.flickr.com/photos/darshan-shah/6237564870/http://www.cartoonstock.com/lowres/csl4846l.jpg
…and Carrots
Sticks
![Page 27: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/27.jpg)
2. Research Data Management services
(Providing tools & support)
![Page 28: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/28.jpg)
Understanding Data Requirements
http://www.dcc.ac.uk/
![Page 29: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/29.jpg)
Data management plans
![Page 30: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/30.jpg)
• Advocacy & Training• Informatics: disciplinary
metadata schema, standards, formats, identifiers, ontologies
• Storage: file-store, cloud, data centres, funder policy
• Access: embargoes, FOI
![Page 31: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/31.jpg)
How to cite data
What data to keep
![Page 32: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/32.jpg)
Data Licensing
• Bespoke licences• Standard licences• Multiple licensing• Licence mechanisms
![Page 33: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/33.jpg)
Tools to track impact
http://total-impact.org/
![Page 34: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/34.jpg)
Research360@Bath
• Partnership approach• UKOLN-DCC• Library• IT services• Research Support
Office• Doctoral Training
Centres
http://blogs.bath.ac.uk/research360/
![Page 35: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/35.jpg)
Library & institutional stakeholders
•Roles (7 listed)•Responsibilities •Requirements •Relationships
Partnership approach
Liz Lyon, Informatics Transform, Ariadne Issue 68, 2012
![Page 36: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/36.jpg)
1. Director IS/CIO/University Librarian2. Data librarians /data scientist
/liaison/subject/faculty librarians3. Repository managers4. IT/Computing Services5. Research Support/Innovation Office6. Doctoral Training Centres7. PVC Research
Data rolesLiz Lyon, Informatics Transform, Ariadne Issue 68, 2012
![Page 37: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/37.jpg)
Full mapping : Informatics Transform, Ariadne Issue 68, 2012
![Page 38: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/38.jpg)
3. Developing data informatics capacity & capability
(Acquiring the skills….)
![Page 39: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/39.jpg)
Sheila Corrall: Libraries, Librarians and Data Many action exemplars
RLUK/Mary Auckland: Reskilling for Research
9 areas are skill gaps for subject librarians
2012: Libraries in review
![Page 40: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/40.jpg)
Skill gap 2-5 years NowPreserving research outputs 49% 10%
Data management & curation 48% 16%
Comply with funder mandates 40% 16%
Data manipulation tools 34% 7%
Data mining 33% 3%
Metadata 29% 10%
Preservation of project records 24% 3%
Sources of research funding 21% 8%
Metadata schema, discipline standards, practices
16% 2%
Data from RLUK/Mary Auckland: Reskilling for Research 2012
![Page 41: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/41.jpg)
• Skills shortage for data informatics?• Reposition LIS curriculum?• LIS entry requirements?• Get credit for informatics work?
Pause for reflection….
Lyon, Informatics Transform, Ariadne 2012
![Page 42: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/42.jpg)
1. Define core components of data informatics
• Visualisation e.g. VisTrails• Workflow e.g. Taverna• Analysis e.g. R
Play for action….
Lyon, Informatics Transform, Ariadne 2012
![Page 43: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/43.jpg)
“Very few librarians are likely to have specialist scientific or medical knowledge - if you train as a research scientist or a medic, you probably won’t become a librarian.”
RLUK/Mary Auckland: Reskilling for Research 2012
![Page 44: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/44.jpg)
2. Analyse LIS entry qualifications & increase STEM entrants
Target• Biologists• Chemists• Mathematicians
Play for action….
Lyon, Informatics Transform, Ariadne 2012
![Page 45: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/45.jpg)
Let’s get together
![Page 46: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/46.jpg)
3. International Data Informatics Working Group to explore promotion, recognition & reward
• Global awareness campaign• Career incentives• Benchmark good practice
Play for action….
Lyon, Informatics Transform, Ariadne 2012
![Page 47: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/47.jpg)
Position LocationScience Data Librarian Stanford
Data Management Librarian Oregon State
Social Sciences Data Librarian Brown
Data Curation Librarian Northeastern
Data Librarian New South Wales
Research Data Management Co-ordinator
Sydney
Research Data & Digital Curation Officer
Cambridge
Data Services Librarian Iowa
Data Analyst ANDS
Institutional Data Scientist Bath
![Page 48: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/48.jpg)
Data journalist?
Data artist?
![Page 49: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/49.jpg)
Implications of “Big Data” and data science for organisations in all sectors
Predicts a shortage of 190,000 data scientists by 2019
http://www.mckinsey.com/Insights/MGI/Research/Technology_and_Innovation/Big_data_The_next_frontier_for_innovation
![Page 50: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/50.jpg)
“Big Data” Data scientist
http://www.emc.com/collateral/about/news/emc-data-science-study-wp.pdf
Data Science Revealed community survey
![Page 51: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/51.jpg)
For a University, research data is a key element of “Big Data”.
Managing research data effectively will give business advantage.
![Page 52: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/52.jpg)
http://communitymodel.sharepoint.com/
Data-intensive research • Intelligence• Decision-making• Planning• Investment• Capacity• Capability
![Page 53: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/53.jpg)
• Research Funders• Institutions• Research leaders/PIs
Community Capability Model Framework CCMF
http://communitymodel.sharepoint.com/
![Page 54: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/54.jpg)
“The ability to take data -to be able to understand it, to process it, to extract value from it, to visualise it, to communicate it -that’s going to be a hugely important skill in the next decades.”
Hal Varian, Chief Economist, Google
![Page 55: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/55.jpg)
Libraries are on a data journey -the Informatics Transform is the first step in a new direction…
![Page 56: Informatics Transform : Re-engineering Libraries for the Data Decade](https://reader034.fdocuments.net/reader034/viewer/2022051609/5462862bb4af9f581c8b4801/html5/thumbnails/56.jpg)
Thank you!
Informatics Transform article (in press)
http://ariadne.ac.uk/issue68/lyonuse details:
Slideshttp://www.ukoln.ac.uk/ukoln/staff/e.j.lyon/presentations.html
DCC http://www.dcc.ac.uk