Preserving Research Data in Canada: an update DLI/ACCOLEDS 2009 Chuck Humphrey University of Alberta...

Post on 27-Mar-2015

217 views 0 download

Tags:

Transcript of Preserving Research Data in Canada: an update DLI/ACCOLEDS 2009 Chuck Humphrey University of Alberta...

Preserving Research Data in Canada: an update

DLI/ACCOLEDS 2009

Chuck Humphrey

University of Alberta

1

Environmental scan of this decade

2000

2002

2004

Research Data Centre Network, 2000

2008

2006

2001

2003

2005

2007

2009

DLI becomes a permanent program, 2001

National Data Archive Consultation, 2001-2002

OECD Access to Publicly Funded Research Data, 2004

Canadian Digital Information Strategy, 2006-2007

Consultation on Access to Scientific Research Data, 2005

International Data Forum, 2007

Research Data Strategy Working Group, 2008

CARL Data Management Working Group

UNESCO Charter on Preservation of Digital Heritage, 2003

2

3

Tipping the balance toward action

Research Data Strategy Working Group

Initiated by Pam Bjornson, CISTI Executive Director Cross-sector working group consisting of members from

government departments & agencies and research libraries Stewardship of Research Data in Canada: A Gap

Analysis (January 2009). Uses a lifecycle model to identify data problems in Canada.

RDSWG reorganized in anticipation of the release of the Gap Analysis in fall 2008. Task Group 1: Engagement strategy Task Group 2: Policies, funding and reward systems Task Group 3: Infrastructure and services Task Group 4: Capacity

4

Gap analysis summary

Source: The Stewardship of Research Data in Canada: a gap analysis, Table 2, page 17. 5

CARL Data Management Working Group

Members Marnie Swanson, Chair (U of Victoria) Pam Bjornson (CISTI) Lynn Copeland (SFU) Michelle Edwards (U of Guelph)

Observers Bernie Gloyn (Statistics Canada) Margaret Haines (Carleton U) Janine Schmidt (McGill U) Kathleen Shearer (CARL consultant)

Produced the Data Management Awareness Toolkit

6

Research Data Management Seminar

7

http://www.dcc.ac.uk/lifecycle-model/8

http://www.dcc.ac.uk/lifecycle-model/9

10

This table lists changes to the stages in the DCC model, re-aggregating activities in the lifecycle to create a data library viewpoint.

DCC Data Lib

create or receive data production

appraisal and select

dissemination

ingest, store, access and use

data repository

discovery

transform repurpose11

Data stewardship lifecycle

Data Repurposing

Data ProductionData Repository

Data Dissemination

Data Discovery

12

Where are we headed?

Trusted Research Data Repositories (TRDR’s) are emerging as a new institutional model to support data preservation. Based on work advanced for trusted digital repositories, TRDR’s are specialized services dedicated to research data.

Internationally, the U.S. and Europe are investing in the development of infrastructure to support TRDR’s. NSF DataNet Europe’s Digital Repository Infrastructure Vision

for European Research (DRIVER) DRIVER II: Federated data repositories

13

VirtualVirtualCommunityCommunity

Network

Grid

Scientific Data

VirtualVirtualCommunityCommunity

Network

Grid

Scientific Data

VirtualLaboratories

Workspace

Meetings, experiments, etc.

VirtualVirtualCommunityCommunity

Network

Grid

Scientific Data

VirtualLaboratories

Workspace

Meetings, experiments, etc.

VirtualLaboratories

Workspace

Meetings, experiments, etc.

Network

Grid

Scientific Data

Econ

om

ies

Econ

om

ies

of

Scale

of

Scale

Effi

cie

ncy

Effi

cie

ncy

Gain

sG

ain

sGlobal virtual research community

Source: Ulf Dahlsten, “Building a global virtual research community,” at the International Data Forum, Beijing, June 7, 2007

14

e-I

nfr

ast

ructu

re

of

re

posi

tori

es

e-I

nfr

ast

ructu

re

for

re

posi

tori

es

Management TransparentResponsiveInformedGrids, Virtual Organisations, etc

Repositories TrustedOpenWell managedRepository management, curation, physical security,

etc

Repositories services Ease of useAvailabilityReliabilityDeposit, annotation, delivery, visualisation, search,

help, etc

Information AuthenticityQualityLongevity

Collections: data, work-flows, publications, learning materials, etc.

AvailableScaleableReliableNetworks, computing, HPC, physical storage, etc

Physical infrastructure

Access StandardisedStableFlexible

Authentication, authorisation, logical security, federation, portals, etc

15Source: Mário CampolargoOpen Grid Forum Barcelona, 3 June 2008 source: eSciDR study (adapted)

e-Infrastructure for repositories

Data stewardship framework

Infrastructure layerInfrastructure layer

Data layerData layer

Services layerServices layer

Metadata layerMetadata layer

PPrroodduuccttiioonn

PPrroodduuccttiioonn

AAcccceessss

AAcccceessss

PPrreesseerrvvaattiioonn

PPrreesseerrvvaattiioonn

Arc

hite

ctur

e

Lifecycle activities

16

Data stewardship framework

Production Access Preservation

Services

Metadata

Data

Infrastructure

Local

National

International

Local

National

International

Local

National

International

17

18

What’s next?

The agenda for research data over the next decade, while substantial, can be managed through collaboration. We can all play a role in furthering developments in sound data management and in advancing data stewardship.