Ross Wilkinson - Data Publication: Australian and Global Policy Developments

Post on 10-Jan-2017

237 views 0 download

Transcript of Ross Wilkinson - Data Publication: Australian and Global Policy Developments

Research Data Publishing

International and National Trends

Ross Wilkinson Australian National Data Service Melbourne, November, 2015

1

Outline

Trends The research data assets of Australia International trends The challenges for the publication process The opportunity Conclusions

2

Some Trends:

Reproducible Science Open Science Open Data Data Citation Data Citation

Bibliometrics

Data Journals Data Repositories Trusted Data

Repositories FAIR Data Funded Fair Data

3

What’s going on?

Data is no longer a by-product of research Data is valuable Funders and Government want more from their

research investments So do research institutional leadership

4

5

The Value of Open Data Report The analysis in the report suggests that the value of data in Australia’s public research is at least $1.9 billion per annum and possibly up to $6 billion per annum – at 2012-13 levels of expenditure and activity. The report discusses the implications for Australia including if this value is not realised while recognising potential costs if its value is to be effectively leveraged.

6

7

Australian Research Data Activity Data Policy Capturing data valuable over long periods in Marine,

Astronomy, Earth Sciences, Ecosystems …for a wide range of research purposes

Supporting the storage of data Supporting the management of data Supporting the enhancement of data Building Institutional Research Data Capacity Developing data partnerships with industry?

8

Research Data Policy

ARC and NHMRC: Treat data as an asset Department of Environment: Requirement that

data is open, discoverable, and available Department of Education: The Australian Research

Data Infrastructure Strategy provides recommendations for coherent approach to research data and research data infrastructure

9

Data is Transformative

Governments are not investing in research data to make life easier for researchers

Investments in research data to enable societal problems to be addressed

This requires data to be in a form that allows a wide variety of use

10

Data as a research output – and more

• Funders are seeing research data a publishable output

• They expect data to be managed

• They expect it to be available for industry, education, the public and further research

11

Industry Education

Public Research

Data

AURIN – Urban data infrastructure

How can I increase the value of my suburban property development?

How do I make it more “liveable” to attract more buyers?

Integrate data from developers, local government, state government, federal government, mapping data, roads data, public transport maps….

Apply University of Melbourne developed “walkability” index 12

How do you develop suburbs that work for residents, developers and local government?

Along the Maribyrnong River, 10 km from Melbourne’s CBD, 128 ha of government land is ripe for redevelopment

It could accommodate 3000 dwellings and offices for 3000 people Planning a sustainable, liveable community integrated into its urban surrounds demands

information on transport, health services, environment, housing prices, recreation facilities and more

This comes from Federal and State government agencies, local councils, utilities and private companies

For Maribyrnong, data and 80 tools to manage it are being made available through the Australian Urban Research Intelligence Network (AURIN) and the Australian National Data Service (ANDS)

New tools—such as employment opportunities and walkability—are being added Similar projects can facilitate development across Australia’s cities and towns

13

Data Value

Stronger research More efficient research Stronger partnerships More industry engagment – data as a trust builder

14

Australian National Data Service:

To make Australia’s research data assets more valuable for its researchers, research institutions and the nation

15

So we need to transform: Data that are:

Unmanaged Disconnected Invisible Single use

To Structured Collections that are: Managed Connected Findable Reusable

so that researchers can easily publish, discover, access and use research data.

Value

Major Open Data Program Connecting mining data, to

research techniques, to industry exploration

Connecting twitter data to Jakarta map to analytics for managing flooding

Collecting tropical data to institutional strategy

Collecting ancient DNA for forming international partnerships for new results

17

Data Opportunities – and threats

Data sharing is great for trust development Data openness challenges traditional business

models Data partners can be anywhere – EU is investing

€1.4B in open data to drive jobs and innovation Research data environment in Australia is world

leading

18

Back to some Trends:

Reproducible Science Open Science Open Data Data Citation Data Citation

Bibliometrics

Data Journals Data Repositories Trusted Data

Repositories FAIR Data Funded Fair Data

19

20

From G. Boulton

Royal Society publishes “Science as an open enterprise” – written by Geoffrey Boulton

Influential in EU/UK

21

EU Open Data “Pilot”

1.4B Euros as part of H2020 80% take up

22

Data citation Data that is used

should be cited – just as other work is cited

Provides appropriate credit

Enables reproduction

23

DataCite provides reliability

Agreed basic information: Creator (Publication year), Title, Publisher, Identifier

Suitably formatted DOI

Data citation works with.. ORCID – for people Crossref – for papers Fundref – for funders IGSN – for specimens … Can we measure the

value? Bibliometricians arise!

24

Connection is key And the connections

should be machine operable

Research is more valuable if it is more connected

Data Journals

Geoscience Data Journal (Wiley)

Scientific Data (Nature) Journal of Open

Archaeology Data (Ubiquity)

Biodiversity Data Journal (Pensoft)

A means of describing the data – its formation, properties, usage

Enables recognition of a contribution

Enhances usage of the data

Enables “traditional” bibliometrics

25

Data Repositories:

Provide: Data storage Metadata storage Data access methods Data management software

But also: Integrated approach to content and metadata Policies, processes, services, and people Overall commitment to the stewardship of digital materials

26

Trusted data repositories

Need for reliable data Trusted repositories: Trusted Repositories Audit & Certification (TRAC) -ISO 16363 Data Seal of Approval e.g. Pacific and Regional Archive for

Digital Sources in Endangered Cultures (PARADISEC)

Often required by publishers May be increasingly required (and funded) by

research funders

27

FAIR Data – (FORCE 11) To be Findable: (meta)data are assigned a globally unique and eternally persistent identifier. data are described with rich metadata. (meta)data are registered or indexed in a searchable resource. metadata specify the data identifier. To be Accessible: (meta)data are retrievable by their identifier the protocol is open, free, and universally implementable the protocol allows for an authentication and authorization procedure, where necessary. metadata are accessible, even when the data are no longer available.

To be Interoperable: (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation (meta)data use vocabularies that follow FAIR principles. (meta)data include qualified references to other (meta)data. To be Re-usable: meta(data) have a plurality of accurate and relevant attributes. (meta)data are released with a clear and accessible data usage license (meta)data are associated with their provenance. (meta)data meet domain-relevant community standards.

28

Funded FAIR data

All of the data that support a research finding should be FAIR

It should be stored in a trusted repository It should be funded

29

30

The Opportunity

Fully integrated publication of all outputs of a scholarly endeavour with rich connection

FAIR data in a trusted repository Fully explorable scholarly journals Researchers get much better exposure of their

research The outcomes are defensible New research and partners become available

31

Conclusions

Research data is valuable It should be expected that the data underpinning

findings are available for scrutiny Far greater value is available, especially if it is

findable, accessible, interoperable and reusable This is helped if data is published

32

33 This work is licensed under a Creative Commons Attribution 3.0 Australia License

ANDS is supported by the Australian Government through the National Collaborative Research Infrastructure Strategy (NCRIS).

Thank you!