Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

63
G. A. Thorisson, University of Leicester / University of Iceland Conference on Unique Identifiers, Vilnius, Feb 14 2012 - Identifying knowledge contributors - Applications of scholarly identifiers in research data publication 1 -- Overview -- Research data in the life sciences - tackling a problem with sharing Incentives/rewards, attributing credit Identifying research outputs, use/reuse and contributors Emerging solutions & interesting developments The potentially vital role of ORCID as a central hub and aggregator This work is published under the Creative Commons Attribution license (CC BY: http://creativecommons.org/licenses/by/3.0/ ) which means that it can be freely copied, redistributed and adapted, as long as proper attribution is given. Gudmundur A. Thorisson, PhD <[email protected] > Research associate, University of Leicester Guest scientist, University of Iceland Participant in the GEN2PHEN Consortium and the ORCID Technical Working Group Tuesday, 14 February 12

description

My slides from an invited talk at a conference on Unique IDs in Vilnius, Lithuania Feb 14th (http://uniqueids.org)

Transcript of Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

Page 1: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

- Identifying knowledge contributors -Applications of scholarly identifiers in research data publication

1

-- Overview --

๏ Research data in the life sciences - tackling a problem with sharing

๏ Incentives/rewards, attributing credit

๏ Identifying research outputs, use/reuse and contributors

๏ Emerging solutions & interesting developments

๏ The potentially vital role of ORCID as a central hub and aggregator

This work is published under the Creative Commons Attribution license (CC BY: http://creativecommons.org/licenses/by/3.0/) which means that it can be freely copied, redistributed and adapted, as long as proper attribution is given.

Gudmundur A. Thorisson, PhD <[email protected]> Research associate, University of Leicester

Guest scientist, University of IcelandParticipant in the GEN2PHEN Consortium and the ORCID Technical Working Group

Tuesday, 14 February 12

Page 2: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Big Science, Big Data

• High-profile examples

– High-energy particle physics - experiments performed in the Large Hadron Collider

– Astronomy - data from ground-based and space telescopes, the Virtual Observatory (VO)

2

• Doctorow, C. Big data: Welcome to the petacentre. Nature 455, 16-21 (2008). http://dx.doi.org/10.1038/455016a

• Scientific research increasingly large-scale and data-driven

Tuesday, 14 February 12

Page 3: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Big Science, Big Data

• High-profile examples

– High-energy particle physics - experiments performed in the Large Hadron Collider

– Astronomy - data from ground-based and space telescopes, the Virtual Observatory (VO)

2

• Doctorow, C. Big data: Welcome to the petacentre. Nature 455, 16-21 (2008). http://dx.doi.org/10.1038/455016a

• Scientific research increasingly large-scale and data-driven

Tuesday, 14 February 12

Page 4: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Large-scale data guides hypothesis generation

3

Gray, J. 2009. The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research

• Science paradigms– 1st: Empirical - describing natural phenomena

– 2nd: Theoretical - models, generalizations– 3rd: Computational - simulating complex phenomena4t– 4th (1+2+3): Data exploration, e-Science

Tuesday, 14 February 12

Page 5: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Large-scale data guides hypothesis generation

3

Kell and Oliver. Bioessays (2004) vol. 26 (1)

Gray, J. 2009. The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research

• Science paradigms– 1st: Empirical - describing natural phenomena

– 2nd: Theoretical - models, generalizations– 3rd: Computational - simulating complex phenomena4t– 4th (1+2+3): Data exploration, e-Science

Tuesday, 14 February 12

Page 6: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Large-scale data guides hypothesis generation

3

Kell and Oliver. Bioessays (2004) vol. 26 (1)

Gray, J. 2009. The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research

• Science paradigms– 1st: Empirical - describing natural phenomena

– 2nd: Theoretical - models, generalizations– 3rd: Computational - simulating complex phenomena4t– 4th (1+2+3): Data exploration, e-Science

Tuesday, 14 February 12

Page 7: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 4

Richards, M. et al. Paleolithic and neolithic lineages in the European mitochondrial gene pool. American journal of human genetics 59, 185-203 (1996). http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1915109/

Biological research also big and data-driven

Tuesday, 14 February 12

Page 8: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

• Huge technological advances in last 5-10 years– experimental / observations <-- gathering data with high-throughput equipment

– computer technology <-- storing & analyzing massive data volumes

5

Biological research also big and data-driven

Tuesday, 14 February 12

Page 9: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

• Huge technological advances in last 5-10 years– experimental / observations <-- gathering data with high-throughput equipment

– computer technology <-- storing & analyzing massive data volumes

5

• Example: massively-parallel sequencing– Determine human genome sequence in <1 day - the $1000 genome

– Metagenomics: sequence *everything* in environment samples– Large bio-specimen collections

• x100,0000 of individuals in disease/population biobanks

Biological research also big and data-driven

Tuesday, 14 February 12

Page 10: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Dealing with ‘Big Data’

• Managing & making sense of large-scale datasets– Data easy/cheap to generate - not so cheap to store & use

– “the $1000 genome sequence, followed by the $10,000 analysis”

• Integration & analysis - combine datasets– more data of the same type - e.g. combine sequences from multiple species– related data of different type - e.g. a person’s genome sequence + his/her phenotype

• Potential for accelerating research, creating new knowledge and (in biomedicine) improving human health.

• Founded on unrestricted sharing of scientific data deposited in the public domain

6

Tuesday, 14 February 12

Page 11: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 7

[..] If digital technologies are the engine of this revolution, digital data are its fuel. But for many scientific disciplines, this fuel is in short supply.[..]

Smith, V. Data publication: towards a database of everything. BMC Res Notes (2009) vol. 2 (1)

Data = “fuel” of science

Tuesday, 14 February 12

Page 12: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Domain repositories for sequence data

8

Tuesday, 14 February 12

Page 13: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Domain repositories for sequence data

8

• GenBank - genetic sequence repository, established 1986

Tuesday, 14 February 12

Page 14: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Domain repositories for sequence data

8

• UniProt - knowledge base for protein sequence & function

• GenBank - genetic sequence repository, established 1986

Tuesday, 14 February 12

Page 15: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Domain repositories for structure data

9

• Protein Data Bank - structure of proteins, nucleic acids and other bio-molecules

Tuesday, 14 February 12

Page 16: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

• The human genome sequence– International Human Genome project - mandatory rapid data sharing, the Bermuda

principles

• Pattern of variation in the human genome– International Haplotype Map Project - genotyping population samples– 1000 Genomes Project - sequencing population samples

• Examples of community resource projects - large-scale data generation for the purpose of making the data available for broad reuse

10

Large-scale genome projects

Tuesday, 14 February 12

Page 17: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Data sharing in the “long tail”• Biology is complex, so data are often very

heterogeneous• Technologies changing rapidly• Lots of small research projects• Lots of small/medium datasets• Data in the long tail usually *not* shared

OR not shared in a useful way

11

The ‘long tail’ of dark bio-data

• Contrasts with other data-intensive disciplines with– a long history of sharing research data - a “culture of sharing”

– big, expensive, shared facilities = the only way to do this kind of research– relatively homogeneous datasets, easier to scale up to big volumes (e.g. telescope images)

Tuesday, 14 February 12

Page 18: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

The “data sharing problem’ in life sciences

12

Tuesday, 14 February 12

Page 19: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 13

Tuesday, 14 February 12

Page 20: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 13

Lots of published knowledge but impossible to to go back reproduce

work and validate findings

analysedsynthesisedinterpreted

Information

published

Knowledge

Publication

Data

Opportunity for maximising the value of data through reuse is wasted

Tuesday, 14 February 12

Page 21: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

- Too much work, no incentive for me -

Koslow. Should the neuroscience community make a paradigm shift to sharing primary data?. Nat Neurosci (2000) vol. 3 (9)

‘Genuine’ reasons for not sharing freely, e.g.:-ethical: patient confidentiality / consent-IP: intellectual property

- The ‘My Data’ syndrome -‘ownership’ vs ‘stewardship’

14

LOTS of different reasons for not sharing

- My competitors can take the data and ‘scoop’ me -

- No suitable repository -

Tuesday, 14 February 12

Page 22: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

- Too much work, no incentive for me -

Koslow. Should the neuroscience community make a paradigm shift to sharing primary data?. Nat Neurosci (2000) vol. 3 (9)

‘Genuine’ reasons for not sharing freely, e.g.:-ethical: patient confidentiality / consent-IP: intellectual property

Most reasons can be dismissed by referring to basic premise of scientific discourse

- The ‘My Data’ syndrome -‘ownership’ vs ‘stewardship’

14

LOTS of different reasons for not sharing

- My competitors can take the data and ‘scoop’ me -

- No suitable repository -

Tuesday, 14 February 12

Page 23: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 15

Journals increasingly require data to be available

“Provide supporting data in a repository OR we won’t publish your paper”

16

Sharing currently driven by mandates

Tuesday, 14 February 12

Page 24: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 15

Journals increasingly require data to be available

“Provide supporting data in a repository OR we won’t publish your paper”

Funders increasingly require data sharing plan & budget baked into grant proposals.

“Publish data we are funding you to generate OR we will not fund your research again”

16

Sharing currently driven by mandates

Tuesday, 14 February 12

Page 25: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 15

Using just a stick gets you so far

Journals increasingly require data to be available

“Provide supporting data in a repository OR we won’t publish your paper”

Funders increasingly require data sharing plan & budget baked into grant proposals.

“Publish data we are funding you to generate OR we will not fund your research again”

16

Sharing currently driven by mandates

Tuesday, 14 February 12

Page 26: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 15

Using just a stick gets you so far

Journals increasingly require data to be available

“Provide supporting data in a repository OR we won’t publish your paper”

Funders increasingly require data sharing plan & budget baked into grant proposals.

“Publish data we are funding you to generate OR we will not fund your research again”

16

Sharing currently driven by mandates

Mandates likely to ensure onlyminimum compliance - AND - often are not enforced

Tuesday, 14 February 12

Page 27: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 16

Tuesday, 14 February 12

Page 28: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Complimentary strategies to encourage sharing

- Make it easy -- Make it useful - - Make it citable -

17

Tuesday, 14 February 12

Page 29: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 184

Prof Anthony J Brookes GEN2PHEN coordinatorChair, Bioinformatics and GenomicsDepartment of GeneticsUniversity of Leicester, UK

Tuesday, 14 February 12

Page 30: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Researcher IDs in GEN2PHEN

• The aim is to– create the mechanisms for researchers to get credit for their contributions to scientific

knowledge, even if contributions don’t fit into* the human-readable ‘aliquots’ of a conventional journal article* some vague notion of attributing credit by way of position in authorlist

– ..and thereby provide incentives for the kind of thankless work that people now do without any kind of reward.

• Main use cases being explored– Research datasets in disease genetics– Database management and curation

19

Tuesday, 14 February 12

Page 31: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Key building blocks: the 3 I’s

Identifying the scholarly work or research output– e.g. Bell et al. 2009. Science 323(5919) doi:10.1371/journal.pone.0024357

Identifying use/reuse - measuring impact– e.g number of full-text downloads for article X first 6 months after publication

Identifying contributors - attributing credit– e.g. who contributed to highly-cited resource X?

20

Tuesday, 14 February 12

Page 32: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Identifying resources - persistent, unique IDs

• The problem: unstable URL references to online digital materials– http://www.le.ac.uk/~gt50/files/important_data/env2012.dat

– URL decay, tend to have short half-life

• URLs are locators for online resources - need persistent identifiers

21

Tuesday, 14 February 12

Page 33: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 22

Identifying resources - persistent, unique IDs

• Articles published in scientific journals– Digital Object Identifiers (DOIs) issued to papers via CrossRef

– Identifier resolves to metadata record, link to current location of content• Example: http://dx.doi.org/10.1038/nbt1109-984b => ‘landing page’ on journal website

– Content moves => DOI metadata is updated => stable identifier points to new location

Tuesday, 14 February 12

Page 34: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 22

Identifying resources - persistent, unique IDs

• Articles published in scientific journals– Digital Object Identifiers (DOIs) issued to papers via CrossRef

– Identifier resolves to metadata record, link to current location of content• Example: http://dx.doi.org/10.1038/nbt1109-984b => ‘landing page’ on journal website

– Content moves => DOI metadata is updated => stable identifier points to new location

Tuesday, 14 February 12

Page 35: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 23

Identifying resources - persistent, unique IDs

• Biological datasets published in digital repositories– Institutional vs. domain-specific, often offer Handles or other

persistent ID scheme

– DataCite Consortium created in 2009– to promote the use of ‘data DOIs’– Data Dryad, FigShare, others

Tuesday, 14 February 12

Page 36: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 23

Identifying resources - persistent, unique IDs

• Biological datasets published in digital repositories– Institutional vs. domain-specific, often offer Handles or other

persistent ID scheme

– DataCite Consortium created in 2009– to promote the use of ‘data DOIs’– Data Dryad, FigShare, others

Tuesday, 14 February 12

Page 37: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 24

Identifying resources - persistent, unique IDs

• Biological databases, added-value from expert curation– No identification scheme - acknowledgement based conventional papers, URLs, names

– e.g. http://www.uniprot.org, GenBank, Database of COL3A1 gene mutations – The new BioDBCore initiative is building the foundations for a registry of bio-databases

– Could become a kind of ‘journal’ for database resources– Example persistent resource URI: http://identifiers.org/biodbcore/COL

Tuesday, 14 February 12

Page 38: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 24

Identifying resources - persistent, unique IDs

• Biological databases, added-value from expert curation– No identification scheme - acknowledgement based conventional papers, URLs, names

– e.g. http://www.uniprot.org, GenBank, Database of COL3A1 gene mutations – The new BioDBCore initiative is building the foundations for a registry of bio-databases

– Could become a kind of ‘journal’ for database resources– Example persistent resource URI: http://identifiers.org/biodbcore/COL

Tuesday, 14 February 12

Page 39: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Identifying reuse - beyond the Impact Factor

• Reliance on citations and citation-based metrics– ISI Impact Factor really a way to measure influence of scholarly journals– Not going away: citation remains the gold standard

• Lots of other use/reuse indicators for impact of scholarly articles– Focus on the impact of the *publication* itself, not the journal in which it appears– Indicators: no. full-text downloads, tweets (i.e. mentions on Twitter), social bookmarking– AltMetrics - a growing grassroots movement “ to better measure and reward all the

different ways that people contribute to the messy and complex process of scientific progress [..] born out of a simple recognition: Many of the traditional measurements are too slow or simplistic to keep pace with today’s Internet-age science”

– Lots new tools emerging to explore possibilities in this space

25

Tuesday, 14 February 12

Page 40: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 26

Tuesday, 14 February 12

Page 41: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 26

Tuesday, 14 February 12

Page 42: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 27

Tuesday, 14 February 12

Page 43: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Identifying reuse - beyond the Impact Factor

• Reliance on citations and citation-based metrics– ISI Impact Factor really a way to measure influence of scholarly journals– Not going away: citation remains the gold standard

• Lots of other use/reuse indicators for impact of scholarly articles– Focus on the impact of the *publication* itself, not the journal in which it appears– Indicators: no. full-text downloads, tweets (i.e. mentions on Twitter), social bookmarking– AltMetrics - a growing grassroots movement “ to better measure and reward all the

different ways that people contribute to the messy and complex process of scientific progress [..] born out of a simple recognition: Many of the traditional measurements are too slow or simplistic to keep pace with today’s Internet-age science”

– Lots new tools emerging to explore possibilities in this space

• Key obstacle: access to citation links, downloads, ‘clickstream’ data28

Tuesday, 14 February 12

Page 44: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

[Or: what’s ORCID go to do with all this??]

• Vast majority of repositories and database currently rely on names for identifying creators/curators/persons in general => leads to disambiguation problems

29

Identifying contributors - the common thread

Tuesday, 14 February 12

Page 45: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 30

Identifying contributors - the common thread

Tuesday, 14 February 12

Page 46: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

– For content, many different identifier schemes will be needed, because there’s a range of different types of scholarly content

– For contributors, a single, unified identifier scheme is needed, because a person will tend to contribute to various types of content over his/her career

• What kind of contribution? – Conventional authorship – Analytical, data managing, technical vs. intellectual / scientific, support roles etc.– Editing / peer-review, incl. future post-publication review systems

• Complications relating to attributing credit– like High-ener paper, >1000 authors. they’re not really all *authors* are they?

• Intellectual input vs. pure support?

• designed experiment, ran key analysis, technical support staff etc.

31

Identifying contributors - the common thread

Tuesday, 14 February 12

Page 47: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

• But, wait! there’s many more ways of contributing to science than publishing papers/data and curating databases!

21st century ‘Digital scholars’ should also be credited for contributions to...

32

Tuesday, 14 February 12

Page 48: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

... Wikipedia articles

33

Tuesday, 14 February 12

Page 49: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

... open-source software projects

34

Tuesday, 14 February 12

Page 50: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

... distributed analytical workflows + other digital research objects (published e.g. on myExperiment.org)

35

Tuesday, 14 February 12

Page 51: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 36

• .. and many more!– future post-publication open peer-review systems

– stuff we haven’t even thought of

• ORCID can potentially have a pivotal role here– as source of scholarly identities– as central aggregator & index of data on links between author & content

Tuesday, 14 February 12

Page 52: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Lessons from social networking domain

• We NOW HAVE:– Lots of 3rd party applications built around social IDs that users already have on Twitter,

Facebook and other sites

• We WILL HAVE in the near future:– online scholarly communication tools built around ORCID IDs that scholars already

have– Rich ecosystem of ‘ORCID apps’ – Lower the barrier to participation - tackle the “multiple profiles syndrome”

• The 1st-generation ORCID central service will support this

37

Tuesday, 14 February 12

Page 53: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Technologically, this is not rocket science

38

Tuesday, 14 February 12

Page 54: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Technologically, this is not rocket science

38

Tuesday, 14 February 12

Page 55: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Single sign-on with social IDs

39

Tuesday, 14 February 12

Page 56: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Some points for discussion

• Is ORCID the right way forward?• International organization with many stakeholders

– Big mass, moves slowly, complex politics, BUT this also gives it ‘oomph’

• The upcoming contributor ID service will be centralized– Single place to point to when promoting in community – Developer-friendly: integration app with a single service

• The organization & registry is open (enough)– Enable reuse & collaboration with others, BUT slices of data will be for members only

• Engaging with end users - scholars to manage their profile• Journal publishers are involved and committed

– Will be key to adoption via journal article submission, BUT what about anti-OA and other questionable practices?

40

Tuesday, 14 February 12

Page 57: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 41

Tuesday, 14 February 12

Page 58: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 41

Tuesday, 14 February 12

Page 59: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 41

Tuesday, 14 February 12

Page 60: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 41

Tuesday, 14 February 12

Page 61: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 4229

An online identity as a digital scholar, anchored in a ORCID profile?

Tuesday, 14 February 12

Page 62: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012 4229

ORCID ID: B-1242-2010G. Thorisson, Univ. LeicesterG. A. Thorisson, Univ. LeicesterG. A. Thorisson, Cold Spring Harbor Lab.

An online identity as a digital scholar, anchored in a ORCID profile?

Tuesday, 14 February 12

Page 63: Value of Unique IDs in Academia, Vilnius - Identifying knowledge contributors

G. A. Thorisson, University of Leicester / University of Iceland

Conference on Unique Identifiers, Vilnius, Feb 14 2012

Acknowledgements GEN2PHEN Consortium

http://www.gen2phen.org/about-gen2phen/partners

Prof Anthony J. Brookes Bioinformatics Group, Leicester

43

This work has received funding from the European Community's Seventh Framework Programme (FP7/2007-2013)under grant agreement number 200754 - the GEN2PHEN project.

Contact me!

<[email protected]> |<[email protected]>http://www.linkedin.com/in/mummihttp://www.twitter.com/gthorisson

http://www.gthorisson.namePublished under the CC BY license (http://creativecommons.org/licenses/by/3.0/)

Tuesday, 14 February 12