PLoS ONE Piwowar: Sharing Detailed Research Data Is Associated with Increased Citation Rate

15
Sharing Detailed Research Data is Associated with Increased Citation Rate Heather Piwowar Roger Day and Douglas Fridsma University of Pittsburgh Published in PLoS ONE, March 27 2007 Funded by NLM Training Grant Presented at NLM Trainee Conference, June 27 2007

description

Heather A Piwowar, Roger S Day, Douglas B Fridsma (2007) Sharing Detailed Research Data Is Associated with Increased Citation Rate PLoS ONE 2: 3. e308 Abstract: Sharing research data provides benefit to the general scientific community, but the benefit is less obvious for the investigator who makes his or her data available. We examined the citation history of 85 cancer microarray clinical trial publications with respect to the availability of their data. The 48% of trials with publicly available microarray data received 85% of the aggregate citations. Publicly available data was significantly (p = 0.006) associated with a 69% increase in citations, independently of journal impact factor, date of publication, and author country of origin using linear regression. This correlation between publicly available data and increased literature impact may further motivate investigators to share their detailed research data.

Transcript of PLoS ONE Piwowar: Sharing Detailed Research Data Is Associated with Increased Citation Rate

Page 1: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Sharing Detailed Research Data is Associated with

Increased Citation Rate

Heather Piwowar Roger Day and Douglas Fridsma

University of Pittsburgh

Published in PLoS ONE, March 27 2007 Funded by NLM Training Grant

Presented at NLM Trainee Conference, June 27 2007

Page 2: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

PAST MEDICAL HISTORY: Past medical history showed she had superficial phlebitis times two in the past, had non-insulin dependent diabetes mellitus for four years. She had been hypothyroid for three years. HISTORY OF PRESENT ILLNESS: The patient is a 58-year-old female, …

Sharing research data

http://upload.wikimedia.org/wikipedia/commons/7/76/PeptideMSMS.jpg; http://en.wikipedia.org/wiki/Image:Helices.png; http://en.wikipedia.org/wiki/Image:Heatmap.png; http://en.wikipedia.org/wiki/Image:Microarray2.gif; http://zellig.cpmc.columbia.edu/medlee/demo/; htp://www.plosone.org/article/fetchArticle.action?articleURI=info:doi/10.1371/journal.pone.0000441

Page 3: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Shared data benefits science

•  Verify •  Understand •  Extend •  Explore •  Combine •  Synergize •  Train •  Reduce

Page 4: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

•  Find •  Organize •  Document •  Deidentify •  Format •  Decide •  Ask •  Submit

•  Answer questions

•  Worry about mistakes being found •  Worry about data being misinterpreted •  Worry about being scooped

•  Forgo money and IP and prestige???

But… costly for authors

Page 5: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

So what’s in it for them?

Carrot. A currency of value? Citations.

$50!

Do trials which share their data receive more citations?

Page 6: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Methods Cancer Microarray Trials

Ntzani and Ioannidis identified 85 trials published 1999-2003

Citations ISI Web of Science Citation Index, citations from 2004-2005

Data availability Publisher and lab websites, microarray databases, WayBack Internet Archive, Oncomine

Statistics Multivariate linear regression

Page 7: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Results: Eligible trials

•  85 trials •  41 (48%) made data available •  Various locations:

–  Lab websites (28) –  Publisher websites (4) –  SMD (6) –  GEO (6) –  GEDP (2)

•  6239 total citations

Page 8: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Results: Big picture

85 clinical trials used microarrays to study cancer

between 1999-2003

These 85 trials were cited 6239 times during 2004-2005

41 (48%) of these trials made their microarray data publicly available

on the internet

Trials which shared data received 5334 (85%) of these citations

Number of trials Number of citations

Page 9: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Results: Distribution of citation counts

From: Piwowar HA, Day RS, Fridsma DB (2007) Sharing Detailed Research Data Is Associated with Increased Citation Rate.

PLoS ONE 2(3): e308. doi:10.1371/journal.pone.0000308

Page 10: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Results: Multivariate regression

From: Piwowar HA, Day RS, Fridsma DB (2007) Sharing Detailed Research Data Is Associated with Increased Citation Rate.

PLoS ONE 2(3): e308. doi:10.1371/journal.pone.0000308

Page 11: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Limitations

•  Outliers – Subset analysis of lower profile papers

•  Complex timing – Additional analysis of citations within 24 months

•  Association does not imply causation – Could be common cause

Page 12: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Data sharing help on the way

•  Free, centralized databases – SMD, GEO, ArrayExpress

•  Standards – MIAME, CONSORT

•  Tools – De-id, caBIG

•  Community – Journals, Funders, Organizations, Blogs

Page 13: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Conclusions

•  70% increase in citation impact for trials which make data available

•  Result holds for lower-profile publications

•  Hopefully a motivation for authors to share data and thus maximize its usefulness

Page 14: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

For more information •  Participate in the discussion on this paper

at PLoS ONE

•  Check out blogs on Open Access, Open Data, Open Notebook Science –  Peter Suber’s Open Access News blog –  Wikipedia: “Open Data” –  Nature Editorial: May 3, 2007

•  Contact Heather Piwowar for further discussion and enthusiasm! [email protected]

Page 15: PLoS ONE Piwowar:    Sharing Detailed Research Data Is Associated with Increased Citation Rate

Thank you •  Peter Suber’s blog: “Open Access News” •  Wikipedia: “Open Data” •  Nature Editorial: May 3, 2007

I support Open Data and share my literature, code, and data whenever possible.

Long term research interest: data reuse as an underutilized informatics resource

Questions?