Scott Edmunds at #FORCE2015 #bioCADDIE workshop: Pandas & Polar bears, data citation examples from...

Post on 12-Jul-2015

Pandas & Polar bears, data citation examples from GigaScience

Scott Edmunds, Executive Editor

11th Jan 2015, FORCE2015/BioCADDIE workshop

GigaDB.org: publishing data since June 2011

• >150 datasets with DataCite DOIs

• Follow & promote DataCite, DCC guidelines & FORCE11 Data Citation Principles

• Worked with BMC & other journals to ensure data is correctly cited in references

Tell people how to cite

Export to citation manager

Coming soon
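As a rough illustration of what a citation-manager export for a dataset might contain, the sketch below renders the polar bear dataset from the further reading list as a RIS record. The function name `dataset_to_ris` and the choice of fields are assumptions made for illustration, not GigaDB's actual export.

```python
def dataset_to_ris(authors, year, title, publisher, doi):
    """Render a dataset citation as a RIS record (hypothetical field mapping)."""
    lines = ["TY  - DATA"]  # RIS reference type for a data file
    lines += [f"AU  - {name}" for name in authors]  # one AU line per author
    lines += [
        f"PY  - {year}",
        f"TI  - {title}",
        f"PB  - {publisher}",
        f"DO  - {doi}",
        f"UR  - http://dx.doi.org/{doi}",  # resolvable form of the DOI
        "ER  - ",  # end-of-record marker
    ]
    return "\n".join(lines)


record = dataset_to_ris(
    authors=["Li, B", "Zhang, G", "Willerslev, E", "Wang, J", "Wang, J"],
    year=2011,
    title="Genomic data from the polar bear (Ursus maritimus)",
    publisher="GigaScience",
    doi="10.5524/100008",
)
print(record)
```

Keeping the DOI in its own field, rather than only inside a URL, lets a reference manager carry the identifier through to the reference list.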

Data+Citation: inclusion in the references

• Data submitted to NCBI databases:

- Raw data: SRA:SRA046843

- Assemblies of 3 strains: Genbank:AHAO00000000-AHAQ00000000

- SNPs: dbSNP:1056306

- CNVs, InDels, SV: dbVAR:nstd63

• Submission to public databases complemented by its citable form in GigaDB (doi:10.5524/100012).

In the references…

Is the DOI…

And in other publishers' journals…

Although not always…

Data published in GigaDB July 2011…Cell paper May 2014

Change in policy?

See: http://f1000research.com/data-policies

Data used in at least 9 pubs (6 before Cell paper)

Thomson Reuters DCI: ✗ FAIL

DataCite metadata in harvestable form (OAI-PMH)
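To make "harvestable" concrete: an OAI-PMH harvester pulls metadata with plain HTTP requests such as `ListRecords`. The sketch below only builds such a request URL. The endpoint address is an assumption to be checked against DataCite's documentation, and `list_records_url` is a hypothetical helper name.

```python
from urllib.parse import urlencode

# Assumed endpoint address; verify against DataCite's documentation before use.
BASE_URL = "https://oai.datacite.org/oai"


def list_records_url(base_url, metadata_prefix="oai_dc", set_spec=None, from_date=None):
    """Build an OAI-PMH ListRecords request URL."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        params["set"] = set_spec  # optional: restrict to one set
    if from_date:
        params["from"] = from_date  # optional: records changed since this date
    return f"{base_url}?{urlencode(params)}"


url = list_records_url(BASE_URL, from_date="2015-01-01")
print(url)
```

A full harvester would fetch this URL, parse the XML response, and keep following the `resumptionToken` it returns until the list is complete.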

Google Scholar: ✗ FAIL

- lists some DataCite DOIs, but says:

Datasets listed are the “result of approximations in the indexing algorithms.”

“Google Scholar's intended coverage is for scholarly articles. At this point, we don't include datasets.”

I’m afraid we are making promises to data creators about attribution and reward that we can’t keep. “Make your data citeable!” is the cry. Ok. So citeable is step one. Cited is step two. But for the citation to be useful, it has to be indexed so that citation metrics can be tracked and admired and used.

Who is indexing data citations right now? As far as I can tell: absolutely no one.

Research Remix, 29th May 2012: http://researchremix.wordpress.com/2012/05/29/dear-research-data-advocate-please-sign-the-petition-oamonday/

Further reading

Li, B; Zhang, G; Willerslev, E; Wang, J; Wang, J (2011): Genomic data from the polar bear (Ursus maritimus). GigaScience. http://dx.doi.org/10.5524/100008

Liu, S et al., Population Genomics Reveal Recent Speciation and Rapid Evolutionary Adaptation in Polar Bears. Cell. 2014; 157(4): 785-794

Edmunds, S., Pollard, T., Hole, B., & Basford, A. (2012). Adventures in data citation: sorghum genome data exemplifies the new gold standard. BMC Research Notes, 5 (1). http://www.biomedcentral.com/1756-0500/5/223

Promoting Data Citation in Nature (and Pushing Past Panda Problems) http://blogs.biomedcentral.com/gigablog/2012/12/21/promoting-datacitation-in-nature/

The Latest Weapon in Publishing Data: the Polar Bear http://blogs.biomedcentral.com/gigablog/2014/05/14/the-latest-weapon-in-publishing-data-the-polar-bear/