Grab a bucket! It's raining data!

40
Photo: http://www.flickr.com/photos/peasap/655111542/ It’s raining data! Grab a bucket! Dorothea Salo University of Wisconsin Access 2009

description

For Access 2009 conference. Grab a bucket, it's raining data! Library data, research data, primary data, mashed-up data, raw data, cooked data, our data, other people's data... But which bucket should we grab? And can we really, truly fit all the data in one bucket? And don't we risk turning data into sludge if we mix it all together in our bucket? Finding a bucket is the easy part. Grappling with data acquisition, modeling, discovery, and reuse is hard. How will we do it? Can we?

Transcript of Grab a bucket! It's raining data!

Page 1: Grab a bucket! It's raining data!

Photo: http://www.flickr.com/photos/peasap/655111542/

It’s raining data!

Grab a bucket!

Dorothea SaloUniversity of Wisconsin

Access 2009

Page 2: Grab a bucket! It's raining data!

the...

of Open AccessPainting: “Cassandra,” Evelyn de MorganPhoto: http://commons.wikimedia.org/wiki/File:Cassandra1.jpeg

Page 3: Grab a bucket! It's raining data!

I’ve got nothing against

but the reality was...Photo: http://www.flickr.com/photos/y2bk/528300692/

Page 4: Grab a bucket! It's raining data!

... blurrier.

Photo: http://www.flickr.com/photos/jennsstuff/2965783700/

goals?

means?

something for nothing?

fit between content and container?

fit between user needs and system?

and so now, I may be becoming

Page 5: Grab a bucket! It's raining data!

the...

of Data Curation?

Page 6: Grab a bucket! It's raining data!

What do we know about data?

Photo: http://www.flickr.com/photos/kentbye/2053916246/

Page 7: Grab a bucket! It's raining data!

There’s a lot of data.

Photo: http://www.flickr.com/photos/noelzialee/2126153623/

Page 8: Grab a bucket! It's raining data!

Data are there to be interacted with.Photo: http://www.flickr.com/photos/jonevans/1032687817/

Page 9: Grab a bucket! It's raining data!

Data are wildly diverse in nature...

... as are their technical environments.Photo: http://www.flickr.com/photos/28481088@N00/670258156/

Page 10: Grab a bucket! It's raining data!

Data are already out there.

Photo: NASA (via http://nasaimages.org/), “Multiwavelength M81”

Page 11: Grab a bucket! It's raining data!

... but really want to be digital.

A lot of data are analog...

Photo: http://www.flickr.com/photos/mrbill/3452943573/

Page 12: Grab a bucket! It's raining data!

Data are project-based.

http://www.exploringthehyper.net/

Page 13: Grab a bucket! It's raining data!

Data are sloppy.

Photo: http://www.flickr.com/photos/midorisyu/2622024163/

Page 14: Grab a bucket! It's raining data!

Data aren’t standardized.

Photo: http://www.flickr.com/photos/mikewade/3463334719/

Page 15: Grab a bucket! It's raining data!

Our Big Bucket:

the digital library

Page 16: Grab a bucket! It's raining data!

Our other Big Bucket:

the institutional repository

Page 17: Grab a bucket! It's raining data!

Impedance mismatchesPhoto: http://www.flickr.com/photos/peasap/655111542/

Page 18: Grab a bucket! It's raining data!

What do we know about these?Photo: http://www.flickr.com/photos/schex/193912573/

Page 19: Grab a bucket! It's raining data!

Carefully built and tended

http://www.collectionscanada.gc.ca/naskapi/index-e.html

Page 20: Grab a bucket! It's raining data!

Production is a Taylorist’s dream.Photo: http://www.flickr.com/photos/villeneuve53/1808995620/

Page 21: Grab a bucket! It's raining data!

when it isn’t a Taylorist’s nightmare.Photo: http://www.flickr.com/photos/elsie/97542274/

Page 22: Grab a bucket! It's raining data!

What do we know about these?

Page 23: Grab a bucket! It's raining data!

inside our institutions.

We’re caged up

Photo: http://www.flickr.com/photos/annia316/115439737/

Page 24: Grab a bucket! It's raining data!

Any color...Photo: http://commons.wikimedia.org/wiki/File:Black_Ford_Model_T_in_HK.JPG

Page 25: Grab a bucket! It's raining data!

Bring it on; we’ll take anything!

... as long as it’s static and final.Photo: http://www.flickr.com/photos/orblivio/146691405/

Page 26: Grab a bucket! It's raining data!

Right, anything you’ve got!

... one file at a time.Photo: http://www.flickr.com/photos/jetalone/39990302/

Page 27: Grab a bucket! It's raining data!

Any look and feel...

Page 28: Grab a bucket! It's raining data!

... as long as it’s key-value pairs.

Any metadata you want!

Photo: http://www.flickr.com/photos/rattodisabina/2460905893/

Page 29: Grab a bucket! It's raining data!

Do anything you want...

... as long as it’s “download.”Photo: http://www.flickr.com/photos/procsilas/306417902/

Page 30: Grab a bucket! It's raining data!

Content models

Enough said.

Page 31: Grab a bucket! It's raining data!

So where does all that leave us?

Photo: http://www.flickr.com/photos/library_of_congress/2162653769/

Page 32: Grab a bucket! It's raining data!

We need bigger, better buckets.Photo: http://www.flickr.com/photos/jonevans/1032687817/

Page 33: Grab a bucket! It's raining data!

Silos are both necessary

and unacceptable.Photo: http://www.flickr.com/photos/jojakeman/2818910104/

Page 34: Grab a bucket! It's raining data!

We have a lot of modeling to do.

And meta-modeling.Photo: http://www.flickr.com/photos/crobj/727348790/

Page 35: Grab a bucket! It's raining data!

We have a lot of code to write.Photo: http://www.flickr.com/photos/fienna/170559081/

Page 36: Grab a bucket! It's raining data!

We can’t code or model in isolation.Photo: http://www.flickr.com/photos/naus3a01/240614578/

Page 37: Grab a bucket! It's raining data!

Fedora is the new world.

But Fedora must change.Photo: http://www.flickr.com/photos/mythwhisper/3361907495/

Page 38: Grab a bucket! It's raining data!

Solr brings it all togetherPhoto: http://www.flickr.com/photos/chantrybee/2911840052/

Page 39: Grab a bucket! It's raining data!

... the

of Data Curation.Vermeer: the Muse Clio, from “The Allegory of Painting”

Page 40: Grab a bucket! It's raining data!

This presentation is available under a Creative Commons Attribution 3.0 United States license.

Thank you!