Grab a bucket! It's raining data!

Transcript
Page 1: Grab a bucket! It's raining data!

Photo: http://www.flickr.com/photos/peasap/655111542/

It’s raining data!

Grab a bucket!

Dorothea Salo

University of Wisconsin

Access 2009

Page 2: Grab a bucket! It's raining data!

the...

of Open Access

Painting: “Cassandra,” Evelyn de Morgan

Photo: http://commons.wikimedia.org/wiki/File:Cassandra1.jpeg

Page 3: Grab a bucket! It's raining data!

I’ve got nothing against

but the reality was...

Photo: http://www.flickr.com/photos/y2bk/528300692/

Page 4: Grab a bucket! It's raining data!

... blurrier.

Photo: http://www.flickr.com/photos/jennsstuff/2965783700/

goals?

means?

something for nothing?

fit between content and container?

fit between user needs and system?

and so now, I may be becoming

Page 5: Grab a bucket! It's raining data!

the...

of Data Curation?

Page 6: Grab a bucket! It's raining data!

What do we know about data?

Photo: http://www.flickr.com/photos/kentbye/2053916246/

Page 7: Grab a bucket! It's raining data!

There’s a lot of data.

Photo: http://www.flickr.com/photos/noelzialee/2126153623/

Page 8: Grab a bucket! It's raining data!

Data are there to be interacted with.

Photo: http://www.flickr.com/photos/jonevans/1032687817/

Page 9: Grab a bucket! It's raining data!

Data are wildly diverse in nature...

... as are their technical environments.

Photo: http://www.flickr.com/photos/28481088@N00/670258156/

Page 10: Grab a bucket! It's raining data!

Data are already out there.

Photo: NASA (via http://nasaimages.org/), “Multiwavelength M81”

Page 11: Grab a bucket! It's raining data!

A lot of data are analog...

... but really want to be digital.

Photo: http://www.flickr.com/photos/mrbill/3452943573/

Page 12: Grab a bucket! It's raining data!

Data are project-based.

http://www.exploringthehyper.net/

Page 13: Grab a bucket! It's raining data!

Data are sloppy.

Photo: http://www.flickr.com/photos/midorisyu/2622024163/

Page 14: Grab a bucket! It's raining data!

Data aren’t standardized.

Photo: http://www.flickr.com/photos/mikewade/3463334719/

Page 15: Grab a bucket! It's raining data!

Our Big Bucket:

the digital library

Page 16: Grab a bucket! It's raining data!

Our other Big Bucket:

the institutional repository

Page 17: Grab a bucket! It's raining data!

Impedance mismatches

Photo: http://www.flickr.com/photos/peasap/655111542/

Page 18: Grab a bucket! It's raining data!

What do we know about these?

Photo: http://www.flickr.com/photos/schex/193912573/

Page 19: Grab a bucket! It's raining data!

Carefully built and tended

http://www.collectionscanada.gc.ca/naskapi/index-e.html

Page 20: Grab a bucket! It's raining data!

Production is a Taylorist’s dream.

Photo: http://www.flickr.com/photos/villeneuve53/1808995620/

Page 21: Grab a bucket! It's raining data!

... when it isn’t a Taylorist’s nightmare.

Photo: http://www.flickr.com/photos/elsie/97542274/

Page 22: Grab a bucket! It's raining data!

What do we know about these?

Page 23: Grab a bucket! It's raining data!

We’re caged up

inside our institutions.

Photo: http://www.flickr.com/photos/annia316/115439737/

Page 24: Grab a bucket! It's raining data!

Any color...

Photo: http://commons.wikimedia.org/wiki/File:Black_Ford_Model_T_in_HK.JPG

Page 25: Grab a bucket! It's raining data!

Bring it on; we’ll take anything!

... as long as it’s static and final.

Photo: http://www.flickr.com/photos/orblivio/146691405/

Page 26: Grab a bucket! It's raining data!

Right, anything you’ve got!

... one file at a time.

Photo: http://www.flickr.com/photos/jetalone/39990302/

Page 27: Grab a bucket! It's raining data!

Any look and feel...

Page 28: Grab a bucket! It's raining data!

Any metadata you want!

... as long as it’s key-value pairs.

Photo: http://www.flickr.com/photos/rattodisabina/2460905893/

Page 29: Grab a bucket! It's raining data!

Do anything you want...

... as long as it’s “download.”

Photo: http://www.flickr.com/photos/procsilas/306417902/

Page 30: Grab a bucket! It's raining data!

Content models

Enough said.

Page 31: Grab a bucket! It's raining data!

So where does all that leave us?

Photo: http://www.flickr.com/photos/library_of_congress/2162653769/

Page 32: Grab a bucket! It's raining data!

We need bigger, better buckets.

Photo: http://www.flickr.com/photos/jonevans/1032687817/

Page 33: Grab a bucket! It's raining data!

Silos are both necessary

and unacceptable.

Photo: http://www.flickr.com/photos/jojakeman/2818910104/

Page 34: Grab a bucket! It's raining data!

We have a lot of modeling to do.

And meta-modeling.

Photo: http://www.flickr.com/photos/crobj/727348790/

Page 35: Grab a bucket! It's raining data!

We have a lot of code to write.

Photo: http://www.flickr.com/photos/fienna/170559081/

Page 36: Grab a bucket! It's raining data!

We can’t code or model in isolation.

Photo: http://www.flickr.com/photos/naus3a01/240614578/

Page 37: Grab a bucket! It's raining data!

Fedora is the new world.

But Fedora must change.

Photo: http://www.flickr.com/photos/mythwhisper/3361907495/

Page 38: Grab a bucket! It's raining data!

Solr brings it all together.

Photo: http://www.flickr.com/photos/chantrybee/2911840052/

Page 39: Grab a bucket! It's raining data!

... the

of Data Curation.

Painting: Vermeer, the Muse Clio, from “The Allegory of Painting”

Page 40: Grab a bucket! It's raining data!

This presentation is available under a Creative Commons Attribution 3.0 United States license.

Thank you!