A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC –...

25
A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill

Transcript of A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC –...

Page 1: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

A Peek Inside theCarolina Digital Repository

Michael DainesDigital Repository Analyst

UNC – Chapel Hill

Page 2: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.
Page 3: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

Goals

Page 4: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

What’s in the repository?

Page 5: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

What’s in the repository?

• 41158 images• 18671 texts (PDF, Microsoft Word, text files)• 11856 audio files• 1438 datasets• 54 video files

(As of July 17, 2013)

Page 6: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

What’s in the repository?

• Research Laboratories of Archaeology35502 images (photographs and scans)

• Electronic Theses and Dissertations4035 PDFs

• BioMed Central1777 PDFs (articles)

(As of July 17, 2013)

Page 7: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

How to show what we have?

Page 8: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.
Page 9: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.
Page 10: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

“Peek”

https://github.com/UNC-Libraries/peek

Page 11: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.
Page 12: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.
Page 13: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.
Page 14: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.
Page 15: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

How do we findinteresting images?

Page 16: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

Cover pages?

Page 17: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

Random pages?

Page 18: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.
Page 19: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

How do we findinteresting images?

Query → Download → Split → Resize → Choose

Page 20: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

Query, Download

Solr queryDownload public datastreams

Page 21: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

Split, Resize

CoreGraphicsImageMagick

Page 22: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

Choose

Page 23: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

Initial set

2000 objects35855 images split

425 images for homepage

Page 24: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

Further work

• Larger sample?• Automation?• Integration with repository?• Collaborative filtering?• Image classification?• No processing step?• A/V objects?• Bias?

Page 25: A Peek Inside the Carolina Digital Repository Michael Daines Digital Repository Analyst UNC – Chapel Hill.

Try it!

https://cdr.lib.unc.edu/https://github.com/UNC-Libraries/peek

https://github.com/UNC-Libraries/peek-data