SIL rapid capture
More information on the SIL digitization program than you require
Keri Thompson
Smithsonian Institution Libraries
SPIN Rapid Capture Workshop February 16, 2012
“MORE”
Boutique Digitization
Boutique
One-offs
Item-based workflow
Tailored metadata
Hand-crafted data, much user intervention
Opportunistic staffing
Project specific grants
Illustration by A.E. Marty (1882-1974), Gazette du Bon Genre, July 1920. Smithsonian Institution Libraries
Mass Digitization
Prêt à lire
Standardization
Format-based workflow and metadata model
Automate as much as possible
Assigned staff
Funding stream
New York Millinery and Supply Co., 1901. Smithsonian Institution Libraries
Ramping Up
Find your niche
Secure Funding
Hire Staff
Purchase Equipment
Standardize on metadata, processes
Automate! (i.e., find magic automation wizard)
Our Little Corner of the Web
10 original partner institutions
Digitizing legacy literature of taxonomy
Over 50,000 titles, over 100,000 items, almost 38 million pages
Numbers!
[Chart: Digitization at SI Libraries 1999-present. Total items by year, 1999 to 2011; vertical axis 0 to 14,000]
Storage estimates
At Internet Archive: >10TB
Locally: >7.5TB
not rapid
rapid
too rapid
Funding
Multiple grants
Over multiple years
Lather, rinse, repeat
Kalamazoo Tank & Silo Co., Catalog, ca. 1909. Smithsonian Institution Libraries
Human Resources
Started in 2008 with
2 FTE technicians (Grant)
.7 FTE manager
.5 FTE cataloger
Vendor scanning only
And a host of others!
In 2012 have
1 FTE technician (Grant)
2 FTE librarians (Grant)
.3 FTE manager
1 scanning technician (Grant)
And a host of others!
International Time Recording Co., Time Recording Card Clocks, 1914, p. 12. Smithsonian Institution Libraries
Equipment
PhaseOne P65, CaptureOne
BC100, CaptureOne
Canon 5D MkII, Biblio
In-House Scanning
P65, 60.5MP camera
Strobe lights
Image capture
Filenaming
Crop, rotate
No post-processing
Convert to .tiff
Process(es)(es)
Vendor
Website presentation
Storage
Requests
Special projects
In-house use (exhibitions, brochures)
“gap-fills”
Generalized workflow
Data sources
Select & Dedupe
Check out and Ship
Harvest to Local Repository
Scanning
Check in and QC
Item available in IA/BHL
Check in
Add link
SIRIS
Workflow DB
Internet Archive
Item level metadata
Initiate workflow
Mark as scanned
URLs in MARC record
Title level MARC
JP2000s + metadata
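The "Select & Dedupe" step in the workflow can be sketched roughly as follows. This is a minimal illustration, not SIL's actual tooling: the `Candidate` record, its fields, and the choice to key on a normalized barcode are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """A hypothetical pick-list entry; field names are illustrative only."""
    barcode: str
    title: str


def select_and_dedupe(candidates, already_scanned):
    """Return candidates not yet scanned, dropping repeats within the list.

    `already_scanned` is assumed to be a collection of barcodes for items
    already digitized (locally or by a partner).
    """
    seen = {b.strip().upper() for b in already_scanned}
    selected = []
    for item in candidates:
        key = item.barcode.strip().upper()
        if key in seen:
            continue  # already scanned, or a duplicate earlier in this list
        seen.add(key)
        selected.append(item)
    return selected
```

Only the items returned here would move on to "Check out and Ship"; everything else stays on the shelf.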
Standardize Process and Data
Common staging area
Metadata Model
Title level (MARC) metadata
Item level metadata: volume, issue, date, barcode
Page level metadata: sequence, page number, page type
Common storage area
Common presentation area
Ericsson LM, Can Efficiency be Measured? Stockholm, Sweden, 1946. Smithsonian Institution Libraries
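The three-level metadata model (title, item, page) might be represented along these lines. This is a sketch under assumptions: the class names, the `marc_record_id` pointer, and the example page types are hypothetical, and only the field lists (volume, issue, date, barcode; sequence, page number, page type) come from the slide.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Page:
    sequence: int                # position in scan order, starting at 1
    page_number: Optional[str]   # printed number if any, e.g. "xii" or "12"
    page_type: str               # e.g. "Title Page", "Text" (illustrative values)


@dataclass
class Item:
    barcode: str
    volume: Optional[str] = None
    issue: Optional[str] = None
    date: Optional[str] = None
    pages: List[Page] = field(default_factory=list)

    def add_page(self, page_number, page_type):
        """Append a page, assigning the next sequence number."""
        page = Page(len(self.pages) + 1, page_number, page_type)
        self.pages.append(page)
        return page


@dataclass
class Title:
    marc_record_id: str          # hypothetical pointer to the title-level MARC record
    items: List[Item] = field(default_factory=list)
```

Keeping sequence separate from the printed page number matters because many scanned pages (covers, plates, blanks) carry no printed number at all.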
Automate Metadata Capture & Transformation
Extract title level metadata: MARC to MARCXML
Extract item level metadata: from SIRIS SQL db to xml file
Page level metadata: interface for easy data entry
File creation and conversion
Upload to staging area
National Cash Register, Annual Report, 1953. Smithsonian Institution Libraries
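As a rough illustration of the "SQL db to xml file" step for item level metadata, the sketch below reads one row and serializes it as XML. An in-memory SQLite database stands in for the real system; the table name, column names, element names, and the barcode value are all made up for the example and do not reflect the SIRIS schema.

```python
import sqlite3
import xml.etree.ElementTree as ET


def item_metadata_to_xml(conn, barcode):
    """Fetch one item row and return it as an XML string.

    Table and column names are hypothetical, not the SIRIS schema.
    """
    row = conn.execute(
        "SELECT barcode, volume, issue, date FROM items WHERE barcode = ?",
        (barcode,),
    ).fetchone()
    if row is None:
        raise KeyError(f"no item with barcode {barcode!r}")
    root = ET.Element("item")
    for name, value in zip(("barcode", "volume", "issue", "date"), row):
        if value is not None:  # skip empty fields rather than emit empty tags
            ET.SubElement(root, name).text = str(value)
    return ET.tostring(root, encoding="unicode")


# Stand-in database so the sketch runs on its own (values are invented).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE items (barcode TEXT, volume TEXT, issue TEXT, date TEXT)")
conn.execute("INSERT INTO items VALUES ('39088000000001', 'v.3', NULL, '1901')")
print(item_metadata_to_xml(conn, "39088000000001"))
```

In a real pipeline the resulting string would be written to a file next to the images in the staging area, so that item level and title level metadata travel together.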
In-house workflow with Macaw
Select & Dedupe
Check out and Ship
Temp. Backup to NAS
SIRIS
Workflow DB
Internet Archive
Item level metadata
Initiate workflow
Title level MARC
JP2000s + metadata
Scanning
Packages files for transfer
Creates metadata “Bucket”
Transforms images, creates derivatives
Page level metadata added
.tiffs
Macaw
Check in and QC
Item available in IA/BHL
Check in
Add link
URLs in MARC record
Mark as scanned
Metadata Collection and Workflow (Macaw)
Room for Improvement
Quality
Speed
Embed metadata
Kenwood Bicycle Mfg. Co., Catalogue for 1895, 1895. Smithsonian Institution Libraries
Future
Increase throughput
Scan non-book items (MSS)
Scan un-cataloged items
Frictionless repurposing
Output to METS
Islandora
Local delivery interface
Collier’s, October 18, 1952. Smithsonian Institution Libraries