Shallcross code4lib-midwest 20150724
Transcript of Shallcross code4lib-midwest 20150724
ArchivesSpace-Archivematica-Dspace Workflow Integration
Mike Shallcross Code4Lib Midwest
July 24, 2015 [email protected]
Background
• Bentley Historical Library (est. 1935)
– Official archives of U-M and repository for papers/records of individuals and organizations in MI
– 2014 reorg: consolidated physical/digital processing
– BHL currently hosts 1+ TB of digital archives in DSpace
• April 2014: $355K grant from Mellon Foundation
– Partner with U-M Library (administratively separate)
– Integrate functionality of ArchivesSpace, Archivematica, DSpace in an end-to-end workflow
7/24/2015 1
Ye Olde Days…
• 1997-2009:
– Highly manual procedures for born-digital content (minimal preservation actions)
– Very limited resources (staff, storage, IT support)
• 2010-2011: MeMail Project (email preservation grant)
– Additional staff and storage infrastructure
– Developed more robust workflows (still manual)
• 2011-2014:
– Automation of key steps in workflow: AutoPro
– Standardize metadata creation/collection
AutoPro
• Composed of 28 Windows CMD.EXE scripts
• Guides staff through a 6-step workflow with a single interface for 20+ apps/utilities
– Virus scans and checksum calculation
– Identify sensitive personal data
– Create preservation copies of files
– Arrange, package, and describe content
• Documents all actions: log files and PREMIS
• NOT a long-term solution
– Limitations of user interface/shell scripts
– Support and maintenance not sustainable
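To make the checksum step concrete, here is a minimal Python sketch of a fixity manifest. It is not AutoPro's code (AutoPro was a set of CMD.EXE scripts wrapping other utilities); the function names and the choice of SHA-256 are assumptions made for illustration, since the slides do not say which algorithm or tool was used.

```python
import hashlib
from pathlib import Path
from typing import Dict


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(root: Path) -> Dict[str, str]:
    """Map every file under root (relative POSIX path) to its checksum."""
    return {
        p.relative_to(root).as_posix(): sha256_of(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }
```

Running `build_manifest` again later and comparing the two dictionaries is one simple way to detect changed or missing files.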
Viva La Evolution!
http://blog.sharkhacks.com/2015/01/evolutionary-pains.html
Available Community Solutions
• ArchivesSpace:
– Archival management system
– Create accession records, track locations, generate archival description (EAD)
• Archivematica:
– Ingest tool: produces Archival Information Packages (AIPs)
– Extensive technical and preservation metadata
• DSpace:
– Preservation and access
– Persistent URLs, secure/managed storage, access controls
Can we make these play nice?
Goals
• Facilitate creation/reuse of metadata
• Streamline the ingest and deposit of content in repository
• Find solutions that meet Bentley needs but are flexible and scalable for others
– Modular: institutions may adopt some, none, or all
– Employ open standards so that other repository platforms could be used
• Share code and documentation with archives and digital preservation communities
Project Status:
• April-August 2014: developer search (failed)
• May 2014: MLibrary announces plans for Hydra
• Dec. 2014: budget reallocation and contract with Artefactual Systems
• Jan. 2015: Artefactual Systems site visit
– Review current procedures and collections
– Define use cases
– Analyze existing functionality of platforms, identify development needs
• Jan.-April 2015: workflow refinement, user stories, and development prioritization, plus BHL tool evaluation and data migration
• April 2015: agile development sprints begin
Key Development Tasks
• ‘Appraisal and Arrangement’ tab in Archivematica
– Characterize transfers, identify sensitive data, preview content
– Tag content for restrictions, deaccession, and intellectual arrangement of materials
• Archivematica-ArchivesSpace integration
– ASpace ‘pane’ within Archivematica interface
– Create/edit ASpace components; drag & drop content to associate with description and create SIPs
– Archivematica creates/updates records in ASpace using the API
• Automate deposit of AIPs in DSpace; update ArchivesSpace with handles
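As a rough illustration of the "create/update records in ASpace using API" task, the Python sketch below builds the JSON body for a new ArchivesSpace archival object. It is not the project's code: the helper name `build_archival_object` and the example URIs are hypothetical, and the field names follow the ArchivesSpace `archival_object` JSON model as commonly documented, so check them against the API reference for your version.

```python
import json
from typing import Optional


def build_archival_object(title: str, resource_uri: str,
                          parent_uri: Optional[str] = None,
                          level: str = "file") -> dict:
    """Build a minimal archival_object record (hypothetical helper)."""
    record = {
        "jsonmodel_type": "archival_object",
        "title": title,
        "level": level,
        # Every component must point at the resource (finding aid) it belongs to.
        "resource": {"ref": resource_uri},
    }
    if parent_uri is not None:
        # Optional: nest the component under an existing series or folder.
        record["parent"] = {"ref": parent_uri}
    return record


# Example URIs are placeholders, not real Bentley records.
payload = json.dumps(
    build_archival_object("Email, 2010-2011", "/repositories/2/resources/1")
)
```

In a live integration the payload would be sent with an authenticated POST to the repository's `archival_objects` endpoint; the sketch stops at building the body so it can run without a server.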
1. Accession Record (AS)
2. Transfer (AM)
3. Characterize Content (AM)
4. ID Sensitive Data (AM)
5. Browse and Review Content
6. Tag content (AM)
7. Associate content with archival description (AM)
8. Ingest SIPs (AM) & Create Records (AS)
9. Deposit data (from AM, via SWORD) and metadata (from AS, via ResourceSync)
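For the SWORD half of step 9, the Python sketch below assembles the HTTP headers a SWORD v2 client typically sends with a zipped package. It is an illustration only: the slides do not say which SWORD version or packaging format the integration uses, so the default `packaging` value and the helper name `build_sword_headers` are assumptions.

```python
import hashlib
from typing import Dict


def build_sword_headers(
    package: bytes,
    filename: str,
    packaging: str = "http://purl.org/net/sword/package/METSDSpaceSIP",
    in_progress: bool = False,
) -> Dict[str, str]:
    """Assemble deposit headers for a zipped package (hypothetical helper)."""
    return {
        "Content-Type": "application/zip",
        "Content-Disposition": f"attachment; filename={filename}",
        # Hex MD5 lets the server verify the package arrived intact.
        "Content-MD5": hashlib.md5(package).hexdigest(),
        "Packaging": packaging,
        "In-Progress": "true" if in_progress else "false",
    }
```

The headers would accompany a POST of the package bytes to the repository's collection URI; the handle returned by the repository is what would then be written back to ArchivesSpace.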
Systems of Record
• ASpace: administrative, descriptive, and rights metadata
• Archivematica: technical and preservation metadata
• Repository: manage content and enforce access restrictions.
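The division of responsibility above can be written down as a small lookup table. This Python sketch is only a restatement of the slide; the names `SYSTEM_OF_RECORD` and `system_for` are hypothetical and not part of any of the three platforms.

```python
# Metadata type -> system treated as authoritative for it (per the slide above).
SYSTEM_OF_RECORD = {
    "administrative": "ArchivesSpace",
    "descriptive": "ArchivesSpace",
    "rights": "ArchivesSpace",
    "technical": "Archivematica",
    "preservation": "Archivematica",
}


def system_for(metadata_type: str) -> str:
    """Return the system of record for a metadata type, or raise ValueError."""
    key = metadata_type.strip().lower()
    if key not in SYSTEM_OF_RECORD:
        raise ValueError(f"no system of record defined for {metadata_type!r}")
    return SYSTEM_OF_RECORD[key]
```

The repository is left out of the table on purpose: on the slide it manages content and enforces access restrictions rather than owning a metadata type.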
Updates and More Info…
• Society of American Archivists demo: 8/22/2015
• 2015 iPRES: poster session and workshop (with ArchivesSpace, Archivematica, and BitCurator)
• Artefactual Systems appraisal & arrangement wiki
• Artefactual Labs GitHub
• Jira user stories
• Project blog: http://archival-integration.blogspot.com/
• Twitter: @umbhlcuration
Questions?
Thanks!