Islandora Overview: PASIG May 2013

180
Islandora Overview Mark Leggott, University of PEI/DiscoveryGarden PASIG - Washington DC May, 2013

description

An overview of the Islandora project and open source framework, including sample productions sites. Islandora is a digital asset management system that can accommodate any type of data, and is designed for digital library collections, research data, enterprise document management, and more.

Transcript of Islandora Overview: PASIG May 2013

Page 1: Islandora Overview: PASIG May 2013

Islandora OverviewMark Leggott, University of PEI/DiscoveryGarden

PASIG - Washington DC May, 2013

Page 2: Islandora Overview: PASIG May 2013

Note: Red text indicates a link.

Page 3: Islandora Overview: PASIG May 2013

Open Source

Page 4: Islandora Overview: PASIG May 2013
Page 5: Islandora Overview: PASIG May 2013
Page 6: Islandora Overview: PASIG May 2013

Islandora 101

Page 7: Islandora Overview: PASIG May 2013

Project Foundations

• Developed at University of PEI (2007)

• UPEI has FT staff and project staff (AIF)

• DiscoveryGarden is commercial services/support company - sustainability

• 25+ staff at DGI, 6 at UPEI

• Both teams maintain/contribute to code

Page 8: Islandora Overview: PASIG May 2013

Conceptualizing

Initializing

Creating/Analyzing

Reporting

Formalizing

Popularizing

Research Institutes

Libraries & Archives

Museums

Media Organizations

Health Centres

Government Agencies

Private Companies

Universities & Colleges

NGOs & Non-ProfitsOther

Access Collaboration Preservation

E-Mail, Letters, Published Research,

Requirements

Meeting Minutes, Grants, Data Collection,

Acquisitions

Forms, Data, Cataloguing,

Findings, Discussion

Reports, Theses, Datasets, Visualizations

Articles, Curricular Content, Policies,

Exhibits

Blogs, Twitter, Newspapers, iTunesU, Flickr

Information Life Cycle

Page 9: Islandora Overview: PASIG May 2013

Object Space

User Space

Individual

Group

Department

Museum

University

External

Private Shared Open

Collabora

te

Publish

Re-U

se

Cre

ate

Preservation, Migration, Transformation

Page 10: Islandora Overview: PASIG May 2013

Basics

• Drupal+Fedora+Other OS = ecosystem

• Flexible UI on top of Fedora + other apps

• Support for 180+ languages via Drupal

• Focus on robust preservation features and services + flexibility in data models and UI

• VM/code, documentation, lists, Camps

Page 11: Islandora Overview: PASIG May 2013

Key Components

• Core - Islandora, Tuque, Solr, XML Forms, FITS, Workflow, Solution Packs (SPs)

• SPs - add specific+tested functionality

• Image, Large Image, PDF, Audio, Video, Book/Paged Image Document

• Newspapers, Digital Humanities, IR, Chem

• DuraCloud integration via Vault Module

Page 12: Islandora Overview: PASIG May 2013
Page 13: Islandora Overview: PASIG May 2013

Drupal UI

HTML

CSS/Themes

User Roles/PermissionsEditorial WorkFlow

Modules (LDAP/BibUtils/Etc

HooksSPARQL

LDAP

FCK

BibUtils

FormsAPI

ImageAPI

Tabs

Page 14: Islandora Overview: PASIG May 2013

Drupal UI

HTML

CSS/Themes

User Roles/PermissionsEditorial WorkFlow

Modules (LDAP/BibUtils/Etc

HooksSPARQL

LDAP

FCK

BibUtils

FormsAPI

ImageAPI

Tabs

XSLTs PHP/Python Snippets

Micro Services Engine

Tika

Kakadu

SWFTools

OOffice

Djatoka

Tesseract

R

Wowza

Islandora

JMS JMS

Code Snippets/Applications

Page 15: Islandora Overview: PASIG May 2013

Drupal UI

HTML

CSS/Themes

User Roles/PermissionsEditorial WorkFlow

Modules (LDAP/BibUtils/Etc

HooksSPARQL

LDAP

FCK

BibUtils

FormsAPI

ImageAPI

Tabs

XSLTs PHP/Python Snippets

Micro Services Engine

Tika

Kakadu

SWFTools

OOffice

Djatoka

Tesseract

R

Wowza

Islandora

JMS JMS

Code Snippets/Applications

Process

Any Metadata&

Any Data

Solr/Lucene

GSearch

Mulgara

MySQL

Content Models XACML Policies

Fedora

SPARQLREST

SOAP

Page 16: Islandora Overview: PASIG May 2013

Drupal UI

HTML

CSS/Themes

User Roles/PermissionsEditorial WorkFlow

Modules (LDAP/BibUtils/Etc

HooksSPARQL

LDAP

FCK

BibUtils

FormsAPI

ImageAPI

Tabs

XSLTs PHP/Python Snippets

Micro Services Engine

Tika

Kakadu

SWFTools

OOffice

Djatoka

Tesseract

R

Wowza

Islandora

JMS JMS

Code Snippets/Applications

Process

Any Metadata&

Any Data

Solr/Lucene

GSearch

Mulgara

MySQL

Content Models XACML Policies

Fedora

SPARQLREST

SOAP

Page 17: Islandora Overview: PASIG May 2013

• Fedora Object Model

• Flexibility supports any data model

• Atomistic and compound objects

• Support for RDF allows integration of specific ontologies

imagined:208361 (PID)

Object Properties

Relations (RELS-EXT)

Dublin Core (DC)

Audit Trail (AUDIT)

JP2K Web (JP2)

JP2K Archival (LOSSLES_JP2)

Low Res JPEG (JPG)

Thumbnail (TN)

Descriptive Metadata (MODS)

Object Model - IslandImagined/Large Image

Digital Object Identifier

System PropertiesManage & Track Object

Reserved DatastreamsKey Object Metadata

DatastreamsAggregates Content Items

Page 18: Islandora Overview: PASIG May 2013

• MicroServices

• PHP/Python/Java

• Drives integration of external services for data transformation +

• Log via Fedora audit

• Taverna integration

Page 19: Islandora Overview: PASIG May 2013

• Simple Workflow Module

• Simple approach to Editorial Workflow

• Provides “human” nodes in the services framework

• Upcoming version support more granular controls and workflow states/actions

Page 20: Islandora Overview: PASIG May 2013
Page 21: Islandora Overview: PASIG May 2013

• XML Form Builder

• Create a rich form for any XML schema

• Multiple forms for specific schemas

• Control access via security policies

Page 22: Islandora Overview: PASIG May 2013
Page 23: Islandora Overview: PASIG May 2013

Administration

• Flexible admin options

• Standard Drupal admin functions

• + ability to maintain aspects of Fedora and other apps via Admin interfaces

• Solution Packs increasingly adding greater configuration options

Page 24: Islandora Overview: PASIG May 2013
Page 25: Islandora Overview: PASIG May 2013
Page 26: Islandora Overview: PASIG May 2013
Page 27: Islandora Overview: PASIG May 2013
Page 28: Islandora Overview: PASIG May 2013

Preservation Services

• Fedora provides robust service framework

• TechDS+DescDS+RightsDS+AuditDSs transformed to a Dynamic PREMIS record

• Adding DuraCloud support via “Vault”

• Adding Archivematica integration as an optional preservation component

Page 29: Islandora Overview: PASIG May 2013

Islandora Community

Page 30: Islandora Overview: PASIG May 2013

Community

• Estimate 150+ Islandora sites worldwide in production or development

• 500+ people on Google Groups List

• Some projects starting to contribute back

• Libraries bulk of use now, but includes museums, archives, private companies

Page 32: Islandora Overview: PASIG May 2013
Page 33: Islandora Overview: PASIG May 2013
Page 34: Islandora Overview: PASIG May 2013
Page 35: Islandora Overview: PASIG May 2013

discoverygarden

• Commercial UPEI spin-off - full service

• Installation, Configuration, Customization

• Support, System Audit, Consulting

• Hosting, Platforms, Vendor partnerships

• Primary codebase contributor

Page 36: Islandora Overview: PASIG May 2013

DGI and Oracle

• discoverygarden working with Oracle to test/certify Islandora on Oracle systems

• SAM/QFS optimization for HFS

Page 37: Islandora Overview: PASIG May 2013

• Non-profit membership organization

• Provides members with a range of services, including Islandora hosting/setup

• Shared/Individual/Group repositories

• Working with discoverygarden to provide customization services when desired

Page 38: Islandora Overview: PASIG May 2013

Code

Page 39: Islandora Overview: PASIG May 2013

Releases

• General goal is to release 4 times per year, or now 7-8 with 2 versions

• Latest “Islandora 6” for March

• First full “Islandora 7” for March

• Goal is to release bug fixes for 6, focus on new developments in 7

Page 41: Islandora Overview: PASIG May 2013
Page 42: Islandora Overview: PASIG May 2013
Page 43: Islandora Overview: PASIG May 2013
Page 44: Islandora Overview: PASIG May 2013
Page 45: Islandora Overview: PASIG May 2013
Page 46: Islandora Overview: PASIG May 2013

Islandora 6

• March 2013 Release

• Improved documentation, print book

• XACML Editor, Workflows

• Forms Autocomplete, FITS integration

• Smoother SP Installation

Page 47: Islandora Overview: PASIG May 2013

Islandora 7

• 1st full release for Drupal 7

• New admin interface/functions

• All new SPs, SeaDragon, IAV

• Complete integration of Tuque API

• Clip tool for SeaDragon

Page 48: Islandora Overview: PASIG May 2013

Contribs

• WARC SP (Nick Ruest, York)

• Administrative Dashboard (Peter MacDonald, Hamilton)

• Relationship Editor/Ontology Management (Giancarlo Birello and Rosie Le Faive)

• Batch Ingester (Colorado Alliance)

• Black Thumbnail Bug (Aaron Collie)

Page 49: Islandora Overview: PASIG May 2013

Standard SPs

• Image, Large Image, Audio, Video, Book, PDF, Newspaper

• Includes MODS for, DC mapping, sample data, viewer(s), TechMD extraction

• Solution Pack module makes it easier to create new ones, modify existing

Page 50: Islandora Overview: PASIG May 2013

Book SP

• Code simplified and made more modular

• Can enable IA viewer for books, Open Seadragon for page images

• Tesseract OCR support standard

• Page manipulation, PDF creation

Page 51: Islandora Overview: PASIG May 2013

Image/Large Image SP

• GIF/PNG/JPG + TIFF/J2K support

• Conversion of TIFF to J2K

• DC + MODS

• Option to use OpenSeadragon viewer

Page 52: Islandora Overview: PASIG May 2013

Tools Modules

• FITS Extractor, creates technical metadata

• Batch Import (RIS, EndNote, PubMed, DOI)

• OCR, Tesseract with OCR/HOCR

• MARCXML, ingest and view MARC data

• XACML Editor, rebuilt XML FormBuilder

Page 53: Islandora Overview: PASIG May 2013

Bridge

• Upcoming module which will allow tighter integration with Drupal, using Ver 7 Nodes or Entities

• Create content via Drupal - sync’d to Fedora and visa versa

• Facilitate re-creation of entire repo, including interface a future goal

Page 54: Islandora Overview: PASIG May 2013

DropBox

• Alpha module provides sync between DropBox and Islandora

• Creates Collection objects for each folder and a separate file object for each contained file with all relationships

• Provides basic DC record for metadata

• Upcoming for Google Drive, DataFlow ++

Page 55: Islandora Overview: PASIG May 2013

Fedora 4

• Fedora Futures project to review & rebuild Fedora for next major release

• Looking to provide better support for large files, large collections and optimized ingest

• Pilot project us using ModeShape as the core repository

• Islandora team already has pilot integration

Page 56: Islandora Overview: PASIG May 2013

Roadmap

• More SPs: Research Data, Digital Humanities, Chemistry, Conferences

• Image Annotation tool (Shared Canvas from Stanford - OAC compliant)

• Full Bridge development

• Integration of Microservices + Taverna

Page 57: Islandora Overview: PASIG May 2013

Trying Islandora

• Try production sites (list on last slide)

• Play in sandbox.islandora.ca (cleaned daily)

• Download VM from islandora.ca

• Install code referring to documentation

• iCamps: PEI, Europe, Australia, US east+west

• Documentation: Jira, videos, GitHub, Jenkins

Page 58: Islandora Overview: PASIG May 2013

Islandora in the Cloud

Page 59: Islandora Overview: PASIG May 2013

DuraCloud

• UPEI and DGI committed to supporting DuraCloud in the Islandora interface

• Works with CloudSync as the bridge between Fedora and DuraCloud

• Can be used with or without Islandora managed collections

Page 60: Islandora Overview: PASIG May 2013

DGI Examples• DGI 5 Islandora clients using DC + backup

• Largest has 2 TB of mostly image J2Ks

• full site (objects/MySQL/Drupal) with DC and DCStool using Continuous mode

• + backup up using Zmanda/S3: D/W/M/Y

• + experimental backup to Glacier

• 3 sites using DC/DCS for full backup of IR

Page 61: Islandora Overview: PASIG May 2013

Islandora Vault Module

• New module for managing DC+CS services

• “Vault” component on Manage Tab

• Manage CS sets/tasks for Collections

• View Health Check at Object level (e.g. check for matching checksums)

• Defining default actions for mismatches

Page 62: Islandora Overview: PASIG May 2013

Vault Setup

Page 63: Islandora Overview: PASIG May 2013
Page 64: Islandora Overview: PASIG May 2013

Collection Restore

Page 65: Islandora Overview: PASIG May 2013
Page 66: Islandora Overview: PASIG May 2013

Object Health

Page 67: Islandora Overview: PASIG May 2013

Reports

Page 68: Islandora Overview: PASIG May 2013

Next Steps

• Tighter integration and more UI functions

• Automated recovery (Auto vs Manual)

• Full Fedora/Collection restore

• Support for private DuraCloud instances

• Add integration with Glacier+

Page 69: Islandora Overview: PASIG May 2013

DGI Platforms

• Islandora Platform solutions from discoverygarden released at OR in July

• Initial offerings IR and Digital Collections

• 1-button setup/payment/management

• Additional platforms before end of 2013

• Research Data and Digital Humanities

Page 70: Islandora Overview: PASIG May 2013
Page 71: Islandora Overview: PASIG May 2013
Page 72: Islandora Overview: PASIG May 2013

Sustainability

• Non-profit Islandora Foundation will help maintain code, documentation, training, community participation and more

• Membership model

• Partner - $10K, Board, Resources, Camps

• Collaborator - $4K, Roadmap

• Member - $2K, links

Page 73: Islandora Overview: PASIG May 2013

Progress

• Non-profit registered

• UPEI and discoverygarden Partners

• Commitment from other members in 1st month sufficient to hire 1 staff person

• Goal is to have 2 FT staff by Fall 2013

Page 74: Islandora Overview: PASIG May 2013

Research Data

Page 75: Islandora Overview: PASIG May 2013

@

Physical Data Model

• UPEI/DGI developing a generic data tool to work with systems researchers use now

• Provide a range of filesystem sync tools

• Minimal service - store data in repository

• Enhance with metadata, transform services

• Project metadata CASRAI/VIVO/CERIF +

Page 76: Islandora Overview: PASIG May 2013

@

Fedora Repository

DescMDTechMD

AdminMDAssets

Local File System

DropBox

Box.net

DataStage

Google Drive

Private Cloud

Storage

Generic Research Data SP

(+ Standard SPs, Viewers)

Sync

Extract

Transform

Enrich

Check

MintTaverna DataCite

FITS + Authority

Islandora Generic Research Data Architecture

Islandora Framework

Islandora VRE (Virtual

Research Environment)

Islandora IR (Research Articles)

BackupsRegional &

National TDRs

Page 77: Islandora Overview: PASIG May 2013

@

Intellectual Data Model

• Smithsonian/DGI developing Sidora system to respond to specific research data needs

• Custom interface, Content Models and Forms, adding Taverna/R integration

• Camera trap images, archaeological data, carbon sequestration data

• File browse interface for all operations

Page 78: Islandora Overview: PASIG May 2013

@

Fedora Repository

DescMDTechMD

AdminMDAssets

Image SP + FGDC,

DwC

Numeric Data SP +

FGDC, DDI

Panama Dig Data +

LIDO

Research Articles

Sidora Application

Taverna R

FITS + Authority

The Smithsonian Data Architecture

Islandora Framework

Sidora

Page 79: Islandora Overview: PASIG May 2013

@

Intellectual Data Model

Physical Data Model

Page 80: Islandora Overview: PASIG May 2013

@

Page 81: Islandora Overview: PASIG May 2013

@

Page 82: Islandora Overview: PASIG May 2013

@

Page 83: Islandora Overview: PASIG May 2013

@

Additional

• Domain specific Solutions Packs for 2013

• Digital Humanities

• Chemistry

• Biodiversity

• Taverna+R++ integration

Page 84: Islandora Overview: PASIG May 2013

Examples

Page 85: Islandora Overview: PASIG May 2013

Institutional Repository

Page 86: Islandora Overview: PASIG May 2013
Page 87: Islandora Overview: PASIG May 2013
Page 88: Islandora Overview: PASIG May 2013
Page 89: Islandora Overview: PASIG May 2013
Page 90: Islandora Overview: PASIG May 2013
Page 91: Islandora Overview: PASIG May 2013
Page 92: Islandora Overview: PASIG May 2013
Page 93: Islandora Overview: PASIG May 2013
Page 94: Islandora Overview: PASIG May 2013
Page 95: Islandora Overview: PASIG May 2013
Page 96: Islandora Overview: PASIG May 2013
Page 97: Islandora Overview: PASIG May 2013
Page 98: Islandora Overview: PASIG May 2013
Page 99: Islandora Overview: PASIG May 2013
Page 100: Islandora Overview: PASIG May 2013
Page 101: Islandora Overview: PASIG May 2013
Page 102: Islandora Overview: PASIG May 2013
Page 104: Islandora Overview: PASIG May 2013
Page 105: Islandora Overview: PASIG May 2013
Page 106: Islandora Overview: PASIG May 2013

Digital Collections

Page 108: Islandora Overview: PASIG May 2013
Page 109: Islandora Overview: PASIG May 2013
Page 110: Islandora Overview: PASIG May 2013
Page 112: Islandora Overview: PASIG May 2013
Page 113: Islandora Overview: PASIG May 2013
Page 114: Islandora Overview: PASIG May 2013
Page 116: Islandora Overview: PASIG May 2013
Page 117: Islandora Overview: PASIG May 2013
Page 118: Islandora Overview: PASIG May 2013
Page 119: Islandora Overview: PASIG May 2013
Page 120: Islandora Overview: PASIG May 2013
Page 121: Islandora Overview: PASIG May 2013
Page 126: Islandora Overview: PASIG May 2013
Page 127: Islandora Overview: PASIG May 2013
Page 128: Islandora Overview: PASIG May 2013
Page 129: Islandora Overview: PASIG May 2013
Page 130: Islandora Overview: PASIG May 2013
Page 131: Islandora Overview: PASIG May 2013
Page 132: Islandora Overview: PASIG May 2013
Page 133: Islandora Overview: PASIG May 2013
Page 134: Islandora Overview: PASIG May 2013

Research Data

Page 135: Islandora Overview: PASIG May 2013

UPEI VRE

• Rich implementation of Islandora

• Used for digital stewardship of research, administrative and learning assets of UPEI

• Over 150 VREs with wide range of features

• VRE Management Team with 4 librarians

• Standard no cost, extra features charged

Page 136: Islandora Overview: PASIG May 2013
Page 137: Islandora Overview: PASIG May 2013
Page 138: Islandora Overview: PASIG May 2013
Page 139: Islandora Overview: PASIG May 2013
Page 140: Islandora Overview: PASIG May 2013
Page 141: Islandora Overview: PASIG May 2013
Page 142: Islandora Overview: PASIG May 2013
Page 143: Islandora Overview: PASIG May 2013
Page 144: Islandora Overview: PASIG May 2013
Page 145: Islandora Overview: PASIG May 2013
Page 146: Islandora Overview: PASIG May 2013
Page 147: Islandora Overview: PASIG May 2013
Page 148: Islandora Overview: PASIG May 2013
Page 149: Islandora Overview: PASIG May 2013
Page 150: Islandora Overview: PASIG May 2013
Page 151: Islandora Overview: PASIG May 2013
Page 152: Islandora Overview: PASIG May 2013
Page 153: Islandora Overview: PASIG May 2013
Page 154: Islandora Overview: PASIG May 2013
Page 155: Islandora Overview: PASIG May 2013
Page 156: Islandora Overview: PASIG May 2013
Page 157: Islandora Overview: PASIG May 2013
Page 158: Islandora Overview: PASIG May 2013
Page 159: Islandora Overview: PASIG May 2013
Page 160: Islandora Overview: PASIG May 2013
Page 161: Islandora Overview: PASIG May 2013
Page 162: Islandora Overview: PASIG May 2013
Page 163: Islandora Overview: PASIG May 2013
Page 164: Islandora Overview: PASIG May 2013

Consortia

Page 165: Islandora Overview: PASIG May 2013
Page 166: Islandora Overview: PASIG May 2013
Page 167: Islandora Overview: PASIG May 2013
Page 168: Islandora Overview: PASIG May 2013
Page 169: Islandora Overview: PASIG May 2013
Page 170: Islandora Overview: PASIG May 2013
Page 171: Islandora Overview: PASIG May 2013
Page 172: Islandora Overview: PASIG May 2013

Admin Collections

Page 173: Islandora Overview: PASIG May 2013
Page 174: Islandora Overview: PASIG May 2013
Page 175: Islandora Overview: PASIG May 2013
Page 176: Islandora Overview: PASIG May 2013
Page 177: Islandora Overview: PASIG May 2013
Page 178: Islandora Overview: PASIG May 2013
Page 179: Islandora Overview: PASIG May 2013

Links• General: islandora.ca, discoverygarden.ca, islandora.ca/if, sandbox.islandora.ca,

wiki.duraspace.org/display/FF/Fedora+Futures+Home, duracloud.org

• Code: github.com/Islandora, jenkins.discoverygarden.ca, travis-ci.org/Islandora/islandora/pull_requests, wiki.duraspace.org/display/ISLANDORA/Islandora, jira.duraspace.org/browse/ISLANDORA

• Institutional Repositories: islandscholar.ca, digital.march.es/ceacs-ir, digital.grinnell.edu/drupal/, digitalunc.coalliance.org/

• Digital Library Collections: peildo.ca, digital.march.es/clamor, digital.march.es/merce, newspapers.vre.upei.ca, mirc.sc.edu, islandimagined.ca, vre2.upei.ca/pwc/, atmintis.mb.vu.lt/en, unbound.williams.edu

• Research Data: library.upei.ca/vre, www.taverna.org.uk/, vdp.vre3.upei.ca/, modernistcommons.ca, vre2.upei.ca/herbarium/, discoveryspace.upei.ca/parca, discoveryspace.upei.ca/quantumchem/, upeikerrlab.ca

• Consortia: cairnrepo.ca, adrresources.coalliance.org

Note: some of these sites require authentication access - contact Mark for more information.

Page 180: Islandora Overview: PASIG May 2013

Questions?Mark Leggott - University of PEI/discoverygarden

[email protected]

Kathleen Van Ekris - [email protected]