Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding...

22
Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces Oksana Kurysheva @aviriel

description

In ideal world the systems are integrated and talking to each other through APIs. However, real life is not ideal, especially in legacy infrastructure. This presentation provides an example of capturing legacy content from 100+ geographically distributed sites into central digital archive based on Alfresco.

Transcript of Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding...

Page 1: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Data Spider for Legacy Infrastructure:

Capturing content from multiple file

shares, rebuilding complete metadata

from smaller pieces Oksana Kurysheva

@aviriel

Page 2: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Data Spider for Legacy Infrastructure:

Capturing content from multiple file

shares, rebuilding complete metadata

from smaller pieces Oksana Kurysheva

@aviriel

Page 3: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces
Page 4: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces
Page 5: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces
Page 6: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces
Page 7: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces
Page 8: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces
Page 9: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces
Page 10: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces
Page 11: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Spider Configuration

Page 12: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Sync Logic

Page 13: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Hierarchy: Case / Session / File

Page 14: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Search Requirements

Page 15: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces
Page 16: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces
Page 17: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Custom Auto-Versioning on

Properties Change

Page 18: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Deduplication

Page 19: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Experience in Real Life

Page 20: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Roadmap

Page 21: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Resources

Related Blog Post http://blog.itdhq.com/tagged/data-spider

Github Repository

https://github.com/ITDSystems/data-spider

Page 22: Data Spider for Legacy Infrastructure: Capturing content from multiple file shares, rebuilding complete metadata from smaller pieces

Resources

Related Blog Post http://blog.itdhq.com/tagged/data-spider

Github Repository https://github.com/ITDSystems/data-spider