Rafe Nauen - Rafe's Field Guide to Constellation Sentences (Family Constellation)
Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack
-
Upload
pistoia-alliance -
Category
Technology
-
view
1.108 -
download
0
description
Transcript of Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack
![Page 1: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/1.jpg)
Constellation Technologies& GeneStack
Development of Sequence Services 2 in the Constellation
Framework
1
![Page 2: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/2.jpg)
ConstellationExperts in big data and bioinformatics
2
• Spin out from STFC (Science and Technology Facilities Council)– Largest research facility in UK specialising in large data computing
• CERN, European physics and astronomy science• Supporting all UK disciplines in computing
• Strong IT & Bioinformatics expertise– Strong Bioinformatics delivery expertise– Strong connections into European academia– Excellent access to newly developed applications, tools and algorithms
• Supplier of cloud computing services to large Pharma.• Partners for Pistoia SS2
– Microsoft Azure– STFC
![Page 3: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/3.jpg)
Service ServiceService
Constellation’s “Roadmap”
Service
Core
Text Mining/Search
GenomeAnalysis
Data Integration
“Workflow Management”
Seamless Integration with Client systems
API
“AppMarket”
![Page 4: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/4.jpg)
IT
• IT– Platform Design– Support– Maintenance– Testing– Stability / Scalability– Security
• Bioinformatics– Novel Algorithms– Research– Scientific support– Discovery– Analysis– Value Added
4
BioinformaticsIT
![Page 5: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/5.jpg)
• Hosted– Single Vendor– Hardware limitations– Restricted storage– Limited cost models– “Lock in”
• Cloud– Vendor Agnostic– As required– Selectable storage– Best model available– “Flexible”
5
HostingCloud
![Page 6: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/6.jpg)
Vendor Agnostic
Cloud Vision
6
Flexible Storage
Flexible Compute
TrueCloud
Client Business
Logic
MinimiseSupport
“Bioinformatics Marketplace”
Virtual Organisation
ClientApplications
Academic or bespoke solutions
Your Informatics
![Page 7: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/7.jpg)
High Level Architecture
Distributed Storage
Distributed Compute
BioinformaticsSystems
Workflow Tools
Portal
Workflow UIDeployed
Workflow (Apps)Bioinformatics
UIs
Bioinformatics Applications
![Page 8: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/8.jpg)
Our goal for SS2• We believed the end goal was a flexible platform where ALL the
application described in SS2 scope could be deployed for individual clients as required.
• Platform should be scalable where security, support and maintenance can be easily managed.– Reducing support costs allows for more focus on research
• Bioinformatics applications added as required:– GeneStack (Analysis Portal)– VIB (Arctix) (Workflow) (in discussion)– EBI (Services) (in discussion)
• Workflow delivered as a fundamental development principle• Development of the “AppMarket” for Bioinformatics
8
![Page 9: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/9.jpg)
9
CompanySpecific
Integrating3rd PartySystems
SecureScalableStorage
WorkflowCore
IntegrationWith otherSystems
FutureDevelopment
![Page 10: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/10.jpg)
Deliverables achieved• Portal with access to all the “Must Have” Web Services described in the SS2 documentation
– Constellation Managed Administration Interface to allow organisational mapping of users to Programs / Projects / Applications
• “Tool Box” of Integrated Applications– Galaxy– Secure Ensembl– Secure CellProfiler– Content Search (New development)
• Galaxy workflow engine with integrating applications deployed as a secure web application to cover “Must Have” tools– Restricted set of apps based on feedback from “testing pool” (Restrictions based on Need/Security)– Tools can be added on request
• Scalable storage and compute (dependant on need and security)– Structured Program - Project – User mapping– Cost effective data storage and compute
• Initial Integration with another Bioinformatics Vendor (GeneStack)
10
![Page 11: Sequence Services Phase 2 Webinar Series: Constellation Technology and Genestack](https://reader036.fdocuments.net/reader036/viewer/2022062513/555069fcb4c9052d158b4613/html5/thumbnails/11.jpg)
Other Available SAAS tools
• Secure EnsEMBL– Private copy of EnsEMBL (Rackspace)– Secure UI and API Access– Ability to map DAS (secure or Public)
• Parallelised CellProfiler– Private scalable version of CellProfiler on Azure
11