Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

14
Big Biomedical Data Pistoia Panel Discussion 19 th April 2016 Mathew Woodwark, Director of Research Bioinformatics MedImmune

Transcript of Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

Page 1: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

Big Biomedical DataPistoia Panel Discussion19th April 2016Mathew Woodwark, Director of Research BioinformaticsMedImmune

Page 2: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark
Page 3: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

3

Goal: Integrative Informatics

Page 4: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

Collect, Store and Integrate

Integrated data analysis

Biological Understanding

} Target IDTarget SelectionTarget ValidationBiomarkers

Public Collaborations Internal

Engineering better molecules, enabling better projects

Data Sources

Multiple OmicsData types

Tissue P

henomics

Flow C

ytometry

Screening

Proteom

ics

Transcriptomics

Exom

e

Whole G

enome

Clinical

Phenotype

Data Warehouse

Page 5: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

Integrative Informatics Architecture

5

Portal and query layer

Data Marts

Integration Engine

Metadata and business

rules

Structured data

Semi-structured

data

Un-structured

data

Visualisation Layer

Queries to build data marts

Structured data

Semi-structured

data

Un-structured

data

External Internal

Rich viz components – display connected info in multiple formats

Organized by data type or by process. Data standards guide assembly

Data Extraction for further analysis in specialized tools

Reusable data connectors

Drill into underlying detailed data

Persistent and temporary marts

Able to integrate internal and external data

QC and ETL?

Page 6: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

6

Genomics Big Data Considerations1: Genomic Data

Page 7: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

7

Genomics Big Data Considerations2: Security and Access

Page 8: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

8

Genomics Big Data Considerations3: Consent

Page 9: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

9

Genomics Big Data Considerations4: Geography

Page 10: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

10

Genomics Big Data Considerations5: Phenotypic data

Page 11: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

11

Genomics Big Data Considerations6: Integration with other data

Tissue P

henomics

Flow

Cytom

etry

Screening

Proteom

ics

Transcriptomi

cs

Page 12: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

12

Genomics Big Data Considerations7: Global Collaboration

Tissue P

henomics

Flow

Cytom

etry

Screening

Proteom

ics

Transcriptomi

cs

Page 13: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

13

Genomics Big Data Considerations8: More integration

Tissue P

henomics

Flow

Cytom

etry

Screening

Proteom

ics

Transcriptomi

cs

Page 14: Pistoia Alliance conference April 2016: Big Data: Mathew Woodwark

14

Its complicated!And a work in progress…

…for all of us!