PSB2016 Computational Microbiology Workshop


Transcript of PSB2016 Computational Microbiology Workshop

Page 1: PSB2016 Computational Microbiology Workshop

"Research is to see what everybody else has seen and to think what nobody else has thought."

Albert Szent-Györgyi

Image by J.W. McGuire/NIH

Page 2: PSB2016 Computational Microbiology Workshop

Image from You Don’t Know Jack. Vol 3.

Page 3: PSB2016 Computational Microbiology Workshop

Unsupervised discovery from large gene expression compendia with ADAGE

Casey Greene

Page 4: PSB2016 Computational Microbiology Workshop

Analysis with Denoising Autoencoders of Gene Expression (ADAGE)

Tan et al. Pac Sym Bio 2015; Tan et al. In Press. mSystems
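
The slide names the technique but gives no details, so the following is only a minimal sketch of a generic denoising autoencoder, not the authors' implementation. Everything specific here is an assumption made for illustration: one hidden layer with tied weights, masking noise, a cross-entropy loss, the hyperparameter values, the function names, and the toy data generator. Each hidden unit plays the role of a "node", and a node's activity for a sample is its hidden-layer output.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def make_toy_compendium(n_samples=40, n_genes=20, seed=1):
    """Hypothetical toy data: two gene groups that switch with a hidden condition."""
    rng = np.random.default_rng(seed)
    state = rng.integers(0, 2, size=n_samples)  # hidden condition per sample
    half = n_genes // 2
    X = np.empty((n_samples, n_genes))
    X[:, :half] = np.where(state[:, None] == 1, 0.9, 0.1)
    X[:, half:] = np.where(state[:, None] == 1, 0.1, 0.9)
    X += rng.normal(0, 0.02, size=X.shape)
    return np.clip(X, 0.0, 1.0)  # expression scaled to [0, 1]

def train_denoising_autoencoder(X, n_nodes=5, corruption=0.1, lr=0.1,
                                epochs=200, batch_size=10, seed=0):
    """Train a one-layer, tied-weight denoising autoencoder.

    X is samples x genes with values in [0, 1].
    Returns (W, b_hid, b_vis), where W is genes x nodes.
    """
    rng = np.random.default_rng(seed)
    n_samples, n_genes = X.shape
    W = rng.normal(0, 0.1, size=(n_genes, n_nodes))
    b_hid = np.zeros(n_nodes)
    b_vis = np.zeros(n_genes)
    for _ in range(epochs):
        order = rng.permutation(n_samples)
        for start in range(0, n_samples, batch_size):
            x = X[order[start:start + batch_size]]
            # Corrupt the input by zeroing a random fraction of the values.
            x_noisy = x * (rng.random(x.shape) >= corruption)
            h = sigmoid(x_noisy @ W + b_hid)      # encode
            x_hat = sigmoid(h @ W.T + b_vis)      # decode with the same weights
            # Cross-entropy against the CLEAN input: the gradient with respect
            # to the output pre-activation is (x_hat - x), averaged over the batch.
            d_out = (x_hat - x) / len(x)
            d_hid = (d_out @ W) * h * (1.0 - h)
            # Tied weights receive gradient from both encoder and decoder.
            W -= lr * (x_noisy.T @ d_hid + d_out.T @ h)
            b_hid -= lr * d_hid.sum(axis=0)
            b_vis -= lr * d_out.sum(axis=0)
    return W, b_hid, b_vis

def node_activity(X, W, b_hid):
    """Activity of every node for every sample (samples x nodes)."""
    return sigmoid(X @ W + b_hid)

def reconstruct(X, W, b_hid, b_vis):
    """Reconstruction of uncorrupted input through the trained model."""
    return sigmoid(node_activity(X, W, b_hid) @ W.T + b_vis)
```

With a trained model, `node_activity(X, W, b_hid)` gives one value per sample and node, which is the kind of quantity a per-node heatmap across experiments would display.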

Page 5: PSB2016 Computational Microbiology Workshop

ADAGE Identifies Genes’ Pathways

Assign Pathway

Page 6: PSB2016 Computational Microbiology Workshop

… and produces useful networks

Page 7: PSB2016 Computational Microbiology Workshop

The Transcription Factor Anr Controls P.a. Response to Low O2

[Figure labels: Low O2, O2, Anr, CF Lung Epithelium]

Page 8: PSB2016 Computational Microbiology Workshop

Node42 reflects Anr Activity

[Figure: "Node42 - Anr Activity" heatmaps, panels A-C, each with a color key. Datasets: E-GEOD-17179, E-GEOD-17296, E-GEOD-52445, E-GEOD-33160. Sample labels: wt, Δanr, Δdnr, ΔanrΔroxSR, EXP, STAT, O2. Platforms: Microarray, RNAseq PAO1, RNAseq J215.]

Page 9: PSB2016 Computational Microbiology Workshop

New Experiment Validates Node 42’s Low-O2 Signature

CF lung epithelial cells

Jack Hammond

[Figure: "Node42 - Anr Activity" heatmaps, panels A-C, each with a color key. Datasets: E-GEOD-17179, E-GEOD-17296, E-GEOD-52445, E-GEOD-33160. Sample labels: wt, Δanr, Δdnr, ΔanrΔroxSR, EXP, STAT, O2. Platforms: Microarray, RNAseq PAO1, RNAseq J215.]

Page 10: PSB2016 Computational Microbiology Workshop

ADAGE complements PCA/ICA

[Figure: heatmaps, each with a color key, for Node42, PC4, PC7, IC14, and IC49. Datasets: E-GEOD-17179, E-GEOD-17296, E-GEOD-33160, E-GEOD-52445, Anr-Microarray, Anr-RNAseq (PAO1, J215). Sample labels: wt, Δanr, Δdnr, ΔanrΔroxSR, EXP, STAT, O2.]

Page 11: PSB2016 Computational Microbiology Workshop

Cross-platform normalization of microarray and RNA-seq data for machine learning applications

Thompson, Tan, Greene. In Press. PeerJ. https://peerj.com/preprints/1460/

Jeff Thompson

Page 12: PSB2016 Computational Microbiology Workshop

Cross-platform normalization of microarray and RNA-seq data for machine learning applications

Thompson, Tan, Greene. In Press. PeerJ. https://peerj.com/preprints/1460/
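
The slides give only the title of this work, not the method. As a rough illustration of what cross-platform normalization can look like, the sketch below uses quantile normalization against a reference distribution. That is a generic, widely used technique swapped in here for illustration, and it should not be read as the approach taken in the cited preprint. The function names and the genes x samples layout are assumptions.

```python
import numpy as np

def reference_distribution(train):
    """Mean of the sorted values across samples.

    train is genes x samples (for example, microarray data that a model
    was trained on). Returns a 1-D array with one value per rank.
    """
    return np.sort(train, axis=0).mean(axis=1)

def quantile_normalize_to_reference(target, reference):
    """Give every column of target the reference distribution.

    target is genes x samples (for example, RNA-seq data) and must have
    as many genes as reference has values. Within each sample, the value
    at rank r is replaced by the r-th smallest reference value, so the
    gene ordering inside each sample is preserved. Ties are broken by
    position, which is a simplification.
    """
    ranks = np.argsort(np.argsort(target, axis=0), axis=0)
    return np.sort(reference)[ranks]
```

A typical use under these assumptions would be `quantile_normalize_to_reference(rnaseq, reference_distribution(microarray))`, so that new samples are put on the scale of the training platform before being passed to a model trained on it.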

Page 13: PSB2016 Computational Microbiology Workshop

New Experiment Validates Node 42’s Low-O2 Signature

CF lung epithelial cells

Jack Hammond

[Figure: "Node42 - Anr Activity" heatmaps, panels A-C, each with a color key. Datasets: E-GEOD-17179, E-GEOD-17296, E-GEOD-52445, E-GEOD-33160. Sample labels: wt, Δanr, Δdnr, ΔanrΔroxSR, EXP, STAT, O2. Platforms: Microarray, RNAseq PAO1, RNAseq J215.]

Page 14: PSB2016 Computational Microbiology Workshop

ADAGE analysis of publicly available gene expression data collections illuminates Pseudomonas aeruginosa-host interactions

bioRxiv: http://dx.doi.org/10.1101/030650

In Press @ mSystems

Page 15: PSB2016 Computational Microbiology Workshop

How do we move from this to mechanisms?

What “pathways” did my experiment affect?

Page 16: PSB2016 Computational Microbiology Workshop

ADAGE-based Pathway Analysis of Transcriptomic Changes

Page 17: PSB2016 Computational Microbiology Workshop

ADAGE Webserver coming soon! http://www.greenelab.com/webservers

Page 18: PSB2016 Computational Microbiology Workshop

Jie Tan+ (Grad Student)
Gregory Way (Grad Student)
Brett Beaulieu-Jones (Grad Student)
René Zelaya (Programmer)
Matt Huyck (Programmer)
Kathy Chen (Undergrad)
Mulin Xiong (Undergrad)
Deb Hogan (Hogan Lab/Dartmouth)
Jack Hammond (Hogan Lab/Dartmouth)
Jeff Thompson (Marsit Lab/Dartmouth)

Data: All investigators who publicly release their gene expression data.

Images: Artists who release their work under a Creative Commons license.

Funding:
G&B Moore Investigator in Data-Driven Discovery
National Science Foundation
Cystic Fibrosis Foundation
Norris Cotton Cancer Center Prouty Grant
American Cancer Society
Dartmouth SYNERGY
+Neukom Institute Graduate Fellowship

Find us online: http://www.greenelab.com

Twitter: @GreeneScientist