Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf ·...

42
Epigenomics: Some Epigenomics: Some Epigenomics: Some Epigenomics: Some Statistical Applications Statistical Applications Statistical Applications Statistical Applications Rafael A. Irizarry Rafael A. Irizarry Department of Biostatistics J h H ki Bl b Sh l fP bli John Hopkins Bloomberg School of Public Health

Transcript of Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf ·...

Page 1: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Epigenomics: SomeEpigenomics: SomeEpigenomics: Some Epigenomics: Some Statistical ApplicationsStatistical ApplicationsStatistical Applications Statistical Applications

Rafael A. IrizarryRafael A. IrizarryDepartment of BiostatisticsJ h H ki Bl b S h l f P bliJohn Hopkins Bloomberg School of Public

Health

Page 2: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

AcknowledgementsAcknowledgementsAcknowledgementsAcknowledgements

• Tom Albert Nimblegen• Tom Albert, Nimblegen• Benilton Carvalho, JHU Biostatistics• Andy Feinberg Lab, JHU Medicine• Todd Richmond, NimblegenTodd Richmond, Nimblegen• Hao Wu, JHU Biostatistics

J W B Bi t t• Jean Wu, Brown Biostat• Vasan Yegnasubramanian, JHU g

Oncology

Page 3: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

OutlineOutlineOutlineOutline

• Quick Introduction to Epigenetics• Quick Introduction to Epigenetics• Introduction to Methylationy• Overview of competing technologies

Review: Expression arrays lessons• Review: Expression arrays lessons• Comparisonp• Role of statisticians

Page 4: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Genetics: the alphabet of lifeGenetics: the alphabet of life

• Letters of DNALetters of DNA sequence carry the informationinformation

Page 5: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

EpigeneticsEpigenetics

Page 6: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

(3.4x10-10 meters/bp) x (6x109 bp/genome) = ~2 meters/genome

Radius of the nucleus is ~ 10 µM !!!µ

Klug and Cummings, 1997

Page 7: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

[(6 x 109 bp/genome) / (195 bp/nucleosome)] = ~ 30.8 x 106 nucleosomes/genome~ 5 % of nuclear volume

Page 8: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

http://www.albany.edu/~achm110/solenoidchriomatin.html

Page 9: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Epigenetics: the grammar of Epigenetics: the grammar of lifelifelifelife

Page 10: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

DNA methylationDNA methylationDNA methylationDNA methylation

DNA methyltransferase I (DNMT1)Obsered to expeced = Pr(CG) / { Pr(C) Pr(G) }

Page 11: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

DNA methylation can lead to silencing of gene expressionDNA methylation can lead to silencing of gene expression

HDACMeCP2

Sin3A HDACMeCP2

Sin3A

HDAC2

HDAC1

MBD3

RbAp46RbAp48

Mi 2

MTA268kD

66kDHDAC2

HDAC1

MBD3

RbAp46RbAp48

Mi 2

MTA268kD

66kD>2 MDalton Complex

MBD2Mi-2 MBD2Mi-2

Robertson and Wolffe, Nat Rev Genet, 2000

Page 12: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

ENCODE TrackENCODE TrackENCODE TrackENCODE Track

Page 13: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Expression Array LessonsExpression Array Lessons

Page 14: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

NormalizationNormalizationNormalizationNormalization

Page 15: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Probe effectProbe effectProbe effectProbe effect

I t it B k d P b Eff t Q tit EIntensity = Background + Probe Effect x Quantity x Error

Page 16: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Sequence effect for BGSequence effect for BGSequence effect for BGSequence effect for BGWu et al. (2004) JASA 99(468) 909

Aff ∑ ∑ 125

jbk CGTAj

kj kAffinity =

= ∈∑ ∑= 1

1 },,,{,μ µj,k ~ smooth function

of k

Page 17: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Back to MethylationBack to Methylation

High throughput of course….High throughput of course….

Page 18: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Densities for three methodsDensities for three methodsDensities for three methodsDensities for three methods

HCT116 lots of methylation

DKO very little methylation

Page 19: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Hunh?Hunh?Hunh?Hunh?

Page 20: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

MeDIP (like ChIPchip)MeDIP (like ChIPchip)

Total Reverse crosslinks Amplify Label/hybridizeAmplify Label/hybridize

Crosslink YLyse & Sonicate

IP Reverse crosslinks

Amplify Label/hybridize

Other controls for IP(e.g., no antibody, non-

specific antibody)

Page 21: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Some DataSome DataSome DataSome Data

Page 22: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Problem: Not specificProblem: Not specificProblem: Not specificProblem: Not specific

Page 23: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

HELP: Two enzymesHELP: Two enzymesHELP: Two enzymesHELP: Two enzymesCuts at CCGG Cuts at CMCGG

No Methylation

Page 24: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

HELP after PCRHELP after PCRHELP after PCRHELP after PCR

No Methylation

Page 25: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

HELPHELPHELPHELP

Methylation

Page 26: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

HELPHELPHELPHELP

No Methylation

Page 27: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Problem with HELPProblem with HELPProblem with HELPProblem with HELPCuts at CCGG Cuts at CMCGG

No Methylation

Page 28: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

The ProblemThe ProblemThe ProblemThe Problem

Ob d t d P (CG) / { P (C) P (G) }Obsered to expeced = Pr(CG) / { Pr(C) Pr(G) }

Page 29: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Proportion of neighboring CpG also Proportion of neighboring CpG also th l t d/ t th l t dth l t d/ t th l t dmethylated/not methylatedmethylated/not methylated

Page 30: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

McRBC on Tiling arrayMcRBC on Tiling arrayMcRBC on Tiling arrayMcRBC on Tiling array

Page 31: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

ROC nowROC nowROC nowROC now

Page 32: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

ENCODE TrackENCODE TrackENCODE TrackENCODE Track

Page 33: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Problems for StatisticiansProblems for StatisticiansProblems for StatisticiansProblems for Statisticians

• Background Correction +• Background Correction + Normalization

• Probability Model for Segments• Use these to from null and alternative• Use these to from null and alternative

models… we need power!• Use these to create bump finding

algorithmsg• Adapt to high-throughput sequencing

Page 34: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

Supplemental SlidesSupplemental Slides

Page 35: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

McRBC: One enzymeMcRBC: One enzymeMcRBC: One enzymeMcRBC: One enzymeCuts at AmCG or GmCG Input

No Methylation

Page 36: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

McRBC after GelMcRBC after GelMcRBC after GelMcRBC after Gel

No Methylation

Page 37: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

McRBC after GelMcRBC after GelMcRBC after GelMcRBC after Gel

No Methylation

Page 38: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

McRBCMcRBCMcRBCMcRBC

Methylation

Page 39: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

McRBC after GELMcRBC after GELMcRBC after GELMcRBC after GEL

Methylation

Page 40: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)

McRBC after GELMcRBC after GELMcRBC after GELMcRBC after GEL

Methylation

Page 41: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)
Page 42: Epigenomics: SomeEpigenomics: Some Statistical ...hji/courses/genomics/Epigenetics.pdf · Epigenetics: the grammar of life. DNA methylationDNA methylation Obsered to expeced = Pr(CG)