Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium...

64
Linkage Disequilibrium Biostatistics 666

Transcript of Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium...

Page 1: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Linkage DisequilibriumBiostatistics 666

Page 2: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Logistics: Office Hours, Exams

• Office hours on Fridays from 3-4 pm.

• Working to reserve a room in the School of Public Health.

• Aiming for mid-term on October 25. • Let me know immediately if you will need accommodations.

My e-mail is [email protected]

Page 3: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Previously …

• Basic properties of a locus• Allele Frequencies

• Genotype Frequencies

• Hardy-Weinberg Equilibrium• Relationship between allele and genotype frequencies that holds for most

genetic markers

• Exact Tests for Hardy-Weinberg Equilibrium

Page 4: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Today …

• We’ll consider properties of pairs of alleles

• Haplotype frequencies

• Linkage equilibrium

• Linkage disequilibrium

Page 5: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Intuition

• Genetic variants band together through time and populations …

• … this often results in correlated distributions for nearby variants

• The phenomenon is termed linkage disequilibrium

Page 6: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Let’s consider the history of two neighboring alleles…

Page 7: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

A C G T A C G T A C G T A C G T G T A C G A C G

A C G T A A G T A C G T A C G T G T A C G A C G

Alleles that exist today arose through ancient mutation events…

After Mutation

Before Mutation

Mutation

A C G T A A G T A C G T A C G T G T A C G A C G

Page 8: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Alleles that exist today arose through ancient mutation events…

A

A

C

After Mutation

Before Mutation

Mutation

Page 9: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

One allele arose first, and then the other…

A G

C G

A G

C G

C C

After Mutation

Before Mutation

Mutation

Page 10: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Recombination generates new arrangements for ancestral alleles

A G

C G

C C

A G

C G

C C

A C

After Recombination

Before Recombination

Recombinant Haplotype

Page 11: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Linkage Disequilibrium

• Chromosomes are mosaics

• Extent and conservation of mosaic pieces depends on

• Recombination rate• Mutation rate• Population size• Natural selection

• Combinations of alleles at very close markers reflect ancestral haplotypes

Ancestor

Present-day

Page 12: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Why is linkage disequilibrium important for genetic studies?

Benefits … Challenges …

Page 13: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Tagging SNPs

• In a typical short chromosome segment, there are only a few distinct haplotypes

• Carefully selected SNPs can determine status of other SNPs

Haplo1 T T T

Haplo2 T T T

Haplo3 T T T

Haplo4 T T T

Haplo5 T T T 10%

30%

20%

20%

20%

Page 14: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Basic Descriptors ofLinkage Disequilibrium

Page 15: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Commonly Used Descriptors

• Haplotype Frequencies• How often do we see each allele combination along a chromosome?

• Contain all the information provided by other summary measures

• Commonly used summaries• D

• D’

• r2 or Δ2

Page 16: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Haplotype Frequencies

0.1Totals

A Locus

Totals BLocus

bB

a

A

abaB

AbAB

pp

p

p

ppa

ppA

bB

Page 17: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Linkage EquilibriumExpected for Distant Loci

)1)(1(

)1(

)1(

BAbaab

BABaaB

BAbAAb

BAAB

ppppp

ppppp

ppppp

ppp

Page 18: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Linkage DisequilibriumExpected for Nearby Loci

)1)(1(

)1(

)1(

BAbaab

BABaaB

BAbAAb

BAAB

ppppp

ppppp

ppppp

ppp

Page 19: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Disequilibrium Coefficient DAB

ABbaab

ABBaaB

ABbAAb

ABBAAB

BAABAB

Dppp

Dppp

Dppp

Dppp

pppD

Page 20: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

DAB is hard to interpret

• Sign is arbitrary …• A common convention is to set…

• A, B as the common alleles

• a, b as the rare allele

• Range depends on allele frequencies• Hard to compare between markers

• Can you see why the range of DAB depends on allele frequencies?

Page 21: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

What is the range of DAB?

• What are the maximum and minimum possible values of DAB when • pA = 0.3 and pB = 0.3

• pA = 0.2 and pB = 0.1

• Can you derive a general formula for this range?

Page 22: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

D’ – A scaled version of D

0),min(

0),min(

'

AB

BabA

AB

AB

baBA

AB

AB

Dpppp

D

Dpppp

D

D

• Ranges between –1 and +1• More likely to take extreme values when allele frequencies are small

• 1 implies at least one of the observed haplotypes was not observed

Page 23: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

More on D’

• Pluses:• D’ = 1 or D’ = -1 means no evidence for recombination between the markers

• If allele frequencies are similar, high D’ implies markers are good surrogates for each other

• Minuses:• D’ estimates inflated in small samples

• D’ estimates inflated when one allele is rare

Page 24: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Raw |D’| data from Chr22

0.00

0.20

0.40

0.60

0.80

1.00

0 200 400 600 800 1000

Physical Distance (kb)

D'

Dawson et al, Nature, 2002

Page 25: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

² (also called r2)

n

pppp

D

BBAA

AB

2

²

)1()1(²

2

• Ranges between 0 and 1• 1 when the two markers provide identical information

• 0 when they are in perfect equilibrium

• Expected value is 1/2n

Page 26: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

More on r2

• r2 = 1 implies the markers provide exactly the same information

• The measure preferred by population geneticists

• Measures loss in efficiency when marker A is replaced with marker B in an association study

• With some simplifying assumptions (e.g. see Pritchard and Przeworski, 2001)

Page 27: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Raw ² data from Chr22

0.00

0.20

0.40

0.60

0.80

1.00

0 200 400 600 800 1000

Physical Distance (kb)

r2

Dawson et al, Nature, 2002

Page 28: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

0.00

0.25

0.50

0.75

1.00

0 200 400 600 800 1000

Physical Distance (kb)

D'

0.00

0.25

0.50

0.75

1.00

0 200 400 600 800 1000

Physical Distance (kb)

²

VariabilityOf

Pair-Wise LD

0.00

0.33

0.67

1.00

0 500 1000

Physical Distance (kb)

²

0.00

0.33

0.67

1.00

0 500 1000

Physical Distance (kb)

D'

MedianQuartilesDeciles

Page 29: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Dense Region 2

• Chromosome 21• 57 markers / 130 kb

• 37.37 – 37.50 Mb

• High LD region

• SNP picking (8/57 = 14%)• 5 unique SNPs

• 3 tagging SNPs

• Others, average r² = 0.94

D’

0.0 1.0

International HapMap Consortium Data

Page 30: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Dense Region 1

• Chromosome 7• 157 markers / 520 kb

• 27.0 – 27.5 Mb

• Average LD region

• SNP picking (33/157 = 21%)• 12 unique SNPs

• 21 tagging SNPs

• Others, average r² = 0.73

D’

0.0 1.0

International HapMap Consortium Data

Page 31: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Linkage Disequilibrium in Three Regions

Abecasis et al, Am J Hum Genet, 2001

Page 32: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Chr22 High LD: 22-27 Mb

Dawson et al, Nature, 2001

Page 33: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Chr22 Low LD: 27-32 Mb

Dawson et al, Nature, 2001

Page 34: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

When doeslinkage equilibrium hold?

Page 35: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Equilibrium or Disequilibrium?

• We will outline a simple justification for linkage equilibrium at most loci• For now, we will ignore drift in allele frequencies over time.

• In practice, extent of disequilibrium results form balance of factors• Distance between markers

• Genetic drift (a function of population size)

• Random mating

• …

• In our argument, random mating and recombination ensure that mutations spread from original haplotype to all haplotypes in the population…

Page 36: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Generation t,Initial Configuration

bB

aABbaABBa

AABbAABBA

pp

pDppDppa

pDppDppA

bB

Assume arbitrary values for the allele frequenciesand disequilibrium coefficient

Page 37: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Generation t+1,Without Recombination

bB

aABbaABBa

AABbAABBA

pp

pDppDppa

pDppDppA

bB

Haplotype Frequencies Remain Stable Over TimeOutcome has probability 1- r

Page 38: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Generation t+1,With Recombination

bB

ababA

AbABA

pp

pppppa

pppppA

bB

Haplotype Frequencies Are Function of Allele FrequenciesOutcome has probability r

Page 39: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Generation t+1,Overall

bB

aABbaABbA

AABbAABBA

pp

pDrppDrppa

pDrppDrppA

bB

)1()1(

)1()1(

Disequilibrium Decreases…

Page 40: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Recombination Rate (r)

• Probability of an odd number of crossovers between two loci

• Proportion of time alleles from two different grand-parents occur in the same gamete

• Increases with physical (base-pair) distance, but rate of increase varies across genome

Page 41: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

0.000

0.100

0.200

0.300

0.400

0.500

0.600

0.700

0.800

0.900

1.000

1 10 100 1000

Generations

Dis

eq

uilib

riu

m

Ө = 0.5

Ө = 0.1

Ө = 0.01

Ө = 0.001

Ө = 0.0001

r = 0.5

Decay of D with Time

r = 0.1

r = 0.01

r = 0.001

r = 0.0001

Page 42: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Predictions

• Disequilibrium will decay each generation• In a large population

• After t generations…• Dt

AB = (1-r)tD0AB

• A better model should allow for changes in allele frequencies over time…

Page 43: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Linkage Equilibrium

• In a large random mating population haplotype frequencies converge to a simple function of allele frequencies

Page 44: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

More Examples of Linkage Disequilibrium Data

How much disequilibrium is there?

What are good predictors of disequilibrium?

Page 45: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Empirical of LD (chromosome 2, R²)

0

0.2

0.4

0.6

0 50 100 150 200 250

Distance (kb)

Sta

tis

tic

Han, Japanese

Yoruba

CEPH

LD extends further in CEPH and the Han/Japanese than in the Yoruba

Comparing Populations …

International HapMap Consortium, Nature, 2005

Page 46: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Variation in Linkage DisequilibriumAlong The Genome

Page 47: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Comparing Genomic Regions …

• Rather than compare curves directly, it is convenient to a pick a summary for the decay curves

• One common summary is the distance at which the curve crosses a threshold of interest (say 0.50)

Page 48: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Extent of Linkage Disequilibrium

0

4

8

12

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22

Chromosome

Ha

lf-L

ife

of

LD extends further in the larger chromosomes, which have lower recombination rates

Page 49: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

But within each chromosome, there is still huge variability!

Page 50: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Average Extent:

Median Extent:

10th percentile:

90th percentile:

Extent of LD

Extent of LD (r²)

Pro

po

rtio

n o

f G

en

om

e

0 10 20 30 40 50

0.0

00

.02

0.0

40

.06

0.0

80

.10

11.9 kb

7.8 kb

3.5 kb

20.9 kb

Extent of LD Across the Genome

(in 1Mb Windows)

Page 51: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Genomic Variation in LD (CEPH)

Expected r2 at 30kb

Bright Red > 0.88

Dark Blue < 0.12

Smith et al, Genome Research, 2005

Page 52: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Genomic Variation in LD(JPT + CHB)

Expected r2 at 30kb

Bright Red > 0.88

Dark Blue < 0.12

Smith et al, Genome Research, 2005

Page 53: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Genomic Variation in LD(YRI)

Expected r2 at 10kb

Bright Red > 0.88

Dark Blue < 0.12

Smith et al, Genome Research, 2005

Page 54: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Genomic Distribution of LD(YRI)

Expected r2 at 30kb

Bright Red > 0.88

Dark Blue < 0.12

Smith et al, Genome Research, 2005

Page 55: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Sequence Composition vs. LD(some selected comparisons)

(Low LD) (High LD)

Q1 Q2 Q3 Q4 Trend

Basic Sequence Features

GC Bases (%) 40.8 43.5 41.0 39.6 39.0 Decreases With LD

Bases in CpG Islands (%) 0.7 0.9 0.7 0.6 0.6 Decreases With LD

Polymorphism (P * 10,000) 10.1 11.9 10.6 9.6 8.3 Decreases With LD

Genes and Related Features

Known Genes (per 1000 kb) 6.4 6.6 6.1 6.2 6.7 U shaped

Genic Bases (Exon, Intron, UTR, %) 38.5 37.6 34.6 36.0 45.9 U shaped

Coding Bases (%) 1.2 1.1 1.0 1.1 1.4 U shaped

Conserved Non-Coding Sequence (%) 1.4 1.6 1.5 1.3 1.1 Decreases with LD

Repeat Content

Total Bases in Repeats (%) 47.9 44.2 46.4 48.6 52.3 Increases with LD

Bases in LINE repeats (%) 20.9 16.5 19.9 22.4 24.9 Increases with LD

Bases in SINE repeats (%) 13.6 14.7 13.1 12.6 14.0 U shaped

Genome Quartiles, Defined Using LD

Genome

Smith et al, Genome Research, 2005

Page 56: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Gene Function in Regions of High and Low Disequilibrium

Annotated

Gene Function (GO Term) Genes Low LD High LD χ² P-value

All Swissprot Entries Examined 7520 2305 2045 - -

DNA metabolism 366 74 139 35.37 <.0001

Immune Response 622 232 94 34.36 <.0001

Cell cycle 493 119 177 26.24 <.0001

Protein Metabolism 1193 318 375 23.33 <.0001

Organelle Organization and Biogenesis 444 107 152 19.65 <.0001

Intracellular Transport 263 56 95 19.61 <.0001

Organogenesis 805 294 162 16.49 0.00005

Cell Organization and Metabolism 545 138 178 16.43 0.00005

RNA Metabollism 208 41 71 15.33 0.00009

Results from a comparison of the distribution of 40 most common

gene classifications in the GENE Ontology Database

Page 57: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Implications for Association Studies

Page 58: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Obesity and the NEGR1 locus

Multiple nearby SNPs show evidence for association with obesity.

The associated alleles appear together often, making it hard to pinpoint the causal variant.

Rapid shifts in association signal often correspond to recombination ‘hotspots’.

Willer et al, Nature Genetics, 2009

Page 59: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Linkage Disequilibrium in Association Studies:

Tag SNP Picking

• Many nearby SNPs will typically provide similar evidence for association

• To decrease genotyping costs, most association studies will examine selected “tag SNPs” in each region

• The most common tagging strategy focuses on pairwise r2 between SNPs

Page 60: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Linkage Disequilibrium in Association Studies:

Pairwise Tagging Algorithm• Select an r2 threshold, typically 0.5 or 0.8

• SNPs with r2 above threshold can serve as proxies for each other

• For each marker being considered, count the number of SNPs with r2 above threshold

• Genotype SNP with the largest number of pairwise “proxies”

• Remove SNP and all the SNPs it tags from consideration

• Repeat the previous three steps until there are no more SNPs to pick or genotyping budget is exhausted

Carlson et al, AJHG, 2004

Page 61: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Potential Number of tag SNPs

Current tag SNP panels typically examine 500,000 – 1,000,000 SNPs for a cost of

$50 - $100 per sample.

The International HapMap Consortium, Nature, 2007

Page 62: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Today …

• Basic descriptors of linkage disequilibrium

• Learn when linkage disequilibrium is expected to hold (or not!)

Page 63: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Additional Reading I

• Dawson E et al (2002). A first-generation linkage disequilibrium map of human chromosome 22. Nature 418:544-548

• The International HapMap Consortium. (2005). A haplotype map of the human genome. Nature 437:1299-320

• Carlson CS et al (2004). Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am J Hum Genet 74:106-120

Page 64: Linkage Disequilibrium - University of Michigan€¦ · •Hardy-Weinberg Equilibrium •Relationship between allele and genotype frequencies that holds for most genetic markers •Exact

Additional Reading II

• Cardon and Bell (2001) Association study designs for complex diseases. Nature Reviews Genetics 2:91-99

• Surveys important issues in analyzing population data.

• Illustrates shift from focus on linkage to association mapping for complex traits.