Population Genetics: The Hardy-Weinberg Model

37
Outline Population Random Mating Non-Overlapping Generations The Hardy-Weinberg Model for One Locus Extra: Another formulation of the Hardy-Weinberg Law Population Genetics: The Hardy-Weinberg Model Sorin Istrail Department of Computer Science Brown University, Providence [email protected] September 13, 2012 Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Transcript of Population Genetics: The Hardy-Weinberg Model

Page 1: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Population Genetics: The Hardy-Weinberg Model

Sorin Istrail

Department of Computer ScienceBrown University, Providence

[email protected]

September 13, 2012

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 2: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Inspirational Quotes

To begin with, we must emphasize a statement which I am sure you have heard before, butwhich must be repeated again and again. It is that the sciences do not try to explain, theyhardly even try to interpret, they mainly make models. By a model is meant a mathematicalconstruct which, with the addition of certain verbal interpretations, describes observed phenomena.The justification of such a mathematical construct is solely and precisely that it is expected towork -that is, correctly to describe phenomena from a reasonably wide area. Furthermore, it mustsatisfy certain esthetic criteria -that is, in relation to how much it describes, it must be rathersimple. I think it is worth while insisting on these vague terms - for instance, on the use of theword rather. One cannot tell exactly how ”simple” simple is. Some of the theories that we haveadopted, some of the models with which we are very happy and of which we are very proud wouldprobably not impress someone exposed to them for the first time as being particularly simple.

John von Neumann,“Method in the Physical Sciences”

“Nothing in Biology Makes Sense Except in the Light of Evolution” is an essay published in 1973 by evolutionarybiologist Theodosius Dobzhansky.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 3: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Outline

1 Outline

2 PopulationPopulation Substructure

3 Random Mating

4 Non-Overlapping Generations

5 The Hardy-Weinberg Model for One Locus

6 Extra: Another formulation of the Hardy-Weinberg Law

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 4: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Population Substructure

Population

In population genetics, the term population does not refer to theentire species, but to a group of organisms of the same speciesliving within a sufficiently restricted geographic area, such that anymember can mate with any other member of opposite sex. Thereare difficulties with this definition. One relates to the fact thatgeography creates some typically non-random pattern in the spatialdistribution of organisms; the members are not uniformlydistributed but they are in clusters or colonies, hard to defineformally.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 5: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Population Substructure

Geographic areas that are favorable for habitat are intertwined withareas that are not favorable: towns, cities, rivers, mountains.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 6: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Population Substructure

Population Substructure

Local interbreeding units of possible large geographical structureare the local units. Such units are of major importance becauseadaptive evolution takes place through systematic changes in allelefrequencies.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 7: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Random Mating

In sexual organisms genotypes are not transmitted to the nextgeneration.

Genotype (generation n) → Gametes → Genotype (generationn + 1)

The frequency of a genotype in a population is the genotypefrequency.

The Genotype of the mating pair determines the genotypes ofthe offspring.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 8: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Random Mating

In sexual organisms genotypes are not transmitted to the nextgeneration.

Genotype (generation n) → Gametes → Genotype (generationn + 1)

The frequency of a genotype in a population is the genotypefrequency.

The Genotype of the mating pair determines the genotypes ofthe offspring.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 9: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Random Mating

In sexual organisms genotypes are not transmitted to the nextgeneration.

Genotype (generation n) → Gametes → Genotype (generationn + 1)

The frequency of a genotype in a population is the genotypefrequency.

The Genotype of the mating pair determines the genotypes ofthe offspring.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 10: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Random Mating

In sexual organisms genotypes are not transmitted to the nextgeneration.

Genotype (generation n) → Gametes → Genotype (generationn + 1)

The frequency of a genotype in a population is the genotypefrequency.

The Genotype of the mating pair determines the genotypes ofthe offspring.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 11: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Random Mating

Definition

The Random Mating Model. Mating pairs have the samefrequencies as if they were formed by random collisions betweengenotypes = the chance that an organism mates with another isbased on the genotype frequencies in that population.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 12: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Random Mating

Consider a population with genotype frequencies AA 20%, Aa30% and aa 50%. Suppose the mating is random: then AAfemales mate with the AA, Aa and aa males in proportion of20%, 30% and 50% respectively.

In humans, some mating is random e.g., by blood type; somemating is not random: by height or skin color.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 13: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Non-Overlapping Generations

The cycle of birth, maturation, death include the death of allorganisms present in each generation before the members of thenext generation mature (with is defined as the age of the sexualreproduction).

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 14: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

The Hardy-Weinberg Model for One Locus

Random mating in the model with non-overlappinggenerations.

We assume that the forces that change allele frequencies arenegligible, forces such as mutation, migration, naturalselection.

Population must be large in size so that the allele frequencieswould not change by changing the sample.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 15: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Axioms for the Hardy-Weinberg Model

1 The organism is diploid

2 Reproduction is sexual

3 Generations re non-overlapping

4 The gene (locus) under consideration has two allele

5 The allele frequencies are the same for both males and females

6 Mating is random

7 Population size is very large (in theory infinite)

8 Mutation can be ignored

9 Migration is negligible

10 Natural selection does not affect the alleles underconsideration

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 16: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Axioms for the Hardy-Weinberg Model

1 The organism is diploid

2 Reproduction is sexual

3 Generations re non-overlapping

4 The gene (locus) under consideration has two allele

5 The allele frequencies are the same for both males and females

6 Mating is random

7 Population size is very large (in theory infinite)

8 Mutation can be ignored

9 Migration is negligible

10 Natural selection does not affect the alleles underconsideration

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 17: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Axioms for the Hardy-Weinberg Model

1 The organism is diploid

2 Reproduction is sexual

3 Generations re non-overlapping

4 The gene (locus) under consideration has two allele

5 The allele frequencies are the same for both males and females

6 Mating is random

7 Population size is very large (in theory infinite)

8 Mutation can be ignored

9 Migration is negligible

10 Natural selection does not affect the alleles underconsideration

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 18: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Axioms for the Hardy-Weinberg Model

1 The organism is diploid

2 Reproduction is sexual

3 Generations re non-overlapping

4 The gene (locus) under consideration has two allele

5 The allele frequencies are the same for both males and females

6 Mating is random

7 Population size is very large (in theory infinite)

8 Mutation can be ignored

9 Migration is negligible

10 Natural selection does not affect the alleles underconsideration

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 19: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Axioms for the Hardy-Weinberg Model

1 The organism is diploid

2 Reproduction is sexual

3 Generations re non-overlapping

4 The gene (locus) under consideration has two allele

5 The allele frequencies are the same for both males and females

6 Mating is random

7 Population size is very large (in theory infinite)

8 Mutation can be ignored

9 Migration is negligible

10 Natural selection does not affect the alleles underconsideration

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 20: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Axioms for the Hardy-Weinberg Model

1 The organism is diploid

2 Reproduction is sexual

3 Generations re non-overlapping

4 The gene (locus) under consideration has two allele

5 The allele frequencies are the same for both males and females

6 Mating is random

7 Population size is very large (in theory infinite)

8 Mutation can be ignored

9 Migration is negligible

10 Natural selection does not affect the alleles underconsideration

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 21: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Axioms for the Hardy-Weinberg Model

1 The organism is diploid

2 Reproduction is sexual

3 Generations re non-overlapping

4 The gene (locus) under consideration has two allele

5 The allele frequencies are the same for both males and females

6 Mating is random

7 Population size is very large (in theory infinite)

8 Mutation can be ignored

9 Migration is negligible

10 Natural selection does not affect the alleles underconsideration

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 22: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Axioms for the Hardy-Weinberg Model

1 The organism is diploid

2 Reproduction is sexual

3 Generations re non-overlapping

4 The gene (locus) under consideration has two allele

5 The allele frequencies are the same for both males and females

6 Mating is random

7 Population size is very large (in theory infinite)

8 Mutation can be ignored

9 Migration is negligible

10 Natural selection does not affect the alleles underconsideration

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 23: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Axioms for the Hardy-Weinberg Model

1 The organism is diploid

2 Reproduction is sexual

3 Generations re non-overlapping

4 The gene (locus) under consideration has two allele

5 The allele frequencies are the same for both males and females

6 Mating is random

7 Population size is very large (in theory infinite)

8 Mutation can be ignored

9 Migration is negligible

10 Natural selection does not affect the alleles underconsideration

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 24: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Axioms for the Hardy-Weinberg Model

1 The organism is diploid

2 Reproduction is sexual

3 Generations re non-overlapping

4 The gene (locus) under consideration has two allele

5 The allele frequencies are the same for both males and females

6 Mating is random

7 Population size is very large (in theory infinite)

8 Mutation can be ignored

9 Migration is negligible

10 Natural selection does not affect the alleles underconsideration

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 25: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

In the Hardy-Weinberg Model the mathematical relation betweenthe allele frequencies and the genotype frequencies is given by:

AA : p2 Aa : 2pq aa : q2

in which

p2, 2pq, q2 are the frequencies of the genotypes AA,Aa, aa inzygotes and

p, q are the allele frequencies of A and a in gametes in theprevious generation and

p + q = 1

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 26: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

In the Hardy-Weinberg Model the mathematical relation betweenthe allele frequencies and the genotype frequencies is given by:

AA : p2 Aa : 2pq aa : q2

in which

p2, 2pq, q2 are the frequencies of the genotypes AA,Aa, aa inzygotes and

p, q are the allele frequencies of A and a in gametes in theprevious generation and

p + q = 1

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 27: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

In the Hardy-Weinberg Model the mathematical relation betweenthe allele frequencies and the genotype frequencies is given by:

AA : p2 Aa : 2pq aa : q2

in which

p2, 2pq, q2 are the frequencies of the genotypes AA,Aa, aa inzygotes and

p, q are the allele frequencies of A and a in gametes in theprevious generation and

p + q = 1

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 28: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Fundamental Theorem 1

Theorem

The Hardy-Weinberg frequencies are attained after one generationof random mating irrespective of the genotype frequencies in theparental generation.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 29: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Fundamental Theorem 2 [Constancy of Allele Frequency inGenerations

Theorem

The Hardy-Weinberg law implies the constancy of allelefrequencies in every generation and therefore the genotypiccomposition of the population. This means that in the absence ofspecific evolutionary forces to change allele frequencies, themechanism of Mendelian inheritance, by itself keeps the allelefrequencies constant and thus preserves genetic variation.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 30: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

A new notation

Let us denote the two alleles at fixed locus A1 and A2.Suppose that in a generation the frequencies of the threegenotypes A1A1,A1A2,A2A2 are denoted by X , 2Y and Zrespectively.

We can calculate again the frequencies of the mating pairs:e.g., for mating type A1A1×A1A1 the frequency is X 2; for themating pair A1A1 × A1A2 the frequency is 4XY ; and so on.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 31: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

A new notation

The Medelian rules indicate that the outcome of theA1A1 × A1A1 mating is A1A1 and similarly we can computethe probabilities of obtaining such a genotype from the othermatings types as follows:

A1A1 is obtained from the A1A1 × A1A1 mating withprobability 1A1A1 is obtained from the A1A1 × A1A2 mating withprobability 1

2A1A1 is obtained from the A1A2 × A1A2 mating withprobability 1

4and note that the frequencies of the three mating types areX 2, 4XY and 4Y 2

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 32: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

A new notation

Therefore, if X ′, 2Y ′,Z ′ are the corresponding three genotypesA1A1,A1A2,A2A2 in the next generation then we have:

X ′ = X 2 + 12(4XY ) + 1

4(4Y 2) = X 2 + 2XY + Y 2 = (X + Y )2

2Y ′ = 12(4XY )+ 1

2(4Y 2)+2XZ + 12(4YZ ) = 2(X +Y )(Y +Z )

Z ′ = 14(4Y 2) + 1

2(4YZ ) + Z 2 = (Y + Z )2

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 33: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

A new notation

X ′ = (X + Y )2

2Y ′ = 2(X + Y )(Y + Z )

Z ′ = (Y + Z )2

In the next generation X ”, 2Y ”,Z ” we can easily see that:X ” = (X ′ + Y ′)2 = X ′,Y ” = Y ′,Z ” = Z ′ The genotypefrequencies obtained in the second generation are maintainedin all subsequent generations.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 34: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

A new notation

Frequencies having this property can be characterized by therelation:

(Y ′)2 = X ′Z ′

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 35: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

A new notation

Populations for which Y 2 = XZ are said to have thegenotypic frequencies in the Hardy-Weinberg form.

Let us observe that although there are possible changes ingenotypic frequencies between generation 1 and generation 2,the frequency x = X + Y of the allele A1 does not changebetween these two generations, nor between any generationsfollowing them.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 36: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

A new notation

Because X + 2Y + Z = 1 only two of the three frequenciesX , 2Y ,Z are independent (the other follows from them).

If the Y 2 = XZ holds then only one of the three frequencies isindependent (the other two follows from them).

The most convenient as an independent quantity is the thefrequency x of the allele A1.

With this notation we now reformulate the Hardy-WeinbergLaw.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model

Page 37: Population Genetics: The Hardy-Weinberg Model

OutlinePopulation

Random MatingNon-Overlapping Generations

The Hardy-Weinberg Model for One LocusExtra: Another formulation of the Hardy-Weinberg Law

Hardy-Weinberg Theorem

Theorem

Under the assumptions stated, a population, having genotypicfrequencies X (of A1A1), 2Y (of A1A2) and Z (of A2A2),achieves, after one generation of random mating, stablegenotypic frequencies

x2, 2x(1− x), (1− x)2

where x = X + Y , 1− x = Y + Z .

If the initial frequencies X , 2Y ,Z are already of the formx2, 2x(1− x), (1− x)2 then these frequencies are stable for allgenerations.

Sorin Istrail Population Genetics: The Hardy-Weinberg Model