PV92 PCR/Informatics Kit Population Genetics and Informatics.

34
PV92 PCR/Informatics PV92 PCR/Informatics Kit Kit Population Genetics and Population Genetics and Informatics Informatics

Transcript of PV92 PCR/Informatics Kit Population Genetics and Informatics.

Page 1: PV92 PCR/Informatics Kit Population Genetics and Informatics.

PV92 PCR/Informatics KitPV92 PCR/Informatics Kit

Population Genetics and Population Genetics and InformaticsInformatics

Page 2: PV92 PCR/Informatics Kit Population Genetics and Informatics.

To estimate frequency of To estimate frequency of Alu Alu

within a population:within a population:

Amplify Amplify Alu Alu insert from insert from representative sample populationrepresentative sample population

Calculate the expected allelic and Calculate the expected allelic and genotypic frequenciesgenotypic frequencies

Perform Chi-squared TestPerform Chi-squared Test

Page 3: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Calculating Allelic and Genotypic Calculating Allelic and Genotypic FrequenciesFrequencies

Within your class, how unique is your particular combination of Alu alleles? By calculating an allele frequency, you can begin to answer this question. An allele frequency is the percentage of a particular allele within a population of alleles. It is expressed as a decimal. You can calculate an allele frequency for the Alu PV92 insertion in your class by combining all your data. For example, imagine that there are 38 students in your class and the genotype distribution within the class is as follows: GenotypeGenotype +/+ +/- -/- TotalTotal

(N)(N)

# of people# of people 25 5 825 5 8 3838

Page 4: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Calculating Allelic Frequencies: + Calculating Allelic Frequencies: + alleleallele

Total number of alleles = 2N = Total number of alleles = 2N = 2(38)2(38) = 76 = 76

Number of + alleles =Number of + alleles =

+/+=+/+= 25 with two 25 with two + + alleles alleles = 50 + = 50 + allelesalleles

+/- = +/- = 5 with one 5 with one + + allelesalleles = 5 + = 5 + allelesalleles

Total= Total= 55 + 55 + allelesalleles

Frequency of + Frequency of + = = number of + allelesnumber of + alleles = = 55 55 = = 0.720.72 total alleles total alleles 7676

Calculation for the – alleles would be similar, and Calculation for the – alleles would be similar, and the result would be .28the result would be .28

Notice: (+ allele) + (- allele) = 1.0Notice: (+ allele) + (- allele) = 1.0

Page 5: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Calculating Genotypic Calculating Genotypic FrequenciesFrequencies

How does the distribution of Alu genotypes in your class compare with the distribution in other populations? For this analysis, you need to calculate a genotype frequency, the percentage of individuals within a population having a particular genotype. Remember that the term allele refers to one of several different forms of a particular genetic site whereas the term genotype refers to the specific alleles that an organism carries. You can calculate the frequency of each genotype in your class by counting how many students have a particular genotype and dividing that number by the total number of students.

Given the ethnic makeup of your class, might you expect something different?

How can you estimate what the expected frequency should be?

Page 6: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Calculating Calculating ObservedObserved Genotypic FrequenciesGenotypic Frequencies

GenotypeGenotype +/+ (p2)(p2) +/- (2pq)(2pq) -/- (q2)(q2) TotalTotal (N) (N)

# of people# of people 25 5 825 5 8 38 38

Observed Observed 0.66 0.13 0.21 0.66 0.13 0.21 1.001.00

frequencyfrequency

Calculation:Calculation:+/+ genotypic frequency == # with genotype

total number of people (N)total number of people (N)

== 25/3825/38 = .66= .66

Page 7: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Alu and Population GeneticsAlu and Population Genetics If within an infinitely large population no mutations

are acquired, no genotypes are lost or gained, mating is random, and all genotypes are equally viable, then that population is said to be in Hardy-Weinberg equilibrium. In such populations, the allele frequencies will remain constant generation after generation. Genotype frequencies within this population can then be calculated from allele frequencies by using the equation:

p2 + 2pq + q2 = 1.0 p and q are the allele frequencies for two

alternate forms of a genetic site. The genotype frequency of the homozygous condition is either p2 or q2 (depending on which allele you assign to p and which to q). The heterozygous genotype frequency is 2pq.

Page 8: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Alu and Population GeneticsAlu and Population GeneticsHardy-Weinberg EquilibriumHardy-Weinberg Equilibrium

pp22 + 2pq + q + 2pq + q22 = 1 = 1

+/+ = p+/+ = p22

+/- = 2pq+/- = 2pq-/- = q-/- = q22

pppp pqpq

qqqqpqpq

p p

pp

qq

qq

Page 9: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Using Hardy-WeinbergUsing Hardy-Weinberg

Determine Determine pp22,, 2pq, and q2pq, and q22 values= values= Expected genotypic frequenciesExpected genotypic frequencies

p = 0.72 , so q = 0.28p = 0.72 , so q = 0.28 since since p + q = 1p + q = 1 pp22 ++ 2pq2pq + q+ q22 = 1= 1

(0.72)(0.72)22 + 2 (0.72)(0.28)+ 2 (0.72)(0.28) + (0.28)+ (0.28)22 = 1= 1 0.520.52 ++ 0.400.40 + + 0.080.08 = 1= 1

p2 = 0.52p2 = 0.52 2pq = 0.402pq = 0.40 q2 = 0.08q2 = 0.08

Page 10: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Calculate Calculate Expected Expected Number of Number of GenotypesGenotypes

Expected number of genotype =Expected number of genotype =

Genotypic frequency x population Genotypic frequency x population number (N)number (N)

GenotypeGenotype Expected Expected numbernumber+/+ +/+ 0.52 x 38 = 0.52 x 38 = 2020+/+/- - 0.40 x 38 = 0.40 x 38 = 1515

--//- - 0.08 x 38 = 0.08 x 38 = 33

Page 11: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Chi Squared TestChi Squared Test Chi-square, a statistical test used for comparing

observed frequencies with expected frequencies. The larger the chi-square value, the greater is the difference between the observed and the expected values.

When using the Chi-square analysis, we test the null hypothesis that there is no difference between samples (observed and expected) and we assume that if there is any difference, then it arose simply by chance and is not real.

Our null hypothesis is that your class is in Hardy-Weinberg equilibrium. Whether or not we can accept the null hypothesis is given by a p-value.

If the calculated p-value is less than 0.05, the null hypothesis is disproved; the population is not in Hardy-Weinberg equilibrium.

If the p-value is greater than 0.05, the population may be in Hardy-Weinberg equilibrium; we can not prove that it is not in Hardy-Weinberg equilibrium.

Page 12: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Chi Squared TestChi Squared Test

As an example, let’s say that Chi-square analysis of some data gives a p-value of 0.17. This means that there is a 17% probability that the difference between the observed and the expected values is due to chance. It also means that there is an 83% (100% - 17% = 83%) probability that the difference is not due to chance; the difference is real.

Page 13: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Chi SquaredChi Squared Test Test

E

EOX

22

Observed Observed Expected Expected (O-E)2(O-E)2

EE+/+ +/+ 2525 20 20 1.251.25 +/- +/- 55 15 15 6.666.66-/- -/- 88 3 3 8.338.33

Total Total 17.1217.12

X2 Critical Value (from statistics table) = 5.9X2 Critical Value (from statistics table) = 5.9

17.12 is above 5.9 so the ratio is not accepted.17.12 is above 5.9 so the ratio is not accepted.

Page 14: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Allele ServerAllele Server

Cold Spring Harbor LabCold Spring Harbor Lab

DNA Learning CenterDNA Learning Center

Page 15: PV92 PCR/Informatics Kit Population Genetics and Informatics.

http:// vector.cshl.orghttp:// vector.cshl.org

Click on Resources

Page 16: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Click on BioserversClick on Bioservers

Click on Bioservers

Page 17: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Enter the Allele ServerEnter the Allele Server

Page 18: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Allele ServerAllele Server

Click on Manage Groups

Page 19: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Select Type of DataSelect Type of Data

Select Group

Page 20: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Your GroupYour Group

Scroll down to “Your Group”

Page 21: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Click on Add GroupClick on Add Group

Click on Add Group

Page 22: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Fill Out FormFill Out Form

Page 23: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Click on Edit GroupClick on Edit Group

Click on Edit Group

Page 24: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Fill out CompletelyFill out Completely

Page 25: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Click on Individuals TabClick on Individuals Tab

Click on Individuals Tab

Page 26: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Add Each Student With Add Each Student With InformationInformation

Add as much info as you canAdd as much info as you can Genotype ( +/+, +/-, -/- )Genotype ( +/+, +/-, -/- ) GenderGender Personal InfoPersonal Info

Page 27: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Click on DoneClick on Done

Done

Page 28: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Select Your GroupSelect Your Group

Select and then press OK

Page 29: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Click to AnalyzeClick to Analyze

Click Here

Then Click Here

Page 30: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Then Look at the Terse and Then Look at the Terse and Verbose TabsVerbose Tabs

Page 31: PV92 PCR/Informatics Kit Population Genetics and Informatics.

ExtensionsExtensions Is your class in Hardy-Weinberg Is your class in Hardy-Weinberg

Equilibrium?Equilibrium? Compare your group to other existing Compare your group to other existing

groups.groups. Form an explanation for the origination Form an explanation for the origination

of Alu and how it spread throughout of Alu and how it spread throughout different populations.different populations.

Have students do manual calculations Have students do manual calculations first and then compare to the first and then compare to the computer generated version.computer generated version.

Page 32: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Plotting Alu PV92 on a World Plotting Alu PV92 on a World MapMap The Alu element first appeared tens of millions of years ago and since

that time, it has been increasing within our genome at the rate of about one copy every 100 years.

It is difficult to tell how Alu arose. It shows a striking similarity to a gene (called 7SL RNA) that performs a vital function in our metabolism. But Alu , it seems, has no function. It is self-serving and, like a parasite, takes advantage of us for its own replication without providing us any advantage to our own survival.

Most Alu elements are “fixed”; they are found at the same chromosomal site in every person on the planet. Fixed Alu elements must have arose very early in our evolution, well before Homo sapiens appeared. When modern humans did arise some 200,000 years ago, the vast majority of our Alu insertions came to us already intact in our DNA.

The Alu PV92 insertion, however, is not fixed. This insertion may or may not be present on one or both of a person’s number 16 chromosomes. Since not everyone has the Alu PV92 element, it must have arisen after the initial human population began growing.

It is a widely held belief that modern humans originated in Africa and then disseminated across the planet. Did the Alu PV92 insert arise in Africa or on some other continent during our spread across the globe?

In the following exercise, you will plot the “+” allele frequencies for various populations on a world map and make some determination as to where this Alu arose and how it might have spread across continents.

Page 33: PV92 PCR/Informatics Kit Population Genetics and Informatics.

.18 .20

.96

.26

.09

.35

.30

.18

.52

.86

.15

.53

.12

Page 34: PV92 PCR/Informatics Kit Population Genetics and Informatics.

Classroom How To…Classroom How To…

We have worksheets for calculating We have worksheets for calculating allelic and genotypic frequencies for allelic and genotypic frequencies for your class your class

BABEC has the materials for the BABEC has the materials for the world population dataworld population data