BIOMATHlMATICS - NC State Department of · PDF fileBIOMATHlMATICS TRAINING PROGKAM ... method...

BIOMATHlMATICS TRAINING PROGKAM

A GENETIC ANALYSIS OF SERUM CHOLESTEROL;·.AND BLOOD PRESSUE LEVELS IN A LARGE PEDIGREE;.

,'../~'.Y;.'

by

Kelvin Kwoklen Lee

Department of BiostatisticsUniversity of North Carolina at Chapel Hill

Institute of Statistics Mimeo Series No. 1174

JUNE 1978

ABSTRACT'

KELVIN KWOKLEN LEE. A Genetic Analysis of Sennn Cholesterol andBlood Pressure Levels in a Large Pedigree. (Under thedirection of Robert C. Elston.)

Serum cholesterol and blood pressure data from a five-generation

pedigree from Bay City, Michigan with 235 members are analyzed. The

method of pedigree analysis does not require that the pedigree be

divided up into nuclear families, nor does it rely on arbitrary cutoff

points to dichotomize or trichotomize quantitative data. The method

involves calculating the likelihood of observing the phenotypes in

the pedigree based on a genetic hypothesis. Max~ likelihood esti

mates of parameters are obtained, and hypothesis testing is based on

the likelihood ratio criterion.

This study considers three specific underlying genetic models:

(1) major gene model - a single identifiable gene that can account for

a significant portion of the phenotypic variance; (2) polygenic model

the phenotype is controlled by a large number of equal and additive

gene effects; (3) a mixed model - allows for segregation of a major

gene together with polygenic and environmental background.

Hitherto, analyses of pedigree data using the mixed model have

not been attempted for lack of an efficient algorithm to calculate

the likelihood. As a first approach, an easily calculable conditional

likelihood function is maximized.

The results of the analyses indicate an autosomal dominant gene

for hypercholesterolemia segregating in this pedigree. The data are

consistent with an autosomal recessive gene segregating for systolic

hypertension at least in the main branc!l of the pedigree. Little

evidence for a major gene segregating for diastolic hypertension is

detected in this pedigree.

ACN'JOWLEDGMENTS

I wish to express my deep appreciation to my advisor, Dr. R.C.Elston, who suggested the topic of this dissertation and who provided

invaluable guidance, constant encouragement, anti patient tmderstanding.

Appreciation is also expressed to the other members of my advisory

committee, Drs.R.C. Elandt-Johnson, R.R. Kuebler, M.J. Symons, and

H.A. Tyroler; they made many valuable and helpful suggestions.

I am grateful to Dr. Kurt Hirschhorn for generously providing the

data which are analyzed in this dissertation.

The programming aid of Ellen Kaplan is gratefully acknowledged.

Without her programming support, this work may never have been com-

pleted. Thanks go to Geoffrey Day of the Radiation Effects Research

Fotmdation in Hiroshima, Japan, for his talented drawing of many of

the figures. Thanks to Susan Stapleton who typed this dissertation.

This investigation was supporteJ by NIH Training Grant No. T-Ol

GM00038 from the National Institute of General Medical Sciences. This

support is gratefully appreciated, as is the computer time provided by

the Biostatistics Department.

Finally, I thank my parents for their encouragement, tmderstanding,

and support throughout the years.

ACKNOWLEDGME1'ITS

LIST OF TABLES

LIST OF FIGURES

Chapter

TABLE OF CONfENTS

. . . . . . . . .P,age

ii

viii

xi

I . INTRODUCfION AND LITERA11JRE REVIEW. .

1.1 Introduction .

1.2 Essential Hypertension.

1.2.1 Definitions ..

1.2.2 The Pickering School

1.2.3 The Platt School

1.2.3.1 The Pre-1960 Hypothesis.

1.2.3.2 The Post-1960 Hypothesis

1

1

1

1

3

7

8

10

1.2.4 The Contributions of Other Investigators.. 12

1.3 Familial Hypercholesterolemia .

1.3.1 Classification of Familial Hyper1ipidemias . . . . . . .

1.3.2 Type II Hyper1ipoproteinemia..

1.3.3 The Genetics of Familial Hypercho1estero1e~a. .

1.4 Synopsis of the Problem...

II DESCRIPTION OF THE KINDRED..

2.1 Source of the Kindred..

2.2 Analysis of the Data as if From a Sample ofIndependent Individuals ....

2.2.1 Sex Differences ....

2.2.2 Relationships with Age.

21

21

26

28

33

35

35

36

47

47

III DESCRlPTION OF THE PEDIGREE

Chapter

3.3.2.2 Serum Cholesterol

2.2.3 Skewness and Kurtosis ..

viPage

57

60

66

69

69

73

76

78

80

80

80

90

90

94

94

99

105

105

106

111

118

118

124

128

134

Inter-trait Correlation .

3.3.2.3 Systolic Blood Pressure andSerum Cholesterol .

3.3.1.3 Sperry Cholesterol ..

3.3.1.4 Zak Cholesterol

2.2.4

4.4.3 Systolic Blood Pressure ..

4.4.4 Diastolic Blood Pressure

3.3.2 Bivariate Log-Normal Distributions.

3.3.2.1 Blood Pressure ...

2.3 Random Mating.....

4.1 Introduction . . . . · · . .4.2 The Major Gene Model

4.3 Method of Analysis . · · . .4.4 Results of Univariate Analyses .

4.4.1 Sperry Cholesterol . . · ·4.4.2 Zak Cholesterol . .

3.3 Fitting a Mixture of Normal Distributions

3.3.1 Univariate Log-Normal Distributions.

3.3.1.1 Systolic Blood Pressure.

3.3.1.2 Diastolic Blood Pressure

3.2 Age Distribution in the Two Pedigrees

3.1 The Pedigree Structure.

TIIE MAJOR GENE HYParnESIS .IV

Chapter

4.5 Results of Bivariate Analyses .

viiPage

134

4.5.1 Sperry Cholesterol and Zak Cholesterol.

4.5.2 Systolic and Diastolic Blood Pressure

v4.6 Conclusions .

TIlE POLYGENIC HYPOTI-IESIS

5.1 The Polygenic ~IDdel..

. . . . . . . . .

137

137

143

146

146

5.1.1 Sperry and Zak Cholesterol. 151

5.1.2 Systolic and Diastolic Blood Pressure. . . 153

5.1.3 Other Traits

5.2 Conclusions..

VI THE MIXED r-DDEL • • • •

6.1 Method of Analysis

. . . .. . . . .

. .

ISS

161

163

166

6.1.1 Genotypic Classification of Individuals

6.2 Sperry and Zak Cholesterol

167

169

170

177

178

. . .. . . .• •

6.3 Systolic Blood Pressure. • • .

6.4 Conclusions••••

VII Sm.MARY A'ID CONCLUSIONS •

APPENDICES

1. Secondary Hypertension: Hypertension Occurring as aManifestation of a Known Disease. . . . . . 186

2. List of Variables Observed in 1947.

3. List of Variables Observed in 1958 . . . . . . . .187

188

4. Sex, Age, Height, Weight, Systolic Blood Pressure,Diastolic Blood Pressure, Sperry and Zak Cholesterol, Beta and Prebetalipoprotein Values forMembers of the Pedigree Observed in 1958. . . 189

BIBLIOGRAPHY • . • • • • • • • • • . • . • • . • . . 193

LIST OF TABLES

Table Page

1.1 Diagnosis of Hyper1ipoproteinemia. . . 25

2.1 Age Distribution by Sex. . . . . . 37

2.2 Summary Statistics for Original and Natural Logarithmic-Transformed Variables. . . . . . . . . . . . . 42

2.3 Mean and Standard Error of Logarithmic-TransformedVariables By Sex . . . . . . . . . . . . . . . 48

2.4 Mean and Standard Error of Logarithmic-TransformedVariables by Age and Sex . . . . . . . . . . . 49

2.5 LL~ear and Quadratic Regression Coefficients of Agefor Logarithmic-Transfonned Variables by Sex . . . 56

2.6 Skewness and Kurtosis for Original and Age-adjustedLogarithmic-Transformed Variables. . . . . . 59

2.7 Correlation and Partial Correlation CoefficientsBetween Logarithmic-Transfonned VariablesInvolving Systolic and Diastolic Blood Pressureand Sperry and Zak Cho1estero1s . . . . . . . :62

2.8 Correlation and Partial Correlation CoefficientsBetween Logarithmic-Transformed Variables. . . 64

2.9 Inter-spouse Correlations of Age-Adjusted Logarithmic-Transformed Variables. . . . . . . 67

3.1 Age Distribution by Sex and Pedigree

3.2 Mean Age by Sex and Pedigree .....

3.3 Availability of Data for Six Traits by Sex andPedigree . . . . . . . . . . . . . . . . .

3.4 Maximum Likelihood Estimates of the Parameters for aMixture of Univariate Log-Normal Distributions

74

75

77

a. Trait: Systolic Blood Pressure - t~es. . 81b. Trait: Systolic Blood Pressure - Females. 81c. Trait: Diastolic Blood Pressure - Males . . 84d. Trait: Diastolic Blood Pressure - Females 84e. Trait: Sperry Cholesterol - Males . 87f. Trait: Sperry Cholesterol - Females . 87g. Trait: Zak Cholesterol - t~les 91

h. Trait: Zak Cholesterol - Females. . 91

ix

Table Page3.5 Maximum Likelihood Estimates of the Parameters for a

Hixture of Bivariate Log Nonnal Distributions

a. Trait: Systolic and Diastolic Blood Pressure -Males . . · · · · · · · · · · · · · · · · · . 9S

b. Trait: Systolic anu Diastolic Blood Pressure -Females . · · · · · · · · · · · · · · · 96

c. Trait: Sperry and Zak Cholesterol - Hales · 97

d. Trait: Sperry and Zak Cholesterol - Females 98

3.6 Maximum Likelihood Estimates of the Parameters forthe Two Local ~Iaxima for

a. Systolic Blood Pressure and Sperry Cholesterol -~1a.les· • . . . . . . . • • • . • . • . . • 101

b. Systolic Blood Pressure and Zak Cholesterol -Males . . . . . . . . . . . . . . . . . . 101

c. Systolic Blood Pressure and Sperry Cholesterol -Females . . . . . . . . . . . . . . . . . 102

d. Systolic Blood Pressure and Zak Cholesterol -Fet:nales . . . . . . . . . . . . . . . 102

4.1 The Genetic Transition Hatrix for a One-Locus, Two -Allele System. . . . . • • . . . . . . . . . 109

4.2 Parameters of the ~~del and Their Interpretation. .. 112

4.3 Maximum Likelihood Estimates From Univariate PedigreeAnalysis of Sperry Cholesterol Data

a.b.c.

Right Pedigree .Left Pedigree.

Both Pedigrees • · . . . .

· . . . · . .· . . . . · . .

119120

121

4.4 Maximum Likelihood Estimates From Univariate PedigreeAnalysis of Zak Cholesterol Data

a. Right Pedigree · · · · · • · 125

b. Left Pedigree. · · • · · · · 126

c. Both Pedigrees · · · · · · • · · · · · · · · · · · 127

4.5 Maximum Likelihood Estimates From Univariate PedigreeAnalysis of Systolic Blood Pressure Data

a. Right Pedigree • · · · · · · · · · · · · 129

b. Left Pedigree · · · · · · · 130

c. Both Pedigrees · · · · · · .. . · · · 131

x

Table Page4.6 Maximum Likelihood Estimates From Univariate Pedigree

Analysis of Diastolic Blood Pressure Data

a. Right Pedigree . .b. Left Pedigree. . . .

4.7 Maximum Likelihood Estimates of Bivariate PedigreeAnalysis of Sperry Cholesterol and Zak CholesterolLevels .

135136

a.b.c.

Right Pedigree . .Left Pedigree

Both Pedigrees .

138139140

4.8 Maximum Likelihood Estimates of Bivariate PedigreeAnalysis of Systolic and Diastolic Blood PressureLevels

a. Right Pedigree . . .b. Left Pedigree

141142

5.1 Maximum Likelihood Estimates of the Parameters for thePolygenic Model for Sperry and Zak Cholesterol byPedigree . . . . . . . . . . . . . . . . . . . .. 152

5.2 ~~imum Likelihood Estimates of the Parameters for thePolygenic Model for Systolic and Diastolic BloodPressure by Pedigree . . . . . . . . . . . . . .. 154

5.3 Maximum Likelihood Estimates of the Parameters for thePolygenic Model for Other Traits by Pedigree . .. 156

6.1 Maximum Likelihood Estimates of the Parameters for theMixed Model for Sperry and Zak Choles terol byPedigree . . . . . . . . . . . . . . . . . . . .. 171

6.2 Variance Component Estimates, Proportion of the TotalVariance, and Total Heritability Estimates for theVarious Traits by Pedigree . . . . . . . . . . .. 172

6.3 Maximum Likelihood Estimates of the Parameters for theMixed Model for Systolic Blood Pressure byPedigree 175

38

LIST OF FIGURES

Figure Page

1.1 The Five Types of Lipid and Lipoprotein Patterns forPatients with Familial Hyperlipoproteinemia. 23

2.1 Cumulative Plot of Age by Sex

2.2 Cumulative Plot by Sex

a. Systolic Blood Pressure and Ln(SBP) 43b. Sperry Cholesterol and Ln(Sperry) . . . . . . • . 44c. Weight and Ln(Weight) . . . . . . • . . 45d. Height and Ln(Height) .... . 46

3.1 a. Right Pedigree.

b. Left Pedigree

3.2 Empirical and Theoretical Cumulative Plots AfterFitting a Mixture of Log-Nonnal Distributions

70

71

a. Trait: Systolic Blood Pressure - ~4ales. · · 82

b. Trait: Systolic Blood Pressure - Females · · · · 83

c. Trait: Diastolic Blood Pressure - ~~les · · · · 85

d. Trait: Diastolic Blood Pressure - Females · · 86

e. Trait: Sperry Cholesterol - Males · · · · · 88

f. Trait: Sperry Cholesterol - Females 89

g. Trait: Zak Cholesterol. . · · · . . 92

3.3 Plots of the Estimated Distribution Means for SystolicBlood Pressure and Serum Cholesterol by Sex Correspondingto the Two Local Maxima of the Likelihood. . . . 104

6.1 Component and Total Theoretical Density Functions

a. Sperry Cholesterol . . . . · · · · · · ·b. Zak Cholesterol . . . . · ·c. Systolic Blood Pressure · · · . . · · · ·

173

173176

GlAPTER I

INTRODUCTIO:-J Ai'JD LITERATl..jRE REVIEW

1.1 Introduction

ThrougIl tile years, hign senun cholesterol levels and high blood

pressure have been recognized as two of the many risk factors for

coronary heart disease. Indeed, in 1970, the Report of the Inter

Society Commisson for heart Disease Resources (1970) named three risk

factors, hypercholesterolemia and hypertension along with smoking, as

being major risk factors for premature atherosclerotic disease,

especially coronary heart disease. Due to the serious dimensions of

morbidity and mortality attributable to both essential hypertension

and familial hypercholesterolemia, tilere have been many investigations

concerning these two conditions. There is no question that heredity

plays a role in both conditions. The debate concerns how big that

role is, and what is the genetic mechanism. In this chapter, a review

of the role heredity plays in both essential hypertension and familial

hypercilolesterolemia will be given.

1.2 Essential Hypertension

1.2.1 Definitions

In order to study tile heredity of essential hypertension properly,

one must differentiate between it and secondary hypertension. Essential

hypertension has always been defined by exclusion since no pathognomonic

biocllemical or metabolic abnormality has yet been identified. If hyper

tension is preceded by a specific cause or a specific lesion, then it

2

is tenned secondary hypertension (~'~ndlowitz 1961) (See Appendix 1 for

a list of causes). Essential hypertension is actually whatever remains

after exclusion, consequently, essential hypertension Iileans hypertension

without evident cause and is usually characterizeo by elevated arterial

pressure. It has also been called primary hypertension because the

hypertension precedes any cardiovascular changes.

ifuat is the cut-off point dividing the hypertensives and non

hypertensives? A scan of the literature will reveal almost as many

division lines as there are investigators. The dividing lines range

from about 120/80 to about 180/110 (Pickering 1961). In 1959, tIle

Conference on Methodology in Epidemiological Studies in Cardio-

vascular Diseases met in Princeton, i~ew Jersey (Pollack and Krueger

1960). The Conference, recognizing that clinical usage demanded

arbitrary but tmiform criteria of nonnal and of abnormal arterial blood

pressure, suggested the follO\iing criteria: Any person with systolic

pressure at or above 160 nun Hg or (inclusive or) diastolic pressure

at or above 95 nun Hg definitely is hypertensive. Those with systolic

pressure below 140 nun Hg and diastolic pressure below 90 mn Hg are

considered to be norrnotensives. The residual blood pressure levels

represent the questionables and are left up to the individual investi

gator. Although these criteria have not been tmiversally accepted,

many studies, including the Framingham Study (Kannel and Gordon 1970) ,

the Evans COLDlty Study (Cassel lY71) , and the U.S. National Health Survey

0~ational Center for Healtll Statistics 1966), have adopted them so

that some comparisons of results are possible.

Concerning the genetics of essential hypertension, there are pri

marily two Schools of Thought. One School, whose main proponent is Sir

3

Robert Platt, says that essential hypertension is a specific disease

entity and that the population can be separated into subgroups, those

with essential hypertens ion and those who are nonnotens ives . This

School has further hypothesized that the disease is detennined by a

gene with incomplete dominance. The other School, whose main proponent

is Sir George Pickering, maintains that essential hypertension is not

a specific disease entity, that a person inherits not a disease

essential hypertension, but rather a large number of genes which deter

mines a particular level of blood pressure, and those individuals

categorized as haVing essential hypertension are simply the ones whose

blood pressures fallon the upper end of a continuous unimodal frequency

distribution. This School says that essential hypertension is deter

mined by multifactorial inheritance, a combination of genetics and

environment .

This controversy, lively and bitter at times, has spanned more

than two decades. For the remainder of section 1. 2, the argt.mlents

advanced in support of and in opposition to the two Schools will be

presented. The important contributions made by'other investigators

will be cited.

1. 2.2. The Pickering School

Sir George Pickering ana his supporters (Pickering 1968; Cruz-Coke

1960; Hamilton et al 1954c)compare essential hypertension to human

stature; both are classical examples of polygenic inheritance. In

their view, any cut-off line between the hypertensive and non-hyper

tensive segments of the population can only be arbitrary (Hamilton et

al1954a).

Since it is known that mean blood pressure levels increase with

4

age for the two sexes, it would be misleading to compare two populations

of different ages and sexes. Hamilton, Pickering, Fraser-Roberts, and

Sowry (1954b) corrected for age mlo sex differences by computing age

and sex-adjusted scores. The effects of age are dealt with by adjusting

all readings by detennining how much each person's blood pressure level

is above or below the appropriate mean for his age and sex, and then

multiplying the deviation by a factor to make it equivalent to the

deviation at some standard age. Using these scores, they build evidence

in support of the multifactorial inheritance theory for essential

hypertension:

1) The frequalCY distribution of the adjusted blood pressures is a

continuous unimodel curve (Hamilton et al 1954c; Murphy et al 1966).

The distribution is not quite Gaussian; it is positively skewed.

Hamilton et al (1954b) studied the distribution of the adjusted scores

for diastolic pressures in three different populations - one group

represents the population-at-large; the second group consists of those

who are first-degree relatives (i.e. sibs, parents, and cllildren) of

propositi liith normal blood pressures (whom Hamil ton et al define as

those with diastolic pressures not exceeding 85 nun Hg) ; the final group

consists of first-degree relatives of propositi with essential hyper

tension (diastolic pressures of 100 rom Hg or more). The distributions

of the adjusted blood pressure scores for the first two groups are

almost identical - unimodal and positively skewed. The curve for the

last group, the relatives of the hypertensives, is still tDlimodal, but

the distribution is shifted to the right. Pickering and his supporters

argue that the consistent wlirnodal distributions indicate that people

with essential hypertension are those whose blood pressures fall in the

5

upper end of a "bell-shaped" distribution. Furthennore, any line of

demarcation to divide the population into two subgroups, the nonno

tensives and the hypertensives, can only be arbitrary; essential

hypertension represents a quantitative, not quaZitative, deviation from

the nonn. In this respect, essential hypertension would be like

stature or intelligence, a multifactorial trait.

2) Pickering and his group looked at the relationship between the

blood pressures of propositi and their first-degree relatives (Hamilton

1954c; -Mial1 et al 1967; Pickering 1967, 19(8). They noticed a

similarity which they tried to quantify by calculating coefficients of

resemblance; which are the regressions of.age-and sex-adjusted scores

for first-degree relatives on the· age-and sex-adjusted scores for pro

positi. The computed coefficients of resemblance between the adjusted

scores of all relatives and all propositi were 0.224 for systolic

pressures and 0.178 for diastolic pressures. 1~en the coefficients

were computed for relatives of hypertensive propositi, they were fairly

constant at about 0.2; this means that if the pressure of a subj ect

deviated from the nonn by 10 mm Hg, then the pressure of his relatives

differed from the nonn on the average by 2 mm Hg. The Pickering School

concludes from this that environmental factors playa major role in

detennining arterial pressure since the coefficient of resemblance is

relatively small - 0.2 for arterial pressure as against O.S for stature

(Oldham et al 1960). This constant coefficient of resemblance says

that the lower the pressure of the propositi, the lower the pressure

of their first-degree relatives of all kind; the higher the propositi

pressure, the higher the pressure for the relatives. This evidence

indicates that the inheritance of arterial pressure is quantitative,

6

or polygenic, and that the inheri tance is of the same kind whether the

arterial pressure is less than the nonm or in the essential }~ertension

range.

Acheson and Fowler (1967) have disputed this evidence. They say

that it is misleading to compare arterial pressure with height since

excess height is not associateci with excess mortality; furthermore,

height is normally distributed in the population, while arterial

pressure has a distribution which is skewed to the right. Acheson and

Fowler also criticized computing coefficients of resemblances using age

and sex-aujusted scores based on single blood pressure measurements

which tend to be highly variable. Miall and Oldham (1963) have recal

culated the coefficients basing them on two measurements. The results

are 0.399 for systolic and 0.302 for diastolic blood pressure, higher

than the 0.2 for both systolic and diastolic pressures when only one

measurement was taken .. They claim that even these may be underestimates

of the true familial resemblance because the use of age- and sex

adjusted scores does not correct for selective mortality; young people

with the higher pressures of genetic origin are more apt to have

lost older relatives with hypertension than older relatives with normo

tension-; Therefore, Acheson and Fowler surmised that genetics may

really play a larger role than the Pickering School have granted.

3) Hamilton, Pickering, Fraser-Roberts, and Sowry (1954c) regressed

systolic and diastolic scores on age for three groups of males and

females: a population sample from a skin disease clinic, relatives

of normal propositi, and relatives of hypertensive propositi. Except

for systolic blood pressures in males, the rates of increase of blood

pressures with age are almost the same for all three samples. The

7

Pickering Scllool advanced these results to argue tllat, ignoring male

systolic pressures, it is not the rate of rise with age that is

important in hypertension inheritance; there is a propensity for

higher pressures at aLL ages. This conclusion is to be contrasted to

that of the Platt School which says that individuals with essential

hypertension are characterized by a sudden rise of arterial pressure

during middle age.

4) The last piece of evidence appeals to logic and intuition. It is

well known that arterial pressure depends on many physiological factors

including cardiac output, radius of vessels, viscosity of the blood,

secretions of the adrenal gland, the electrolyte content of the blood,

the state of the baro-receptors, etc. (Pickering 1968). This dependence

on so many ftictors nas led the Pickering School to the opinion that it

is tmlikely that the inheritance of arterial pressure can be character

ized by one gene. In the opinion of the Pickering School, it is

tmlikely that anyone will be able to find a specific biochemical lesion

for essential hypertension.

1.2.3 The Platt School

Sir Robert Platt and his supporters consider essential hypertension

to be a distinct disease entity. Their arglDllents consist of :two kinds,

their own and those in rebuttal to evidence advanced by the Pickering

School.

Whereas the Pickering School sees the frequency distribution of

blood pressure in the population to be a tmimodal one where the top

10-20%, by middle age, have attained a blood pressure so high as to

carry hazards to survival, the Platt School proposes that there are

really two or more populations instead of only one; there are people

8

who genetically are more prone to develop hypertension in middle age,

and others who are not. The Platt School agrees with the Pickering

School that there is no natural dividing line between normal and

abnormal blood pressures. However, since there does not exist a more

specific test for essential hypertension, one is forced to base con

clusions principally on studies of blood pressures.

The Platt School agrees with clinicians that the age of risk for

essential hypertension is 45-60. Platt (1967) suggests that those with

essential hypertension have blood pressures that have risen steeply

during the middle years, and those who are normotensives demonstrate

no significant rise of blood pressure. Therefore, the Platt School

argues strongly that it is important to study only sibs instead of all

first-degree relatives, since the children of hypertensives are unlikely

to have reached ages 45- 60, and parents of hypertens ives will have

already experienced a selective mortality.

1.2.3.1. The Pre-1960 Hypothesis

Until about 1960, the Platt School (Platt 1959, 1961) hypothesized

that essential hypertension was inherited as a result of a major

dominant gene. Hence, siblings of hypertensive propositi should

segregate into two groups, those who inherited and those who did not

inherit the gene from their parents; it then should follow that a plot

of the blood pressures of siblings of hypertensive propositi should

reveal a bimodal distribution. The Pickering School would argue for

a unimodal distribution. Platt (1959) reanalyzed the data collected

by Hamilton and his co-workers (1954a) and by Sobye (1948) by looking

at just the siblings of hypertensive propositi. The resulting curves

do not appear to be unimodal; it is difficult to distinguish between

9

tlIeir being bimodal or trimodal. Indeed, the curves display troughs at

150 Jml Hg systolic and 90 nun Hg diastolic which conveniently happens

to be the dividing line between nonnal and high blood pressures cited

by many clinicians.

The results of a study by Morrison and Horris (1959, 1960) of 302

London bus drivers and conductors support Platt's findings. ~10rrison

mId Morris studied the blood pressure distributions of clIildren of

hypertens ive and non-hypertens i ve parents. According to the single gene

hypotllesis, children of hypertensive parents should segregate into

roughly two groups, and the distribution of their blood pressures should

be bimodal. On the other hand,children of nonnotensive parents should

also be normotensive and should have a unimodal blood pressure distri

bution. The results of the Morrison and Morris study support the one

gene hypothesis. The distribution of the blood pressures of tlle drivers

and conductors with hypertens ive parents showed bimodality, whereas tlle

distribution for drivers and conductors of non-hypertensive parents was

approximately normal. These findings have been disputed because of

tlle unusual criterion(age at death)that Morrison and ~brris used to

classify the parents as being hypertensive or non-hypertensive.

Lowe and McKeown (1962) conducted a study similar to that, of

Morrison and Morris of 5239 men working in an electrical engineering

firm. They found no bimodality' in the distribution of the blood

pressures of the middle-aged men who had one or both parents dead.

Ostfe1d and Paul (1963) examined 1989 men of ages 40-55 working for

Western Electric Company in Chicago. Using tlle same method of sub

dividing the parents, Ostfe1d and Paul also were unable to reproduce

tlle results obtained by ~-brrison and ~brris.

10

1.2.3.2. Tne Post-1960 Hypothesis

In about 1963, Platt (1903) perfonned his own study of 350 sibs of

178 hypertensive propositi. Examining the frequency distributions of

the blood pressures by age, he fOlll1d that the distribution is not

Gaussian and does not become Gaussian after a logarithmic transformation.

Platt observed that, with increasing age, there developed a bulge in

the distribution curves in the middle ranges of blood pressures, cen

teririg at a systolic pressure of about 160 mm Hg. By ages SO-59, most

of the sibs were in this middle range. There also developed, with

increasing age, a bulge at the high end of the distribution. In light

of these irregular trimodal distributions, Platt had to modify his

dominant inheritance hypothes is of pre-1960. He proposed the hypo

thesis that essential hypertension is inherited as a gene of incomplete

domin&lce. Those at the high end of the distribution are severe hyper

tensives, inheriting the gene for hypertension in the homozygous form;

persons in the middle range represent those with moderate llypertension,

inheriting the gene for hypertension in the heterozygous form;

finally, those in the lower end are the nonnotensives, inheriting two

nonnal genes.

The results of longitudinal studies of different populations of

sibs by Cruz-Coke (1959) and Perera (1960) support Platt's theory.

They showed that the sibs can be divided into two groups - those whose

arterial pressures rose little with age and those whose arterial pressures

rose steeply with age. In fact, a plot of the logarithm of the

systolic pressure resulted in a curve with three modes.

Platt found further support in this controversy from the Evans

County, Georgia Study conducted in 1960-1962 by McDonough, Garrison,

11

and Hames (1964). This study examined the frequency distributions of

systolic and diastolic pressures in 621 whites and 379 blacks of ages

55-74. The three investigators, perfoming a curve-fitting exercise,

attempted to find the minimum lllD'I1lJer of subgroups compatible wi th four

conditions: (1) Summing the curves for the subgroups must result in the

parent distribution. (2) The distribution for each subgroup should be

nearly normal. (3) The subgroup in the lower tail should have

pressures not mudl different from those seen at younger ages (repre

senting the normotensives whose blood pressures exhibit little rise

with age) . (4) Comparing equivalent subgroups, there should be no

white-black differences in mean blood pressure. The three investigators

found that two subgroups did not satisfy all of the conditions whereas

thr.ee subgroups did. The subgroup at the lower tail represents the

normotensives, possessing two normal genes assuming Platt's hypothesis

of incomplete dominance; those in the middle subgroup represent those

inheriting the gene for hypertension in heterozygous form, and those

in the upper tail represents those inheriting the gene for hyper

tension in the homozygous form. The estimated frequencies, obtained

from curve-fitting, of the three subgroups are surprisingly close to

what would be expected under Hardy-Weinberg equilibrium.

It must be noted that the curves for the three subgroups display

nDJch overlap. If Platt's hypothesis of three genotypes is correct,

then blood pressure is a poor discriminator; most blood pressure levels

could be the expression of more than one genotype; misclassification

cou~d result. More sensitive and specific methods for separating the

genotypes are needed. Furthermore, Inerely dividing a distribution into

subgroups can offer no confirmation of their physical existence

(~1urphy 1964).

12

1.2.4. The Contributions of Other Investigators

Hall (1966), in his PhD dissertation in 1966, compared the direct

and indirect methods of measuring arterial pressures. The direct

method of measuring blood pressure involves cormecting the artery

directly to a manometer with a hollow tubing. The indirect method,

the sphygmomanometer, is better known and involves a soft rubber cuff,

a colUlIU1 of mercury, and a stethoscope. Hall adjusted for differences

in ages, in tricep skinfold, in sub-scapular skinfold, and in mid-ann

circumference by including these variables along with direct blood

pressure reading as independent variables in a regression model with

the difference between direct and indirect readings as the dependent

variable. He thus could obtain an equation involving the difference

between direct and indirect readings as a function of direct blood

pressure readings, after adjusting for the ~ther independent variables.

A plot of the equation showed that the difference between direct and

indirect values increased with increasing direct measurements. In

other words, with increasing arterial pressures, the indirect readings

were increasingly underestimating the direct readings. Since almost

all frequency distributions of blood pressures have been based on

indirect measurements, Hall's result:suggests that it may be necessary

to modify the shapes of these frequency distributions. The overall

effect will be to extend the right-hand tails of the frequency curves

and, as a result, perhaps to sharpen the divisions separating possible

subgroups. Unfortunately, for a large population, it is difficult

and impractical to make direct measurements on every person in the

study.

Hall used the data from the Evans County Study to estimate

13

Hall assumed that the heterogeneous population consistedZ

(fll' 0 ),

analytically the same parameters that McDonough et al (1964) estimated

graphically.

of three normally distributed subpopulations with parametersZ Z

(flZ' 0), and (fl3' 0) in the proportions aI' aZ' and a3, respec-

tively (I a.= 1). Hall used the method of maximum likelihood to. 11

estimate the six parameters, (fll' flZ' fl3' aI' aZ' 0). It should be

noted that the standard errors for the estimates of the proportions

are quite large, particularly for the black population. In co~aring

Hall's estimates with those of McDonough et al (1964) obtained by free-

hand curve-fitting, one can observe that the proportions are similar for

blacks, but not for whites. In looking at the estimated means, Hall

fOlD1.d that the subpopulation means for blacks and whites are very

similar. However, there are greater proportions of blacks in the sub

populations with the higher arterial pressures than of whites. As a

result, the overall mean blood pressures for blacks are higher than

those for whites, with the higher frequency of a hypertens ion gene

possibly accolD1.ting for the observed differences.

For many years, the controversy between the Platt and Pickering

Schools has reached a standstillj neither School has been willing to

concede much to the other. As a result of this lack of progress, some

researchers have decided to try other approaches to try to resolve the

differences.

Some reasoned that if essential hypertension is determined by a

single major genetic factor, then by the "one-gene, one..enzyme hypo

thesis", a unitary defect in a biochemical mechanism is worth seeking

(McKusick 1960a, 1960b). If fOlD1.d, it coulc;i be the basis for the

treatment of essential hypertension. In 1959, Mendlowitz et al (1959)

14

boldly singled out a deficiency of the enzyme, O-methyl transferase,

which is important in the degradation of norepinephrine. But, by 1964,

they had dismissed this hypothesis in favor of a gene that modifies

catecholamine metabolism (Mendlowitz et al 1964, 1970). However, other

enzymes like renin and angiotensinase, and hormones like aldosterone

each have their own proponents (Pickering 1968). COllsequently, there

is disagreement regarding which enzyme or hormone is important in

determining essential hypertension.

The Platt and Pickering Schools both have maintained that the final

proof regarding whether essential hypertension is a distinct disease

entity or not may have to await a prospective study lasting 20 years

or more on a large lDlselected population to see whether, with increasing

age, the population will segregate into two subgroups, those whose

pressures rose steeply and those wllose pressures remain little changed.

There have been several longitudinal studies of the kind suggested.

Miall and Lovell (1967) reported the results of a longitudinal

study in South Wales. They fOlDld that age, per se, plays no direct

part in determining the rate of change of blood pressure, and that

changes in blood pressure are more closely related to the attained level

of blood pressure than to age. In other words, the higher the

individual's blood pressure, the greater will be the rate of increase

in his blood pressure with time. These results are in accord with the

Platt School.

Another longitudinal study was conducted by Harlan, Osborne, and

Graybiel (1962); they observed a relatively homogeneous group of white

males over anl8-year period. The 1056 healthy white Navy pilots were

first examined in 1940. None of them had a blood pressure level

15

over 132/86 at tllat time. They were re-examined in 1951-1952 and 1957

1958. After closely scrutinizing the frequency distributions of the

blood pressure levels, the three investigators concluded that there

was no evidence of a natural bimodality to suggest any evidence of

qualitatively different populations; furthermore, they suggested that

their study confirmed the fact that hypertension is a quwltitative

difference in blood pressure determined by a nroltiplicity of factors,

both genetic and environmental.

Unfortunately, this study and several like it suffer from two basic

flaws. One is that no attempt is usually made to exclude secondary

hypertensives. However, this is not as serious as the second flaw -

the study populations are usually highly selected. Sampling biases

exclude certain subgroups from the sample. The population in Harlan,

Osborne, and Graybiel's study excluded individuals who had high blood

pressures to begin with. Males with high blood pressures were dis

qualified and could not become i~avy pilots. As a consequence of this

deficiency, any presumed hypertensive subgroup may be so small as to

be obscured in the tail of the larger subgroup. The same phenomenon

may be observed in using insurance policyholders as the study popula

tion. Those with high blood pressures may be selected out since they

can be refused insurance or may have to pay higher premiwns for

insurance.

Feinlieb et al (1969) reported the results of a longitudinal study

of the relationship between blood pressure and age; the study was

based on data from the Framingham Study. Among the questions they

wanted to answer are two that are pertinent to the present discussion.

First, how does blood pressure change on a longitudinal basis; second,

16

to what extent <10 changes in blood pressure over time depend upon an

initial blood pressure? The Framingham Study is well suited to answer

these questions since the people in the study represent a cohort of

over 5,000 persons who have been examined biennially for almost 20

years. The report of Feinlieb et al covers the first seven examinations,

and it should be noted that secondary hypertensives have not been

eliminated.

The cross-sectional patterns of blood pressure in this study

agree with those of other population studies; with increasing age,

systolic blood pressures tend to rise; the same is true of diastolic

blood pressures, at least in women, and also in men up to about age 60.

To examine these trends in greater detail, Feinlieb et al divide the

study population into age cohorts. They find that, for both men and

women, the longitudinal trends of systolic blood pressures with age

are similar to the cross-sectional trends. The same is true of female

diastolic blood pressure patterns. However, for diastolic blood

pressures in males, although there is a basic trend of a rise of

diastolic blood pressure with age for all the cohorts, there is also a

tendency for the younger cohorts to have higher diastolic blood

pressures than the older cohorts; no explanation could be found for

this phenomenon.

To answer the question of the extent to which changes in blood

pressure over time depends on an initial blood pressure, Feinlieb

and his co-investigators divide the population into systolic blood

pressure cohorts according to the systolic blood pressure levels at a

particular instant in time; these cohorts are then examined for longi

tudinal trends. They find that, regardless of whether the initial

17

systolic blood pressure was 100-109 or 160-169, the longitudinal trends

in systolic blood. pressure are parallel. These longitudinal trends

remain even after the systolic blood pressure cohorts are subdivided

according to age. From this, Feinlieb et al conclude that the change

in an individual's blood pressure level later in life does not depend

on his blood pressure level earlier in life; there is a propensity for

an increase in blood pressure with age regardless of any earlier levels

of biood pressure. This result is markedly different from that of ~1iall

and Lovell (1967); there is no tendency for the population to separate

into two groups, those whose blood pressures rose steeply and those

whose blood pressures remained little cllanged.

Studies that have been used to determine the relative magnitude

of genetic and non-genetic effects on a trait include twin studies.

Investigators have been attracted to twins possibly because the analysis

seems simple. The basis for twin studies is that monozygotic (MZ) twins

are identical in their genetic constitution so that any differences

between them can be ascribable to non-genetic influences; dizygotic (DZ)

twins are related to each other in the same way as ordinary full

siblings. The twin study method assumes that the zygosity of the

pairs of twins has been determined correctly.

Using the observed among- and within-pair variances for the two

types of twins, twin study investigators have obtained estimates of

the heritability, the proportion of the total variation in a trait

accounted for by heritable effects. With regard to blood pressure,

the results of twin studies have not been consistent.

Osborne, DeGeorge, and Mathers (1963), in their study of the

blood pressures in S3 pairs of twins, found no significant difference

18

between the intrapair variances for MZ twins and DZ twins. Downie,

Boyle, et al (1969) also found no differences in the intrapair variance

in their series of 109 pairs of twins. Both groups conclude that

variability in blood pressure levels is predominantly under environ

mental influences.

Mtllhany, Shaffer,and rfines (1975) found that, in tl~ir series of

200 pairs of twins, genetic factors play an important role in determining

blood pressure levels. Their estimates of heritability for systolic

blood pressure are 0.73 and 0.56 for females and males, respectively,

and for diastolic blood pressure, 0.61 and 0.41, respectively. Borhani,

Feinlieb et al (1976) gathered 514 white male twin pairs from the

records of the Veterans Administration. Based on a method to estimate

the heritability which eliminates possible biases that may result

because the total variance in MZ twins was smaller than in DZ twins,

their estimates of ileritabi1ity are 0.8 for systolic blood pressure and

0.6 for diastolic blood pressure, both indicative of a major contribu

tion of genetic factors.

A recent paper by Elston and Bok1age (1978) is particularly relevant

to this brief examination of twin studies. They studied the fundamental

assumptions underlying the twin method and find that, of the many

assumptions, some have been discredited, some have not been tested,

and same are untestab1e. Consequently, they conclude that they have

serious reservations about estimates of heritability based onZy on

twin studies and question wilether, in most cases, the results of

genetic twin studies are applicable to the general population.

Finally, mention should be made of a study that is rich in both

ambition and potential: The Detroit Project Studies of Blood Pressure

19

using the family set method. The goal of the project is to test concur

rently medical, environmental, sociopsychological, and genetic hypo

theses for blood pressure variation (Harburg, Erfurt et al 1977 ) •

The study design consists of selecting four census areas in

Detroit to represent extremes of stress areas for blacks and whites;

the areas are designated black high stress, black low stress, white

high stress, and white low stress. rligh and low stress areas are areas

which differ markedly with respect to socio-economic variables (e.g.

income, education, occupation) and instability variables (e.g. crime,

marital instability, residential instabiIity). Wi thin each of the

four areas, family sets are collected. A family set consists of five

persons: an index case, his or her sibling, his or her first cousin,

his or her spouse, and an unrelated individual from the same area, of

the same sex and of a similar age who is a potential index case. Three

persons of the set are genetically related (index case, sib, and first

cousin) while the other two share an environmental connection with

the index case. The spouse serves as a "proximal environmental"

control while the 'unrelated person controls for environmental factors

which are within the same socio-environmental area as the index case.

There have been several reports of results from the Detroit

Project studies. One considers the relationship between socio

ecological stress areas and blood pressure (Harburg, Erfurt et al 1973).

Tne investigators found that black males living in a high stress area

have the highest blood pressure levels of all eight race-sex-res,idence

groups. Their blood pressure levels are significantly higher than

black males living in a low stress area. There is no significant

differential in blood pressure between this latter group and other

20

white groups. These results suggest an envirorunental, or more specifi

cally a socio-psychological influence on blood pressure.

More recently, there is a report examining family aggregation of

hypertension where systolic hypertension is defined as ~ 160 mm Hg

and diastolic hypertension as ~ 90 rom Hg (ScllUll, Harburg et al 1977).

The investigators are able to find only a weak tendency for diastolic

hypertension and less for systolic hypertension to aggregate in family

sets. Since this finding is in disagreement with other studies which

show familial aggregation of hypertension (Thomas and Cohen 1955;

Gearing, Clark, et al 1962; Ostfeld and Paul 1963), the authors suggest

that prior studies may have confounded envirorunental and genetic

correlations.

The family set has been used to estimate the heritability of blood

pressure by which is meant the proportion of the total variation in

blood pressure that can be accounted for by heritable effects

(Chakraborty, Schull, et al 1977). Consider a family set consisting

of the index case, sib, first cousin, and the unrelated control. The

covariance between members of the family set with respect to the trait

can be expressed as a function of an additive genetic variance, a

dominance variance, and an envirorunental variance (Falconer 1960).

From estimates of these variance components, the estimate of herit

ability can be computed. For the Detroit Project data, although the

estimates of lleritability are quite erratic, they show a tendency to be

relatively low. The investigators conclude from this that nongenetic

variables contribute more to observed blood pressure variation than do

genetic differences between individuals. Advocates of a large heri

table component in blood pressure variability certainly cannot find

much in these results to agree with.

21

1.3 Familial Hypercholesterolemia

Before the discovery and availability of sophisticated biochemical

procedures, familial and non-familial hyper1ipidemias, diseases which

are characterized by an increase of one or more plasma lipids and of

which familial hypercholesterolemia is but a subgroup, were first

discovered through its secondary manifestations, lipid deposits in

tendons and subcutaneous tissue called xanthomatosis. However, recent

advances in the laboratory have allowed the familial hyper1ipidenias

to be subdivided into subgroups using plasma levels of cholesterol and

triglycerides and plasma lipoprotein patterns.

1.3.1. Classification of Familial Hyperlipidemias

The most cOIllIOOn lipid in plasma is usually phospholipid; it is

believed that its flUlction is to bind other lipids to plasma proteins.

The next JOOst conunon lipid in plasma is cholesterol, with about

three-fourths of the total cholesterol usually esterified with long

chain fatty acids. The third most canmon plasma lipid is triglyceride.

There are two major sources of triglyceride; one is exogenous or from

the diet; the other is endogenous which origina"ces mainly from the

liver. After these three classes of plasma 1ipids, there are several

other lipids, but of smaller concentration. Among these are the free

fatty acids, carotenoids, vitamin A, and glycolipids (Frederickson and

Lees 1972).

The endogenous lipids and those from the diet must be transported

through the blood vessels. However, since the maj or plasma lipids,

phospholipid, cholesterol, and triglyceride, are not soluble in serum,

they do not circulate free in the senun. Rather, they circulate bOlUld

to proteins by forming stable lipid-protein complexes called lipo-

22

proteins. Thus, lipoproteins are the units of lipid transport (Levy

1971) .

Advances in laboratory procedures since 1950 have olanged the

classification of familial and nonfamilial hyperlipidemias into one

for hyperlipoproteinemias. The lipoprotein patterns can be distin

guished using an ultracentrifuge or paper electrophoresis.

Using ultracentrifugation, the lipoproteins can be separated into

four groups (Stone and Levy 1972; Frederickson and Lees 1972):

1) High density lipoproteins (HOL) - commonly called alpha

lipoproteins, density> 1.063 grn/ml.

2) Low density lipoproteins (LDL) - common referred to as beta

lipoproteins, density between 1.006 and 1.063 grn/ml.

3) Very low density lipoproteins (VLDL) - commonly known as prebeta

lipoproteins, density between 0.95 and 1.006 grn/ml.

4) Chylomicrons - density < 0.95 gm/ml.

As a laboratory procedure, ultracentrifugation tends to be

expensive and difficult to use. Paper electrophoresis, while rela

tively rapid, simple, and inexpensive, does not have the resolution

power of an ultracentrifuge (Frederickson and Lees 1972). As the dif

ferent lipoproteins migrate towards the anode, four separate bands can

be distinguished. The non-migrating band consists of the chylomicrons.

Then, with increasing distance from the origin, come the bands of the

beta-lipoproteins, prebeta-lipoproteins, and the alpha-lipoproteins

(Stone and Levy 1972; Frederickson and Lees, 1972). See Figure 1.1.

Of the four lipoproteins, only chylomicrons, beta-lipoproteins,

and prebeta-lipoproteins are important as far as classifying the

familial hyperlipoproteinemias are concerned. These three lipo

proteins are interrelated in that they consist of the same lipids, but

23

Normal II III IV v

------ ------ ------

/3 /3 ~ , ...: .... ;_~.' 'r' : , ~:.', •

,,,./3 ,,../3

. .';'. i'·~..·:~. i :' •.:.~,... :..:;..:..:.. ~::; ,:"I,."

~

+ t t t I + IC TG C TG C TG C TG C TG

Usual changeIn

plasma lipids

Figure 1.1 The Five Types of Lipid and Lipoprotein Patternsfor Patients with Familial Hyper1ipoproteinemia

C = Cholesterol TG = Triglyceride [FromFrederickson and Lees (1966) p. 435]

24

in differu1g proportions (Frederickson and Lees 1972). The chy1omicrons

are the vehicles for the exogenous lipids, especially the trig1ycerides,

in the blood. The prebeta-1ipoproteins transport principally endo

genous triglyceride. The beta-lipoproteins transport about 75% of the

cholesterol in the serum. As a result of the different compositions,

increased beta-lipoproteins are associated with increases in cholesterol

and phospholipid while increased chy1omicrons and prebeta-lipoproteins,

are associated with increases in triglyceride.

Five familial hyperlipidemias can be distinguished using the

lipoprotein patte~. However, it should be noted that this transla

tion of hyperlipidemia into hyperlipoproteinemia does not imply that

these diseases are determined by mutations at loci regulating the

structure or metabolism of liPOProteins. Investigators feel that,

although the present system of classifying familial hyperlipidemias is

convenient, it will be replaced by a system based on etiology as more

information appears. The value of including lipoprotein patterns in

the classification procedure lies in a small but definite increase in

specificity above that possible in using only plasma lipid concentra

tion measurements (Frederickson and Lees 1972).

The lipoprotein patterns can be discerned by applying a combina

tion of three procedures:

(1) Examination· of the standing plasma after the plasma has been

kept at 4°C for 18-24 hours. A creamy layer at the top indicates the

presence of chylomicrons. A turbid infranate is indicative of increased

prebeta-lipoproteins; a clear infranate can mean increased beta

lipoproteins as these small molecules are completely soluble and do not

refract light (Stone and Levy 1972).

25

(2) Measurement of the plasma cholesterol and triglyceride con

centrations. For each of the five hyperlipoproteinemias, there is a

different cholesterol to triglyceride ratio. Figure 1.1 illustrates

this schematically, while Table 1.1 displays some typical ratios.

Type Definitive lipoprotein Appearanc~ of: Standing Cholesterol-to-pattern Plasma at 4°C triglyceride

Ratio

I Chylomicron present; Creamy supernatant jnormal or decreased clear infranatant 1:9beta- and prebeta-

II Increased beta-jnormal No creamj clear oror increased prebeta-j slightly turbidno chylomicron 4:1

III Abnormal beta-and Creamy supernatant mayprebeta-j abnormal be present; turbid orchylomicron cloudy infranant 1:1

IV Increased prebeta-; No cream;normal beta-; no turbidchylomicron 9:10

V Increased prebeta-j Creamy supernatant jnormal beta-; turbid infranatant~lylomicrons present 1:5

Table 1.1 Diagnosis of Hyperlipoproteinemia[from Stone and Levy(1972) p. 346]

(3) Paper electrophoresis. Plasma samples are obtained from

individuals after they have been on a l6-hour fast. Figure 1.1 shows

the electrop}IDretic patterns associated with each of the five

hyperlipoproteinemias.

26

Table 1.1 summarizes the way in which the five familial h~)er

lipidemias can be distinguished using the lipoprotein patterns:

Type I Hyperchylomicronemia

Type II Hyperbetalipoproteinemia

Type III Combined Hyperbetalipoproteinemia and Hyperprebetalipo

proteinemia

Type IV Hyperprebetalipoproteinemia

Type V Combined Hyperchylomicronemia and Hyperprebetalipoproteinemia

1.3.2 Type II Hyperlipoproteinemia

Type II Hyperlipoproteinemia has been called, for reasons stated

below, familial hypercholesterolemia, hyperbetalipoproteinemia,

familial xanthoma, and familial hypercholesterolemic xanthomatosis.

lID individual with hyperbetalipoproteinemia has higher concentra

tions of beta-lipoproteins, and since beta-lipoproteins transport

principally plasma cholesterol, he has higher levels of plasma choles

terol as well. The triglyceride level is little affected. Type II

is the most common type of familial hyperlipoproteinemia known

(Frederickson and Lees 1972). The clinical manifestations of this

disease include deposition of lipid in the skin and tendons

(xanthomatosis), corner of the eyelids (xanthelasma), eyes (corneal

arcus), and vascular endothelium (atheromatosis) (Frederickson and

Lees 1972; Harlan, Graham and Estes 1966), although not every mani

festatio~ is present in every case.

This disease was probably first reported by Rayer [1836] when

he observed one of its clinical manifestations, cutaneous and

tendinous xanthomas in 1836. Soon thereafter, clinicians noted that

there was considerable aggregation of cases of xanthomas in families,

27

and, as a result, familial xanthoma became well-established as an

entity. In 1873, a connection between xanthomas and blood lipids was

suggested by Quinquad (Chauffard and LaRoche 1910). In 1913, after

observing similar lesion in the arteries of cholesterol - fed rabbits,

a relationship between xanthomas and hypercholesterolemia was hypo

thesized. A study by Burns (1920) showed that cutaneous xanthomas were

always associated with hypercholesterolemia; consequently, familial

hypercholesterolemic xanthomatosis became the designation for the

disease. Svendsen (1940) declared that the primary expression of the

disease was hypercholesterolemia and that physicians should consider

increased cholesterol levels instead of cutaneous lesions as being

the characteristic sign of the disorder. In the early 1950's, McGinley,

Jones, and Gofman (1952) showed that individuals with xanthomas and

xanthelasma also have increases in beta-lipoproteins. Their studies

were perhaps the first to point out the connection between lipoprotein

patterns and hyper1ipidemias which has resulted in the present scheme

of classifying hyper1ipidemias.

Besides the manifestations mentioned above, one other is coronary

heart disease. However, there is disagreement as to the exact rela

tionship between hypercholesterolemia and coronary heart disease.

Although Harlan et al {l966) observed deaths fram heart disease in

the second and third decades associated with extensive xanthomatosis,

they also found that familial hypercholesterolemia was compatible with

survival into the sixth, seventh, and eighth decades. In 1967, Jensen

and his colleagues (1967) found a significantly higher death rate from

coronary heart disease in family members with hypercholesterolemia

than in family members with normal levels of cholesterol. Piper and

28

Orrild (1956) and Slack and Nevin (1968) fOlUld morbidity rates [rom

coronary artery disease significantly higher in hypercholesterolemics

than in their normocho1estero1emic relatives.

The clinical manifestations of Type II hyperlipoproteinemia

usually appear at an early age. Wilkinson and his coworkers (1948)

and Epstein and his colleagues (1959), in separate studies on the same

population, found that hypercholesterolemia usually was evident before

age 10, in many cases by age of one year.

A specific biochemical defect for Type II hyperlipoproteinemia

has been reported. Goldstein and Brown (1974, 1975) report that

familial hypercholesterolemia, in vitro, is due to a mutation involving

a regulatory protein. They identify on the cell surface of cultures of

normal human fibroblasts a regulatory molecule, the low density lipo

protein (LDL) receptor. In normal cells, the binding of LDL to the

receptor reduces cholesterol synthesis by suppressing 3-hydroxy - 3

methylglutaryl CoA reductase which is a rate-controlling enzyme. The

binding also enhances the rate of degradation of the lipoprotein. The

homozygotes for familial hypercholesterolemia lack the LDL receptor,

while in heterozygotes there is a reduction in the number of LDL

receptors.

1.3.3. The Genetics of Familial Hypercllolesterolemia

There is no longer mum controversy over the concept that a single

autosomal gene mutant for Type I I hyperlipoproteinemia gives the bearer

almost 100% certainty that he will have hyperbetalipoproteinemia.

There is almost total agreement that the disorder is due to a dominant

gene.

29

There have been numerous genetic studies of patients with familial

hypercholesterolemia. Investigators, in studying the genetics of

hypercholesterolemia, have used generally the same basic approach.

First they determine the cholesterol level in each member of a family

or of several families. Then, using a predetermined (either statis

tically computed or, more often, adopting one cited by clinicians) cut

off point, eadl member is categori zed as being hypercholesterolemic or

having normal cholesterol values by observing whether his cholesterol

value is above or below the cut-off point, respectively. If the family

spans more than two generations, it is broken down into two-generational

families. The parents are then categorized by mating types, i.e. both

hypercholesterolemic, both normal, or one normal and one affected.

For each mating type, the investigators test for Mendelian segregation

ratios among the offspring. The difficulty in interpreting the results

of the studies has been due to a lack of uniformity in defining hyper

cholesterolemia; part of the problem has been the many different

laboratory procedures used in determining cholesterol levels. 1ft

addition, there has been a deficiency of matings of certain phenotypes.

Wilkinson and his colleagues (1948) were one of the first to

conduct an extensive genetic study of hypercholesterolemia. They

investigated the condition in a family of over 200 members in 1948.

Based on their observation that about a half of the offspring resulting

from a mating between a hypercholesterolemic parent and a parent with

nonna1 cholesterol values became hyperdlo1esterolemic, they proposed

that the condition was determined by a dominant gene. In fact, after

observing the occurrences of xanthomatosis in the family, they con

cluded that the gene produced a moderate increase of serum cholesterol

30

wilen present in a single dose (heterozygous) and a large increase in

serum cholesterol along with severe xanthomatosis and a higher suscep

tib ility to coronary heart disease when present in a double dose

(homozygous). The contention that xanthomatosis is fOl.md only in

homozygous individuals was reaffirmed by Hirschhorn and \~ilkinson

(1959) in a separate study in 1958. Aldersberg et al (1949), Herndon

(1954), and Godal et al (1956) have agreed with this interpretation

based on results obtained fram studying other families.

During the 1950's and, early 1960's, investigators began to question

this interpretation. Among the dissenters were Alvord (1949), Stecher

and Hersh (1949), Leonard (19S6), Piper and Orrild (1956), Wheeler

(1957), Harris - Jones et al (1957), and Guravidl (1962). They began

to find patients with xanthomatosis (supposedly horoozygous abnormal)

who either had an offspring or a parent with normal cholesterol levels.

They also found two xrolthomatous parents producing some offspring who

had normal dlolesterol levels. In 1966, Harlan et al (1966), found

that out of 42 children of xanthomatous parents, 19 had normal

dlolesterol levels. As a result of these discrepancies, the theory

was advanced that the inheritance of hyperdlolesterolemia can be

explained on the basis of a simple dominant gene and that xanthomatosis

and high cholesterol readings are different expressions of the same

gene.

In 1964, Khaclladurian (1964) noticed that, in his study of 10

Arab sibships, both parents of young children afflicted with high

cholesterol readings and extensive xanthomatosis suffered from

abnonna1 cno1esterol concentrations. He reasoned that if he were to

assume that such children were homozygous affected, then both of their

31

parents and all of their offspring would have to be hypercholesterolemic.

He then attempted to locate similar individuals in other studies. Six

homozygous affected individuals were located in Epstein et aI's study

(1959) and one each from Meilman et al (1964), Piper and Orrild (1956),

and Adlersberg et al (1949). In eacll case, both parents were h)~er

cholesterolemic. He was unable to find any homozygous affected

producing any offspring, perhaps because of death from coronary neart

disease in the first or second decade. From these findings, Khachadurian

was able to define the phenotype of a hOlOOzygously affected individual:

he has markeu hypercllolesterolemia and extensive xanthomatosis usually

developing before age 15. Xanthomascan also develop in the heterozygous

individual, but the lesions develop later in life, are smaller and

fewer in number. The levels of beta-lipoprotein in the heterozygous

individuals are about twice those in the normal individuals, and the

homozygotes have levels that are two to three times higher than those

in the heterozygotes. Although the heterozygote often dies prematurely

of vascular diseases, the homozygote rarely survives to adulthood. In

other words, the homozygous abnormal genotype is often associated with

the more severe expressions of the disease.

In 1972, Jensen and Blankenhorn (1972) challenged the conclusion

that a single dominant gene is the IOOde of inheritance for familial

hypercholesterolemia. Their evidence consisted principally of pointing

out instances where the results of previous studies did not meet the

strict criteria of Mendelian inheritance, and they concluded that a

more probable mode of inheritance for familial hypercholesterolemia is

polygenic inheritance. They claimed that positively skewed Gaussian

distributions are more easily explained by polygenic inheritance

weighted with a few hypercholesterolemic genes than as a composite

32

curve of two distinct populations. They fOLU1d that the cholesterol

levels of hypercholesterolemic children are closer to that of the

midparent (the mean of the cholesterol levels for both parents) than

to that of the hypercholesterolemic parent. Furthermore, the polygenic

theory could explain the observed phenomenon of skipped generations

more reasonably than the single dominant gene theory. Jensen and

Blankenhorn found children with high cholesterol levels which exceeded

the sum of the parental levels which they cite as evidence of heterosis.

Finally, they cite evidence of observed outbreeding in several studies.

In part as a response to the polygenic theory of Jensen and

B1ackenhorn (1972), Schrott, Goldstein, et a1 (1972) studied the

inheritance of familial hypercholesterolemia in a large kindred

spanning four generations with 92 descendants. They observed that the

distribution of serum cholesterol in a family where hypercholesterolemia

is present is bimodal. Using a cutoff point to separate normals from

affecteds, analysis of various mating types produced segregation ratios

which are consistent with monogenic inheritance. Third, the serum

cholesterol level distribution in third degree relatives of hyper

cho1estero1emics was still bimodal. From these three pieces of

evidence, Schrott, Goldstein, et al concluded that familial hyper

dlo1estero1emia is inherited as an autosomal dominant gene.

Elston, Namboodiri, et a1 (1975) used pedigree analysis to study

the genetic transmission of hypercholesterolemia in a 195 member

kindred. In this study of a pedigree, they did not break the five

generational pedigree into two-generational families,and serum

cholesterol was analyzed as a quantitative trait, avoiding the necessity

of using cutoff points to dichotomize or trichotomize the data; these

33

two considerations should result in a more powerful analysis. They

found that a mixture of bvo lognormal distributions fits the cllolesterol

data better than a single lognormal distribution. From their pedigree

analysis, they concludedthat there is a dominant gene segregating for

hypercllolesterolemia in their kindred.

During the past several years, there have been reports of evidence

for linkage between a hypercholesterolemia locus and the C3 locus

(Ott, Schrott, et al 1974; Elston, !~amboodiri et al 1976; Berg and

Heiberg 1977). This reported linkage, the results of pedigree analysis,

along with the discovery of the biochemical mechanism,constitute rather

conclusive evidence that a dominant gene is the mode of inheritance of

hypercholesterolemia.

1.4 Synopsis of the Problem

With respect to the genetics of familial hypercholesterolemia, there

is conclusive evidence that it is determined by a single autosomal

dominant gene. One of the purposes of this study is to attempt to

corroborate this by reanalyzing, using more modem methods, the serum

cholesterol data collected by Wilkinson and his co-workers (1948) in

1947 and Epstein and his colleagues (1959) in 1958 from a mU1tigenera

tiona1 family living in or near Bay City, Michigan.

With regard to the genetics of essential hypertension, in spite

of a vo1tmri.nous literature, the role played by inherited factors remains

unresolved. The question of whether a person with essential hyper

tension has inherited a distinct disease or merely a predisposition made

manifest by environmental factors has not been answered. We will analyze

the blood pressure data collected from the same multigenerational Bay

City pedigree using the same methods as for the serum cholesterol data.

34

Chapter II will be a descriptive study of the data collected from

this pedigree where sex differences, transformations, inter-trait

correlations will be examined. In Chapter III, the pedigree structure

will be described. At the same time, attempts will be made to fit a

mixture of more than one distribution to the blood pressure data and

to the serum cholesterol data. In addition to determining whether

there is bimodality or trimodality to the distributions of blood pres

sure 'and serum cholesterol, the estimates of the means, variances, and

admixture proportions of the component distributions will be used in

the genetic analysis.

Chapter IV, V and VI will be the analysis of the data assuming

various underlying genetic models. Chapter IV will consider the major

gene model; a major gene is a single identifiable gene which can

accoun~ for a significant amount of the phenotypic variation. Chapter

V will consider the polygenic rodel where the phenotype is assumed to

be determined by a large number of equal and additive gene effects.

In Chapter VI, the underlying genetic model will be a mixed model by

which is meant a model that allows for segregation of a major gene

together with a polygenic and environmental background. In a sense,

this model combines features of the models of ChaptersIV and V.

CHAPTER II

DESCRIPTION OF TtIE KINDRED

2.1 Source of the Kindred

Members of the kindred which is the subject of the present study

lived in or near Bay City, Michigan~ Bay City is located in central

Michigan on the shores of Saginaw Bay which connects it wi th Lake

Huron. The serum cholesterol data for this pedigree have been analyzed

twice before, once in 1947 by Wilkinson and his colleagues (1948) and

the other time in 1958 by Epstein and his co-workers (1959).

As of 1958, the entire pedigree, whicll spans five generations,

consisted of 383 persons. Scattered among this pedigree are some

rather large sibships. For example, the largest sibship consists of

eighteen members; other examples are two sibships of size 13, one of

size 11, and one of size 10. SUcll large sibships should prove advan

tageous in doing a genetic study.

However, due to death, migration, or recalcitrance, out of the

383 persons in the pedigree, only 284 were examined in 1947 or 1958,

or both. Of the ninety-nine who were not examined, twenty-four were

said to have died before 1947; the remainder either moved from

the area or refused to participate in the study.

The 284 members who were examined can be subclassified into three

groups: 49 who were examined in 1947 only, 123 who were examined in

1958 only, and 112 who were examined in both years. Regarding the

36

49 who were examined only in 1947, thirteen had died between 1947 and

1958, most of the remainder had migrated out of Bay City, while a few

refused to co-operate in the subsequent 1958 study.

Appendix 2 lists the variables which comprise the data for the

161 persons examined in 1947. Two laboratory methods were used to

determine serum cholesterol. Most of the serum cholesterol deter

minations were analyzed using the Bloor method (Todd and Sanford 1943)

while about a dozen were determined by the Schoenheimer-Sperry method

(Sperry 1945). Since serum cholesterol was the variable of interest

in 1947, there were not much data available on other variables. Where

as almost all of the 161 individuals who were examined in 1947 had

cholesterol (160) and cholesterol ester (156) measurements made, less

than half of them (69) had blood pressure data taken. 111erefore, due

to the differing procedures of determining serum cholesterol and to

the lack of other data, the analysis will be confined to the 235

persons examined in 1958.

2.2 Analysis of the Data as if From a Sample of Independent Individuals

Appendix 3 gives a list of the variables which comprise the data

collected for the 235 individuals examined in 1958. The records for

each person include a medical history, results of a physical examina

tion, and laboratory results. Data for each person were extracted from

these records and keypunched onto computer cards. An attempt was made

to identify secondary hypertensives by searching through the records

for any mention of conditions listed in Appendix 1; none was found.

Of course, not every person had data available on every variable.

Although these 235 individuals are part of a five generation kindred,

for the rest of this chapter, the underlying pedigree structure will

37

be ignoreu, and the 1958 data will be examined as if the 235 persons

constitute a sample of inllependent individuals.

Table 2.1 Age Distribution by Sex

Hales Females Total

Number of Number of Number of~ Persons % Persons % Persons %

< 10 35 31.5 35 28.2 70. 29.8

10 - 19 18 16.2 32 25.8 50 21.3

20 - 29 8 7.2 10 8.1 18 7.6

30 - 39 24 21.6 28 22.6 52 22.1

40 - 49 15 13.5 9 7.2 24 10.2

50 - 59 5 4.5 3 2.4 8 3.4

60+ 6 5.4 7 5.6 13 5.5

Total 111 100.0 124 100.0 235 100.0

The 1958 population consists of 111 males and 124 females, and

their age distribution is shown in Table 2.1. The age distributions by

sex are similar; the males are slightly older than the females, with,

the mean age of the males being 24.5 years and that of the females

23.2 years; this difference is not statistically significant. Figure

2.1 shows the cumulative plot of age for the entire population as well

as for each sex. The cumulative plot gives, for each age along the

abscissa, the proportion of the population with an age less than or

equal to it. Half of the population are younger than 20, and only 10%

are older than 50. As both Table 2.1 and Figure 2.1 show, there is

an Unusual scarcity of individuals in the 20-29 age group. One can

38

75

1....... #'

1.0

"""• #'."MAL' /., ....MALE

.,.0.5 ._r

0.5

Z0I-::>al

a:I-~ 00

0 25 50 75 0 25 50 75w> 1.

."..",.;AGE

l-e{.J::> TOTAL~::>u

AGE

Figure 2.1 Cumulative Plot of Age by Sex

39

only sunnise the reason or reasons for this. Possible explanations

include military service, away at college, or couples marry and move

away. HoWever, these reasons seem less plausible when one discovers

that there is a dip in the age distribution for the 1947 population

at age group 10-19, and the dip in 1958, ten years later, is just a

continuation of it. Perhaps what is being observed is the result of

birth rate suppression during the Depression and World War.

A medical history was obtained from all subjects, and a physical

examination was perfonned during which height, weight, and blood

pressure were measured, and a non-fasting blood sample drawn. Each

blood sample was divided in half. One half was shipped to a New York

laboratory, where the serum cholesterol level w~ detennined using

the Schoenheimer-Sperry method (Sperry, 1945). For the purposes of

the present study, these values will be called Sperry cholesterol

values. The other half was shipped to the Lipid Metabolism Laboratory

in the Department of Medicine at the Medical College of South Carolina

in Charleston, where the serum cholesterol was measured using the Zak

method (Zak et al 1952); these will be called Zak cholesterol values.

Furthermore, in Charleston, the serum was fractionated in an ultra

centrifuge giving a high density alpha lipoprotein (alphaLP) component

(density> 1.063 gm/ml) ,a low density beta LP component (density

between 1.006 and 1.063 gm/ml) ,and a very lCM density prebeta LP

component (density < 1.006 grn/ml). It should be noted that the latter

component also includes the chylomicrons (density < 0.95 gm/ml), but,

for convenience, it will be called the prebeta LP component. The

quoted upper limits of nonna1 for the above measurements are: Sperry

cholesterol 210 mgt, Zak cholesterol 240 mgt, alpha LP 90 mgt, beta LP

40

140 mg%, and prebeta LP 30 mg% (Wilkinson, Hand, and Fliegelman 1948;

Epstein, Block, Hand and Francis 1959).

Let xl ,x2"" ,xn be a sample of n observations. In addition

to the mean and the standard error of the mean, two summary statistics

which measure departures from normality can be computed (Snedecor and

Cochran 1967).

The first of these statistics is the coefficient of skewness,

denoted by v'Ol or gl' with a positive value indicating that the

distribution is skewed to the right (toward the higher values) and a

negative value for distributions skewed to the left. Let

mp =

nI

i=l(x.-X)p/n

1

thbe the p sample moment about the mean. Then the skewness can be

computed from

[2.1]

!f the x I s are a sample from a normal population, gl is approximately

normal with mean zero and variance 6/n. For sample sizes less than

200, significance levels of &1 are tabulated (for example, the

Biometrika Tables, Pearson and Hartley 1954).

The other statistic is the kurtosis, denoted by g2 or bZ-3,

which can be computed from

[2.2]

The ratio m4/m~ has value 3 for the nomal distribution. Therefore

positive values of &2 indicate that the distribution is more peaked

41

than the nonnal distribution and negative values result from distri

butions that have a flatter top than the nonnal. For samples from the

normal distribution, g2 is asymptotically normal with mean a and

variance 24/n. Since the distribution of g2 approaches normality

very slady, significance levels can be found in tables, like the

Biometrika Tables.

The summary statistics for the variables are shown in Table 2.2.

All the variables except for weight and diastolic blood pressure had

distributions, on the original scale of measurement, with significant

skewness. Since the later genetic analyses will assume normality for

the variables, it is important to transform them in a way to signifi

cantly reduce, if not to eliminate, the skewness. A commonly used

transformation for this is the logarithmic one. Table 2.2 shows that

by making the logarithmic transformation, the skewness for some vari

ables (diastolic blood pressure, phospholipid, uric acid, alpha LP and

prebeta LP) disappeared, while for others (systolic blood pressure,

Sperry and Zak cholesterol, cholesterol ester, and beta LP), although

skewness did not disappear, it was at least reduced. Skewness and

kurtosis will again be considered in section 2.2.3 ; since many of the

traits have significant age effects, it is more valid to look at the

distribution of the traits after they have been adjusted for the effects

of age. Figure 2.2 displays the cumulative plots for several traits

(systolic blood pressure, Sperry cholesterol, height, and weight) both

on the original measurement scale and after the logarithmic transfor

mation.

42

Table 2.2 Summary Statistics for Original and NaturalLogarithmic-Transformed Variables

Number of StandardVariable Scale Persons i\1ean Error Skewness Kurtosis

**Age Orig. 235 23.8 1.2 0.64 -0.48** **Blood Pressure Orig 188 121.3 1.4 0.93 1. 53

(Systolic) Log 4.79 0.01 0.39** 0.27*Blood Pressure Orig. 188 78.1 0.9 0.38 0.41

(Diastolic) Log 4.34 0.01 -0.14 0.08** **Cholesterol Orig. 200 204.5 5.1 1. 64 2.79

(Sperry) Log 5.27 0.02 0.78** 0.46** **Cholesterol Orig. 155 240.4 6.0 1.39 1.45

(Zak) Log 5.44 0.02 0.76** 0.13**Weight Orig. 222 114.0 3.7 -0.02 -1.14

Log 4.58 0.04 -0.70** -0.71****Height Orig. 210 59.1 0.7 -0.78 -0.30

Log 4.06 0.01 -1.11** 0.48**Hemoglobin Orig. 196 13.3 0.1 -0.62 -0.39

Log 2.57 0.01 -1. 02** 1.00**** **Cholesterol Orig. 199 151.1 3.8 1.67 2.98

Ester Log 4.97 0.02 0.75** 0.55**

Phospholipid Orig. 196 300.8 3.0 0.50 0.19Log 5.70 0.01 0.13 -0.24

Uric Acid **Orig. 189 3.4 0.1 0.87 0.36Log 1.18 0.03 0.12 -0.58

** *Alpha LP Orig. 152 74.2 0.8 0.73 0.86Log 4.30 0.01 0.30 0.30

** **Beta LP Orig. 152 137.1 5.4 1. 53 1. 73

Log 4.82 0.03 0.62** -0.01** **

Prebeta LP Orig. 153 21.4 1.1 1. 78 5.17Log 2.89 0.05 -0.35 0.78*

Significance levels are indicated by

0.01 < P < 0.05

** P < 0.01

43

.* ..-MALe .1

1'1I·~

.. r

5.25

5.25

5.25

5.05

15.05

5.0154.86

4.85

4.85

LN (SBP)

4.1515

4.65

4.615

/(.,..........

TOTAL

,JI

..... 1'*

... ;.. ..

FEMALE .1

{tI'

.~

,Jo4.45

1.0

0.5

o4.45

1.0

0.5

0.5

190

o190 4.45

1.0... .

165 190

1615

140

140

115

1115

•• .. -MALE .1

.IrJ

,. r

90 1115 140 165

SYSTOLIC BLOOD PReSSURe (mmHg)

o90

0.5

TOTAL

1.01.-------.,...-----,.---...,...-""4""'..............,....

90

1.0

zo~

:lal

a:IUI

is 05w .>I-«~

:l~:lU

Figure 2.2 Cumulativ~ Plot by Sex

a. Systolic Blood Pressure and Ln(SBP)

44

1.0,.------r-----""'"!"'---_........;to;

0.5

0.5

1.0,.------r------.----:_::-eo--t;. .-

~

0.5

0.5

0 095 225 355 485 4.6 5.1 5.6 6.2

1.0 ... 1.0lII'-~•• •• ,0;" i

.;*'z MALE f MALE

S!I-:>

I1ZI

a:I-Ul

00.5wO. S

>I-

j"et..J:>~:>0

~;+0

225 330 435 4.75 5.20 5.70 6.10.. ..,. .. • 1.0+.. ~ •.#' .-

FEMALE ,* FEMALE ,10/,10

5.1 5.6

LN (SPERRY)

6.2

Figure 2.2 b. Sperry Cholesterol and Ln (Sperry)

4S

TOTAL

1.0r-------.-----....,.----:~

TOTAL

.5 .5

(/ ;~

,I'

,/

" 0/.

20 95 170 245 3.10 3.90 4.70 5.50

WEIGHT IN POUNDS LN (WEIGHT)

1.0r-"----..,....----~--~--....

·5 .5

//170 245 3.90 4.70 5.50

I/O'1.0

MALE MALEZ0~

::JCD

a::~enis .5 ...l .5w> .+ ,. .;v~ .,;';c( /*..J::J

y) ~+++I~:JU

#'•025 95 165 235 3.20 3.95 4.70 5.45

1.0

/1.0

FEMALE FEMALE

Figure 2.2 c. Weight and Ln (Weight)

46

TOTAL

.5

1.0r----..,...-----T"--~........

TOTAL

.5

1.0 ~----r------'----_",,-+

,./ ,./; .;

°30.0 45.0 60.0 75.0 03.4 3.7 4.0 4.31.0 1.0

Z MALE MALE

2I-:::lal

II:I-!!!0w .5 .5>l-e(

~+ .",...J:::l )1 )1:t:::l(J

,; ~.;

•• •• ~

030.0 45.0 60.0 75.0 °3.4 3.7 4.0 4.3

1.0 1.0 ,FEMALE FEMALE

o~~---~--- ~----=,2.0 44.5 58.0 88.0

MelGMT IN INeMES

.5

",. .+•

,+.'•• t

/

.5

~ ++•3.75 4.00

LN (MeIOMT)

4.25

Figure 2.2 d. Height and Ln(Height)

47

2.2.1 Sex Differences

As was mentioned before, there is no significant difference in

the age distributions or the mean age between sexes. Table 2.3 shows

the means and standard errors of the variables by sex.

With regards to the four main variables of interest, there are

no significant sex differences in the means for systolic or diastolic

blood pressure levels nor for Sperry or Zak cholesterol levels. In

addition, no significant sex differences are found in the means for

cholesterol ester, phospholipid, or beta lipoprotein.

However, the males were significantly taller and heavier, had

higher hemoglobin, uric acid. and prebetalipoprotein levels, and lower

alpha lipoprotein levels than the females.

2.2.2 Relationships with Age

Table 2.4 shows the relat ionships of the variables wi th age and

sex. Due to the small number of cases, especially in the higher age

groups, it will be difficult to make valid comparisons of the trends

by sex, but certain trends are evident.

For each sex, systolic blood pressure increases with age. In

general, the data are consistent with the results of other studies in

that the males have higher systolic blood pressures than the females

until about age 50, when the curves cross and after that the females

have higher blood pressures. For the females, the relationship with

age is fairly linear, while in males, the increase is not as steep

after age 30 as before.

For both males and females, there is an increase of diastolic

blood pressure with age; the increase in females is linear but the

diastolic pressure in males seems to peak at age group 50-59 and is

48

Table 2.3 Mean and Standard Error of Logari thmic-Transformed Variables by Sex

Number of StandardVariable Sex Persons Mean Error Test

Age M III 24.55 1. 78(Original) F 124 23.19 1. 58 NS

Blood Pressure M 90 4.79 0.02(Systolic) F 98 4.78 0.02 NS

Blood Pressure M 90 4.35 0.02(Diastolic) F 98 4.34 0.02 NS

Cholesterol M 93 5.29 0.03(Sperry) F 107 5~26 0.03 NS

Cholesterol M 70 5.48 0.03(Zak) F 85 5.41 0.03 NS

Weight M 103 4.67 0.06F 119 4.51 0.05 *

Height ~1 99 4.10 0.02F 111 4.03 0.02 *

Hemoglobin M 90 2.62 0.02F 106 2.53 0.02 **

O1.olesterol M 92 4.98 0.03Ester F 107 4.95 0.03 NS

Phospho- M 92 5.72 0.01lipid F 104 5.68 0.01 NS

Uric Acid M 86 1. 26 0.04F 103 1.10 0.03 **

Alpha LP ~1 69 4.28 0.02F 83 4.32 0.01 *

Beta LP M 69 4.89 0.05F 83 4.77 0.04 NS

Prebeta LP M 69 3.05 0.07F 84 2.76 0.07 **

Significance levels are indicated by:

NS - Not significant * 0.01 < P < 0.05 ** P < 0.01

49

Table 2.4 Mean and Standard Error of Logarithmic-TransformedVariables by Age and Sex

Systolic BPSexes Pooled Males Females

Age No. Mean SE No. Mean SE No. Mean SE

< 10 26 4.60 .02 14 4.62 .02 12 4.57 .02

10 - 19 47 4.70 .02 18 4.66 .02 29 4.72 .02

20 - 29 18 4.83 .02 8 4.83 .03 10 4.82 .02

30 - 39 52 4.83 .02 24 4.87 .03 28 4.80 .02

40 - 49 24 4.85 .02 15 4.86 .03 9 4.85 .02

50 - 59 8 4.97 .06 5 4.93 .05 3 5.04 .13

60+ 13 5.01 .04 6 4.94 .05 7 5.07 .05

Diastolic BP

Age

< 10 26 4.14 .02 14 4.15 .03 12 4.14 .02

10 - 19 47 4.24 .02 18 4.23 .03 29 4.25 .02

20 - 29 18 4.39 .01 8 4.39 .03 10 4.39 .01

30 - 39 52 4.42 .02 24 4.44 .02 28 4.40 .02

40 - 49 24 4.44 .02 15 4.44 .03 9 4.44 .03

50 - 59 8 4.52 .04 5 4.51 .05 3 4.53 .09

60+ 13 4.49 .04 6 4.43 .07 7 4.54 .05

50

SperrySexes Pooled Males Females

Age ~o. Mean SE No. Mean SE . No. t-lean _SE

< 10 38 5.17 .04 18 5.16 .04 20 5.18 .06

10 - 19 47 5.10 .04 17 5.05 .08 30 5.13 .05

20 - 29 18 5.32 .07 8 5.44 .14 10 5.22 .05

30 - 39 52 5.33 .04 24 5.36 .04 28 5.31 .06

40 - 49 24 5.41 .06 15 5.43 .08 9 5.39 .10

50 - 59 8 5.57 .14 5 5.57 .19 3 5.58 .23

60+ 13 5.40 .08 6 5.25 .06 7 5.52 .11

Zak

Age

< 10 23 5.34 .04 9 5.36 .06 14 5.32 .06

10 - 19 39 5.27 .04 13 5.27 .08 26 5.27 .04

20 - 29 13 5.47 .07 6 5.59 .13 7 5.37 .07

30 - 39 47 5.50 .03 23 5.54 .04 24 5.47 .06

40 - 49 15 5.62 .08 10 5.61 .11 5 5.64 .14

50 - 59 7 5.72 .13 4 5.74 .19 3 5.70 .20

60 + 11 5.54 .07 5 5.41 .05 6 5.66 .11

Weight

Age

< 10 65 3.79 .04 30 3. 79 .07 35 3.79 .05

10 - 19 46 4.57 .04 17 4.61 .06 29 4.55 .05

20 - 29 17 4.98 .04 7 5.12 .04 10 4.88 .04

30 - 39 50 5.07 .03 24 5.21 .02 26 4.95 .04

40 - 49 23 5.05 .03 14 5.10 .04 9 4.96 .04

50 - 59 8 5.10 .06 5 5.19 .04 3 4.94 .05

60+ 13 5.08 .04 6 5.16 .05 7 5.01 .04

51

HeightSexes Pooled Males Females

Age No. Mean SE No. Mean SE No. Mean SE

< 10 57 3.80 .02 26 3.81 .03 31 3.80 .02

10 - 19 46 4.09 .01 17 4.10 .02 29 4.08 .01

20 - 29 17 4.19 .01 7 4.24 .02 10 4.15 .01

30 - 39 49 4.19 .01 24 4.23 .01 25 4.14 .01

40 - 49 21 4.21 .01 14 4.24 .01 7 4.15 .01

50 - 59 7 4.21 .02 5 4.22 .02 2 4.18 .01

60+ 13 4.16 .01 6 4.21 .01 7 4.13 .01

Phospholipid

Age

< 10 34 5.67 .02 17 5.68 .02 17 5.66 .04

10 - 19 47 5.66 .02 17 5.71 .03 30 5.64 .02

20 - 29 18 5.71 .03 8 5.76 .04 10 5.67 .04

30 - 39 52 5.70 .01 24 5.69 .02 28 S.71 .02

40 - 49 24 5.71 .04 15 5.77 .04 9 5.62 .05

50 - 59 8 5.86 .05 5 5.83 .08 3 5.90 .03

60+ 13 5.72 .05 6 5.67 .07 7 5.76 .07

Uric Acid

~

< 10 36 1. 00 .05 16 1.07 .08 20 0.95 .07

10 - 19 46 1.12 .05 17 1.13 .09 29 1.11 .06

20 - 29 18 1.26 .08 8 1.37 .11 10 1.18 .12

30 - 39 52 1. 21 .05 24 1.34 .07 28 1.10 .06

40 - 49 20 1.38 .08 12 1.43 .10 8 1. 31 .12

50 - S9 6 1. 20 .12 4 1. 32 .13 2 0.97 .13

60+ 11 1.30 .11 5 1. 41 .10 6 1. 22 .18

52

HemoglobinSexes Pooled ~1ales Females

Age i~O. Mean SE No . Mean SE No. Mean SE

< 10 39 2.53 . 02 19 2.52 .04 20 2.54 .03

10 - 19 47 2.52 .03 17 2.62 .04 30 2.47 .04

20 - 29 18 2.69 .02 8 2.70 .02 10 2.68 .03

30 - 39 49 2.60 .02 22 2.70 .01 27 2.52 .03

40 - 49 24 2.59 .03 15 2.59 .04 9 2.60 .05

50 - 59 7 2.57 .06 4 2.59 .10 3 2.54 .09

60+ 12 2.58 .04 5 2.67 .04 7 2.51 .OS

Alpha LP

Age

< 10 23 4.29 .03 9 4.31 .05 14 4.28 .04

10 - 19 39 4.31 .02 13 4.28 .05 26 4.32 .02

20 - 29 13 4.32 .03 6 4.34 .04 7 4.32 .04

30 - 39 44 4.28 .02 22 4.25 .02 22 4.32 .03

40 - 49 15 4.34 .03 10 4.33 .03 5 4.36 .06

50 - 59 7 4.26 .02 4 4.24 .02 3 4.29 .04

60+ 11 4.27 .04 5 4.16 .05 6 4.36 .04

Prebeta LP

Age

< 10 23 2.68 .11 9 2.77 .14 14 2.62 .16

10 - 19 39 2.58 .09 13 2.61 .14 26 2.57 .12

20 - 29 13 2.98 .13 6 3.01 .11 7 2.95 .24

30 - 39 45 2.95 .09 22 3.22 .11 23 2.70 .12

40 - 49 15 3.13 .17 10 3.24 .22 5 2.92 .28

50 - 59 7 3.41 .12 4 3.40 .16 3 3.43 .22

60+ 11 3.38 .15 5 3.38 .33 6 3.37 .12

53

lower after that. The curves are almost identical tD'ltil about age 50,

and for higher ages the females have higher diastolic pressure.

However, there are relatively small numbers of individuals in ~le

older ages.

For each sex and for both Sperry and Zak cholesterol, there are

increases with age. Comparing the sexes, for the first two decades of

life the cholesterol levels are very similar in the two sexes; for ages

20 to about 50, the male cholesterol levels are higher, after which

the curves cross, and the females have the higher cholesterol levels.

The relations between cholesterol ester and beta lipoprotein with age

are the same as for cholesterol, which underscores the fact that all

these variables are correlated with each other.

For the first twenty or thirty years of life, there is a steep

increase in both height and weight for both males and females. After

age 30, the heights plateau for both males and females; the rate of

increase in weight decreases for females, and in males, weight

continues to increase tD'ltil about age 40 and then plateaus or perhaps

decreases. After age 20, the males are taller and heavier than the

females.

There is no consistent trend of hemoglobin with age for females.

In males, hemoglobin levels increase tD'ltil age 30, plateau tD'ltil age

40, and then decrease slowly. There is a general tendency for the

hemoglobin levels in males to be higher than those in females.

There is no consistent age effect for phospholipids in males, and

there is a weak linear increase in females. There is also a weak

linear increase of uric acid levels with age in females; however for

males, uric acid levels increase until age 50, after which they seem

S4

to decrease a little. At all age levels, the males have higher uric

acid levels than the females.

Although there is a general tendency for females to have higher

alpha lipoprotein levels, neither males nor females show any trend of

alpha lipoprotein with age. On the other hand, prebeta lipoprotein

levels increase with age in both males and females, with the males

having the higher levels at all ages. The level in males shows a

general increase for almost the entire age span, whereas the level

in females remains relatively constant until about age 20 before

increasing with age afterwards.

In order to quantify the relationships between these variables and

age, and also to have a method available subsequently for adjusting

for age, a regression analysis was done. Let YI'Y2""'Yn and

xl ,x2, ... ,xn denote tile value of the trait and the age, respectively,

for n individuals. The regression equation of Y on X can be

written

[2.3]

where

Y. is the logarithm of the variable for .th individual,11

So is a constant,

X. is the age of the .th individual,11

e. is random variation, assumed distributed NCO, (12),1

131 is the linear regression coefficient of age, and

62 is the quadratic regression coefficient of age.

A least square procedure which minimizes Ley i - 80is used to estimate the regression coefficients 80 , 81

ss

" ,,2 2- 81Xl - 62Xi )

and 82 and

their standard errors. First HO: 61 = 0, 62 = 0 is tested. If

either 61 or 62 is not significantly greater than zero, then that

term is removed fram the regression equation, and then the regression

estimates and the standard errors are computed anew and tested. Table

2.5 gives the linear and quadratic regression coefficients for each

trait by sex. If a particular coefficient is missing, this is to

indicate that that particular coefficient is not significantly different

from zero.

For most traits, there are no sex differences in the degree of

the age relationship. However, there are a few differences;. for

systolic and for diastolic blood pressure, the linear and quadratic

age effects are significant in males while only the linear effect is

significant in females. There is no age effect for hemoglobin levels

in females, but both linear and quadratic effects are significant

in males. The most striking difference is for prebeta LP; for the

males, only the linear effect is significant, and for the females,

only the quadratic effect is significant. For alpha LP, there is no

age effect in either sex.

The total sum of squares about the mean, Ss.T' can be partitioned

into two portions:

sSr [2.4]

where Y is the mean of the Y's

The second term on the right handregression for the .th1 person.

and y.1

is the value of the fitted

56

Table 2.5 Linear and Quadratic Regression Coefficients ofAge for Logarithmic-Transformed Variables by Sex

Variable Sex Linear Quadratic Constant R2(%)

'IeSystolic BP M .01239 -.0001 4.538 48.47

F .00703 4.585 57.66

Diastolic BP M .01720 -.00017 4.036 50.14F .00666 4.153 49.77

Sperry Cholest M .00602 5.113 11. 85F .00651 5.087 13.47

Zale Cholest M .00537 5.319 10.38F .00718 5.216 20.98

Neight ~1 .08117 -.00090 3.486 88.48F .06810 -.00073 3.558 81.64

Height M .026S3 -.00030 3.706 81.82F .02220 -.00026 3.737 72.24

illHemoglobin M .00794 -.00010 2.502 9.66

F .NO AGE EFFECT 2.530

Cholesterol r.1 .00550 4.822 9.91Ester F .00645 4.786 12.79

Phospholipid M i~O AGE EFFECT 5.720F .00186* 5.629 5.03

Uric Acid M .00731 1. 059 12.74F .00413* 0.997 4.05

Alpha LP M ~O AGE EFFECT 4.28F ~O AGE EFFECT 4.32

Beta LP M .00712* 4.668 7.88F .00950 4.526 16.46

Prebeta LP M .01514 2.588 19.98F .00018 2.577 12.41

Note: All linear and quadratic regression coefficient estimates

are significant at the 1% level, with the exception ofthose marked with 'Ie which are significant at the 5%level.

57

side of equation 2.4 is the sum of squares of the deviations of the

points Y on the fitted line from their mean. This quantity is denoted

by SSR' the sum of squares attributable to regression. A statistic

R2 can be computed using the formula

2R = SSR/SST [2.5]

R2 indicates how much of the total variation in Y can be accounted

for by the fitted regression. In the case of traits and ages, if R2

is large, this indicates that much of the variation of the trait can

be explained by age.

Age accounts for 80-90% and 70-80% of the total variation for

weight and height, respectively. For systolic and diastolic blood

pressure, R2 is 48-58%; for Sperry or Zak cholesterol, cllo1estero1

ester, and beta lipoprotein, age explains 8-20% of the total variation.

Except for prebeta LP (R2 is 12-20%), the other remaining traits

have low R2.

2.2.3 Skewness and Kurtosis

At the beginning of section 2.2, there was a brief discussion of

the necessity for finding a transformation which made the distribution

of the trait more like the normal distribution. It was noted that the

logarithmic transformation was one commonly use for this purpose.

Furthermore, two statistics, skewness (gl) and kurtosis (g2) , could be

used to measure departures from normality (equation 2.1 and 2.2) .

In section 2.2.2, it was shown that many of the traits have

significant age effects. The question arises: what, if the traits

are adjusted for age, would the distributions look like? Would they

58

be more like the normal distribution? Consequently, the traits were'" A "2

age adjusted by computing the residuals, Yi - 60 - 61Xi - 82XiA A A

where 60, 61, and 62 are the regression coefficients tabulated

in Table 2.5.

The skewness and kurtosis of the distribution of the resulting

residuals are exhibited in Table 2.6. For every trait except

hemoglobin, the skewness either disappears or is at least reduced.

After taking logari thIns and age adj usting, only hemoglobin and prebeta

lipoprotein still have significant kurtosis. Thus, except for

hemoglobin, doing a logarithmic transformation causes the distribution

to become more like the normal distribution. For the exception

hemoglobin, the distribution of the original values looks unusual:

Hemoglobin Numbe r 0f cases %

< 9 2 1.0

9 - 9.9 10 5.1

10 - 10.9 10 5.1

11 - 11.9 23 11.7

12 - 12.9 4 2.0

13 - 13.9 41 20.9

14 - 14.9 44 22.4

15 - 15.9 16 8.2

16 - 16.9 46 23.5

Total 196 100.0

After trying several transformations, it was f01.IDd that the square

transformation reduced the skewness froTIl -0.62 to -0.35, the

latter value being significant at the 5% level.

59

Table 2.6 Skewness and Kurtosis for Original and Age-Adjusted Logarithmic-Transformed Variables

. Number ofTrait Scale Individuals Skewness Kurtosis

** **Blood Pressure Original 188 0.93 1. 53(Systolic) Log(age-adj.) 0.59** 0.56

*Blood Pressure Original 188 0.38 0.41(Diastolic) Log(age-adj.) 0.07 0.26

** **Cholesterol Original 200 1.64 2.79(Sperry) Log(age-adj.) 0.84** 0.57

** **Cholesterol Original 155 1. 39 1.45(Zak) Log(age-adj.) 0.82** 0.21

**Weight Original 222 -0.02 -1.14Log(age-adj.) 0.23 -0.26

**Height Original 210 -0.78 -0.30Log(age-adj.) -0.19 0.51

**Hemoglobin Original 196 -0.62 -0.39Log(age-adj.) -1. 01** 1.18**

** **Cholesterol Original 199 1.67 2.98Ester Log(age-adj.) 0.83** 0.63

**Phospholipid Original 196 0.50 0.19Log(age-adj.) 0.11 -0.35

**Uric Original 189 0.87 0.36Acid Log(age-adj .) 0.11 -0.49

** *Alpha LP Original 152 0.73 0.86Log(age-adj.) 0.35 0.40

** **Beta LP Original 152 1. 53 1. 73Log(age-adj.) 0.69** 0.03

** **Prebeta LP Original 153 1. 78 5.17Log(age-adj.) -0.47* 1.20**

Test results are lndicated by

Not significant* 0.01 < P < 0.05** P < 0.01

60

Z.Z.4 Inter-trait Correlations

Assuming that two traits, Xl and XZ' have a bivariate nonnal

distribution, a measure of the linear relationship between them is

the correlation coefficient

[Z.6J

with the sample est~late r l2 calculated from

[Z.7]

A test of HO: p(XI , XZ) = 0 vs. ~II: p(XI , XZ) ~ 0 can be done by

computing r lZ and referring to standard tables. An alternative pro

vided by Fisher is to transfonn r lZ to a quantity Z

and Z is distributed almost N(~ in i~~

[2.8]

-!-). n is the sample size.n-3 '

Suppose there are three variables Xl' XZ' and X3, and it was

required to compute the correlation between Xl and Xz in a cross

section of individuals all having the same value of variable X3• A

partial correlation coefficient P12. 3 measures the part of the correla

tion between Xl and Xz that is not simply due to their relationships with

X3• The sample estimate can be computed fromr 12 - r 13 r 23r = [2.9]

12.3 L 2 2Y'(1-r13 ) (1-r23 )

This can be referred to standard tables. An alternative way to compute

61

r12 . 3 is to regress Xl on X3 and X2 on X3, and then compute

the correlation coefficient on the residuals (Xl - Xl) and (XZ - X2).

The two methods lead to identical results.

If there are four variables Xl' X2, X3, and X4, PlZ.34

measures the correlation between Xr and Xz while holding X3 and

X4 at fixed levels. The sample estimate can be calculated

r 1Z . 34 =2 2

1(1 - r14 .3 )(1 - r Z4 .3 )

[2.10]

This can be referred to standard tables (Snedecor and Cochran 1967).

Tables 2.7 and 2.8 display the correlation coefficients for 78

pairs of traits, and many significant correlations are noted. However

a significant correlation may not mean that two variables are directly

related to one another. As section 2.2.2 showed, many of the variables

are related to age so that perhaps the significant correlation between

traits is due to each trait being correlated with age. Partial

correlation coefficients are computed, adjusting for age and ageZ.

Sperry and Zak cholesterol values are highly correlated (0.94)

which is not W1expected since the two laboratory procedures are

different measurements of the same thing. The two are significantly

correlated with both systolic blood pressure (0.30, 0.31 respectively)

and diastolic blood pressure (0.22, 0.23), but not to either weight

or height; any correlation between cholesterol and weight or height

disappears after adjusting for age. Sperry and Zak cholesterol are

highly correlated with cholesterol ester (0.99, 0.92), phospholipid

(0.31,0.31), beta-lipoprotein (0.92,0.96), and prebeta lipoprotein

(0.28,0.32). The high correlation with beta lipoprotein is also to

62

Table 2.7 Correlation and Partial Correlation CoefficientsBetween Logaritlunic-Transformed VariablesInvolving Systolic and Diastolic Blood PressuresmId Sperry and Zak Cholesterol

Partial CorrelationVariables H Correlation Partialling Age, Age2

Systolic BP andDiastolic BP 188 .80** .63**Sperry Cholesterol 181 .45** .30**Zak Cholesterol 145 .49** .31**Weight 181 .65** .24**Height 177 .59** .18**Hemoglobin 177 .14* .04Cholesterol Ester 181 .44** .30**Phospholipid 180 .16* .04uric Acid 172 .29** .12Alpha LP 142 -.01 .02Beta LP 142 .45** .29**Prebeta LP 143 .42** .21**

Diastolic BP andSperry Cholesterol 181 .40** .22**Zak Cholesterol 145 .44** .23**Weight 181 .66** .17*Height 177 .59** .07Hemoglobin 177 .19** .10Cholesterol Ester 181 .39** .23**Phospholipid 180 .20** .10Uric Acid 172 .27** .08Alpha LP 142 .01 .04Beta LP 142 .38** .18**Prebeta LP 143 .38** .18**

Sperry Cholesterol andZak Cholesterol 15S .94** .94**Neight 192 .31** .01Height 187 .23** -.10Hemoglobin 193 .15* .10Cholesterol Ester 199 .99** .99**Phospholipid 196 .35** .31**Uric Acid 189 .11 .00Alpha LP 152 .04 .06Beta LP 152 .93** .92**Prebeta LP 153 .38** .28**

Table 2. 7 Con 't

63

Partial Correlation

Variables N Correlation Partialling Age, Age2

Zak Cholesterol andWeight 149 .35** -.02Height 146 .25** -.17*Hemoglobin 150 .14* .08Cholesterol Ester 155 .92** .91**Phospholipid 155 .35** .31**Uric Acid ISO .13 .00Alpha LP 152 .04 .06Beta LP 152 .97** .96**Prebeta.LP 153 .42** .32**

Significance levels indicated by

** P < 0.01

* 0.01 < P < 0.05

64

Table 2.8 Correlation and Partial Correlation CoefficientsBetween Logarithmic-Transformed Variables

\'11' hi HE~D Q-:lOE PHOS UA ALP BETA PREB

WT .95** .24** .31** .14* .35** -.05 .31** .31**

HT .82** .23** .23** .10 .35** -.01 .24** .23**

HEM) .17* .12 .14* .14* .03 -.01 .15* .12

CHOE .04 -.07 .09 .33** .12 .08 .91** .35**

PHOS -.02 -.07 .12 .29** - .13* -.01 .34** .15*

UA .19** .20** -.03 .02 -.20** -.11 .13 .14*

ALP -.n -.01 -.01 .09 .00 - .10 -.02 - .19**

BETA -.02 - .11 .09 .90** .30** .02 -.01 .29**

PREB .01 -.08 .07 .25** .08 .04 -.19**.18*

Number of pairs in each cell range from 143 to 209 .

Common Correlation Coefficients are Above the Diagonal.

Partial Correlation Coefficients (Partialling Age, Age 2) are

Be low the Diagonal .

\IT =Weight

HT = Height

HEM) = Hemoglobin

CHOE = Cholesterol Ester

PHOS = Phospholipid

UA = Uric Acid

ALP = Alpha LP

BETA = Beta LP

PREB = Prebeta LP

Si~lificance levels are indicated by

** P < 0.01

* 0.01 < P < 0.05

65

be expected since beta lipoproteins transport about 75% of the

cholesterol in the serum.

Systolic and diastolic blood pressure are correlated (0.63).

Systolic blood pressure is significantly correlated with weight (0.24)

and lleight (0.18), but diastolic pressure is not. In addition to being

associated with Sperry and Zak cholesterol as mentioned above, both

systolic and diastolic pressures are correlated with cholesterol ester

(0.30 and 0.23 , respectively), beta lipoprotein (0.29, 0.18), and

prebeta lipoprotein (0.21,0.18). There is no significant correlation

with phospholipid, uric acid, hemoglobin, or alpha lipoprotein

especially after age adjusting.

Indeed, except for a weak negative correlation between alpha and

prebeta liproprotein (-0.19) and a weak positive correlation between

weight and hemoglobin (0.17), neither alpha lipoprotein nor hemoglobin

are significantly related to any of the other traits. Uric acid is

significantly correlated only with weigllt (0.19), height (0.20), and

phospholipid (- 0.20) .

Except for the correlations already mentioned, neither height nor

weight is significantly related to any other trait; in particular,

neither is correlated with any of the "cholesterol-related" traits, such

as Sperry and Zak cholesterol, cholesterol ester, phospholipid, and

beta LP.

Cholesterol :esters and beta lipoprotein are highly correlated

(0.90). i'Jeither is significantly correlated with weight, height,

hemoglobin, uric acid, and alpha LP; both are significantly correlated

with systolic and diastolic pressures, Sperry and Zak cholesterol,

phospholipid, and prebeta LP.

66

With most of the pairs of traits, there are no significant

differences in the partial correlation coefficients by sex. There are

15 pairs of traits which have heterogeneous correlations by sex. For

almost all of these pairs, the difference is not substantial; the

correlations for the two sexes have the same sign but one is significant,

and the other one is not. There is a big difference only for the

correlation between hemoglobin and alpha LP; for males, the correlation

is significant and negative (-0.21), and for females, it is significant

and positive (0.22).

This last part of the descriptive study, in addition to providing

an initial estimate of the correlations for bivariate pedigree analyses,

will be helpful in interpreting the results. For example, if a

monogenic model were to fit the cholesterol data, then the fact that

cholesterol, cllolesterol ester, and beta LP pairwise have high corre

lations may suggest the existence of pleiotropic gene effects.

2.3 Random ~~ting

In this kindred, there are 65 pairs of spouses. For the sub

sequent genetic analysis, it would be both interesting and important

to determine with respect to which traits there is random mating and

which traits there is assortative mating. Assortative mating means

that mated pairs are more alike (positive assortative mating) or more dis

similar (negative assortative mating) for some phenotypic trait than

would be expected if they were picked at random from the population.

A measure of assortative mating is the inter-spouse correlation, which

is listed for the traits in Table 2.9.

There is a high inter-spouse correlation with respect to age (0.971);

people generally marry another person of a similar age. Because of this,

Table 2.9 Inter-spouse Correlations of Age-adjustedLogarithmic-Transformed Variables

Variable Ntunber of Pairs Correlation

Age (Original Scale) 49 0.97**

Weight 46 0.3J'

Height 44 0.58**

Systolic BP 49 0.44**

Diastolic BP 49 0.22

Sperry Cholesterol 49 0.22

Zak Ololesterol 38 0.41**

Hemoglobin 44 -0.12

Ololesterol Ester 49 0.21

Phospholipid 49 0.38**

Uric Acid 41 0.55**

Alpha LP 36 0.24

Beta LP 36 0.46**

Prebeta LP 37 O.ll

67

uS

it is essential to adjust the traits for age before computing the inter

spouse correlations; otherwise spurious results may be obtained due to

the effect of age on the traits. The method of age adjustr:lent is to

compute Y. - Y. where Y1' is the fitted regress ion of the tra it Y

1 12on age and age ; these residuals are used in calculating the correlations.

There is strong assortative mating for height (0.58) and somewhat

weaker for weight (0.33). There is an unexpected high correlation

between spouses for uric acid. However, Table 2.7 has shown that uric

acid and height are correlated, and assortative mating for height has

been noted. Likewise, assortative mating exists for systolic blood

pressure, Zak cholesterol, phospholipid, and beta LP, but not for the

other variables, specifically diastolic blood pressure, Sperry

cholesterol, hemoglobin, cholesterol ester, alpha LP, and prebeta LP.

The effect of assortative mating will be discussed at the end of

Chapter IV.

QiAPTER III

reSCRIPTION OF lliE PEDIGREE

The kindred described in Chapter 2 forms a pedigree which spans

five generations. The pedigree is displayed in Figure 3.1. For the

purposes of the subsequent genetic analyses, the pedigree has been

split into two simple pedigrees each originating with a pair of parents.

The two co~onent pedigrees have been labeled the "Right Pedigree"

(Figure 3.1a) and the "Left Pedigree" (Figure 3.1b).

3.1 The Pedigree Structure

The co~onent pedigrees as displayed in Figure 3.1 include the

284 persons examined in 1947 and/or 1958, along with fifteen persons

who, although never examined, are included to complete the

structure. For example, if the parents of examined children are

omitted because they were never observed, the pedigree structure would

look bizarre and inco~lete. Persons who were never observed are

indicated in Figure 3.1 by completely darkened symbols, and those who

were examined only in 1947 are indicated by a semi-darkened symbols.

Those who are unmarked, the 235 individuals examined in 1958, will be

the subjects of the genetic analyses. Measures on several traits for

these 235 individuals are listed in Appendix 4.

A reference to a specific member of the pedigree is accomplished

using a combination of a letter, a Roman numeral, and an Arabic ntunber.

The letter, either "R" or "L", specifies whether it is the Right or

I

"III

IV

V

I ,

", ••• ,' •• "auH .... IIJ1 • • II .. " ..... ', •••• .II. ..

KEY•• NEVER OBSERVED

II () OBSERVED IN '947 ONL Y

..,1 ••• .," ........ " .... """14

III

IV

V •••,U ••••• ~ •• n •• MnnnunNnN~•• aaMM." •• ......... n •••• ...... ...

e

Figure 3.1a Right Pedigree

e e

'-Jo

).

~0

o "~ ~ct: ..

:I

~ :t'" -ClQ Qo ~~ ct:::. ~

). ~ ~

~ ee.e

::

(1)(1)!-<eo.....~(1)

0..

+oJ4-l

~

:::I ..0

~

l"")

(1)!-<::3eo.....~

71

72

Left pedigree; the Roman numeral specifies the particular generation,

and the Arabic number locates the specific person in that generation.

Thus, R IV 35 denotes the 35th person of the fourth generation in the

Right Pedigree.

The two pedigrees are each centered around a large sibship. The

Left Pedigree is centered around a sibship of ten which comprise the

third generation. The Right Pedigree is centered around a sibship of

twelve which dominates the third generation. The parents of eadl of

these sibships, L II 1-2 and R II 3-4, respectively, came to this

country from Alsace-Lorraine, first settling in Illinois before moving

to Michigan in about 1923. Of the 235 individuals examined in 1958,

only R II 3 was born outside the United States. Although the families

are closely knit, there is no consanguinity in this kindred (Epstein,

Block, et al 1959). There is one pair of dizygotic twins (L V 7 and 8)

which, for the purposes of the analyses, will be treated as ordinary

sibs. As was mentioned in Chapter 2, included in this kindred are

several large sibships; there is one with fifteen sibs (R IV 1-22),

one with twelve sibs (R III 3-23), two with ten sibs each (R IV 73-88

and L III 1-18), and one with nine sibs (R IV 55-71).

The two pedigrees can be joined at two places; there are four

persons who are COIIDnon to both pedigrees. R II I 3, 4 are the same as

L III 19, 18, and R III IS, 16 are the same as L III 8, 7. It should

be noted that the children and grandchildren of R II I 3, 4 and R I II

IS, 16 could also be included in the Left Pedigree. However, the

strategy adopted is to maximize the Right Pedigree and use it as the

main focus for the search for a maj or gene. Then the search will be

repeated on the Left Pedigree to dleck for consistency of results.

73

Finally, both pedigrees will be analyzed, treating the component

pedigrees as independent pedigrees; this should be a reasonable

approximation since the two pedigrees have only [our persons in common.

In preparation to doing genetic analysis on the pedigree, the

data set must be in a form such that the pedigree structure can be

reconstructed quickly. First, each individual in the pedigree is

given a sequence number. Then, for each individual, a record on disk

is created containing the following:

1. Sequence number of the individual.

2. Sequence number of the spouse, if any.

3. Sequence number of the next sib, if any.

4. Sequence number of the first child, if any.

5. The number of children.

6. Sequence number of the individual's father.

7. Sequence number of the individual's mother.

8. Sex

9. Age

10. Measures on traits.

3.2 Age Distribution in the Two Pedigrees

Table 3.1 shows the age distribution in the two pedigrees by sex,

and the mean ages are tabulated in Table 3.2. The tables point out that

those in the Left Pedigree are older than those in Right Pedigree. Al though

the males in the Lp.ft Pedigree are, on the average, 3.6 years older than

those in the Right Pedigree, the big difference is in the females;

those in the Left Pedigree are, on the average, almost fifteen years

older. There are no females ,in the Left Pedigree under 10 years of

age, while females in this age group constitute 35% of the females in

Table 3.1 Age Distribution by Sex and Pedigree

Right Pedigree

Males Females Total

Age No. '% No. % No. %

< 10 28 33.7 35 35.0 63 34.4

10 - 19 13 15.7 25 25.0 38 20.8

20 - 29 4 4.8 6 6.0 10 5.5

30 - 39 21 25.3 22 22.0 43 23.5

40 - 49 9 10.8 5 5.0 14 7.6

50 - 59 3 3.6 2 2.0 5 2.7

60+ 5 6.0 5 5.0 10 5.5

Total 83 100.0 100 100.0 183 100.0

Left Pedigree

Age

< 10 7 23.3 0 0.0 7 12.5

10 - 19 ,.. 16.7 7 26.9 12 21.4;)

20 - 29 3 10.0 3 11.5 6 10.7

30 - 39 4 13.3 7 26.9 11 19.6

40 - 49 6 20.0 4 15.4 10 17.8

SO - S9 3 10.0 1 3.8 4 7.1

60+ 2 6.7 4 15.4 6 10.7

Total 30 100.0 26 100.0 56 100.0

74

Table 3.2 Mean Age by Sex and Pedigree

TestRight Left Pedigree

Sex Pedigree Pedigree Differences

No. 83 30

Males Mean 24.3 27.9 NS

SE 2.1 3.7

75

Females

Pooled

TestSexDifferences

No.

Mean

SE

No.

Mean

SE

100

20.8

1.7

183

22.3

1.3

NS

26

35.6

3.4

56

31.5

2.6

NS

**

**

Significance levels are indicated by

NS Not significant

** p < 0.01

76

the Right Pedigree. These results LUlderscore the importance of

adj usting for age.

Not everyone had measurements made for all the traits. Table 3.3

shows the availability of data for the four main traits along with

height and weight.

3.3 Fitting a Mixture of Normal Distributions

Prior to the pedigree analysis, and to obtain initial estimates

for it, analyses were done to determine if a mixture of two or three

normal distributions would fit the data for each of the four main

traits significantly better than one distribution. In addition, for

certain pairs of traits, mixtures of bivariate normal distributions

were fitted.

If a mixture of two or three distributions were to fit the data

significantly better than one distribution, it would be attractive to

think of each distribution as corresponding to the distribution of

phenotypes for a specific genotype. In fact, during the pedigree

analysis, we assume that the conditional probability density flIDction2(pdf) of observing phenotype x, given genotype u, is N(].1u' 0 ) ,

where the initial estimates of].1u and 02 are obtained from the curve

fitting in this section.I

Analogous to sections 2.2.3 and 2.2.4, let Y. be the measure1

of the trait (in the original scale) for the i th individual. Then,

the age-adjusted natural logarithmic transformed trait value for the

i th individual in the pedigree is

I

Y. = .en Y.1 1

i = I, ... ,n

77

Table 3.3 Availability of Data for Six Traits by Sexand Pedigree

Pedigree

Left Right BothM F Total M F Total M F Total

Observedin 1958 30 26 - S6 83 100 183 ·111 124 235

BloodPressure

SperryCholesterol

ZakCholesterol

Weight

Height

23 25

24 26

18 22

26 23

25 23

48

50

40

49

48

69 75 144

71 83 154

54 65 119

79 98 177

76 90 166

90 98 188

93 107 200

70 85 ISS

103 119 222

99 III 210

78

where X. is the age of the i th individual and1

81, 82 are the appropriate estimated regression

coefficients from Table 2.5.

The curve-fitting will be performed on the Y..1

3.3.1 Univariate Log-Normal Distributions

Ignoring the genetic relationships, Y., (i = 1,2, ... ,n), can be1

assumed to be independent and identically distributed random variables

with p.d.f. g(y). The p.d.f. of a mixture of k distributions from

a conmon family of distributions f(y;!V can be written as

g(y) k= '. 1 a.f(y; e.)l.J= J .....J[3.1]

The admixture proportions a., j = 1,2, ... , k-lJ

j = 1,2, .•. ,k will be estimated using thee. ,.....J

maximum likelihood procedure. There is no difficulty in allowing

kwhere '. 1 a. = 1.l.J= J

and the parameters

g(y) to be a mixture of normal distributions so that

12 -1 ( 2 2)fey; ~j)= (2no ) exp -(y - ~j) /20 .

A cormnon variance 02 will be used rather than a different variance

for each component distribution; this is because assuming different

variances leads to singularities on the likelihood surface, e.g. the

estimation procedure may collapse one of the components to a single

observation, resulting in the variance for that component being zero.

For a mixture of k normals, the likelihood is written:

nL = IT

i=l

79

[3.2]

The likelihood is written as a fWlction of 2k fWlctionally inde

pendent parameters: k means, (k-l) independent admixture

proportions, and a conunon variance. These 2k parameters will be

simultaneously estimated using a maximum likelihood subroutine package

~IK devised by Kaplan and Elston (1972). We are interested in

testing HO: data consist of k component distributions versus HI:

k' components (k' > k). The test will be based on the likelihood

ratio criterion. Wolfe (1971) investigated the distribution of

-2 in A where A = y~, and where ~ and Lk , are the likeli

hoods Wlder HO and HI' respectively. After discovering that the

distribution of -2 in A did not sufficiently match the usual X2

approximation, his Monte Carlo investigations suggest that - 2 C

in(Lk/~') is approximately distributed as a x2 with 2m(k' - k)

degrees of freedom, where

C = [n - I - (~k')/2]/n,

m = number of variables,

k' = number of components Wlder HI' and

n = sample size.

It should be noted that, for any reasonably large n, C will be near

1 in value; in the present study, C ranges from 0.936 to 0.975. The

effect of letting C = I will be an anti-conservative test, i.e. the

actual significance level is larger than the nominal one.

For k components, the parameters are (~l""'~k' 0, al , ••• ,

ak-l) which will be estimated using ML procedures.

80

3.3.1.1 Systolic Blood Pressure

The maximum likelihood estimates (MLE) of the parameters for a

mixture of k univariate log-normal distributions for systolic blood

pressure are given in Table 3.4a for males and Table 3.4b for females.

Figure 3.Za and 3.Zb show the empirical and theoretical cumulative

plots for males and for females, respectively.

To test HO: k = 1 vs HI: k' = 2 or HO: k = Z vs HI: k'= 3,

-ZC in Ll/LZ and -ZC in LZ/L3, respectively, should be compared

to a X2 with 2 d.f.

For males, neither a mixture of two nor a mixture of three dis-

tributions fits the systolic blood pressure data better than one

distribution while for females, a mixture of two distributions fits the

data significantly better (0.01 < P < 0.05) than only one distribu

tion. Almost 19% of the females are in the higher distribution.

3.3.1.2 Diastolic Blood Pressure

The MLE of the parameters are given in Table 3.4c for males and

Table 3Ad for females. Figures 3. 2c and 3. 2d show the empirical and

the theoretical cumulative plots of diastolic blood pressure for males

and for females, respectively. For neither the males nor the females

is the hypothesis of one log-normal distribution fitting the diastolic

blood pressure data rejected; a mixture of two distributions does not

fit significantly better than one alone.

3.3.1.3 Sperry Ololesterol

The MLE of the parameters are given in Table 3.4e and Table 3.4f

while the cumulative plots of Sperry cholesterol levels for males and

females are shown in Figures 3.2e and 3.Zf, respectively.

81

Table 3.4 Maximum Likelihood Estimates of the Parametersfor a Mixture of Univariate Log Normal Distributions

a. Trait: Systolic Blood Pressure - Males

Number of Distributions

Parameters One Two Three

4.537

4.698

4.982

0.06

0.485

0.491

0.024(2)

4.70(4)

0.10

4.618

4.957

0.10

0.975

0.OZ5(1)

5..04(3)

0.08

4.627

0.11

1.00

cr

(Xl

(Xz

(X3

xZ (Z d. f.)

Signif. level

(i) S.E. = 0.03

(2) S.E. = O.OZ

(3) Tests that a mixture of two distributions fits better than one.

(4) Tests that a mixture of three distributions fits better than two.

b. Trait: Systolic Blood Pressure - Females

Parameters

1J1 4.593 4.560 4.533

1JZ 4.740' 4.665

113 4.81Z

cr 0.10 0.07 0.06

(Xl 1.000 0.81Z 0.618

(Xz 0.188 0.314

(X3 0.068(1)Z 8.37 2.64X (Z d.f.)

Signif. level 0.02 >0.10

(1) S.E. = 0.05

82

1.0r-----.....-----...-;:~==--_"T

0.5

O-=:;; L.- --'

5.05 4.35 4.58 4.81 5.05LN (SYSTOLIC BPI

4.814.58

4.5B 4.81 5.05

LN (SYSTOLIC BPI

THREEDISTRIBUTIONS

ONEDISTRIBUTION

1.0 r-----.....-----,....:;::;II-OZ-~

0.5

0.5

zo~

::lCD

a:~en(5w>~

«.J::l~::l(J

Figure 3.2 Empirical and Theoretical Cumulative PlotsAfter Fitting a Mixture of Log NonnalDistributions

a. Trait: Systolic Blood Pressure - Males

1.0r-----...-----,.-~::::=o...__,

83

, .Or-----...,.-----,...----:::::P-'"'t

4.60 4.715 4.90

LN (SYSTOLIC BPI

zo~

:JCD

II:~

!!!cw>~c(.J:J~:Jo

0.5

ONEDISTRIBUTION

4.60 4.75 4.90

0.5

4.60 4.75

LN (SYSTOLIC BPI

4.90

Figure 3.2b Trait: Systolic Blood Pressure- Females

Table 3.4

e. Trait: Diastolic Blood Pressure - ~~les


84

Parameters

o

One

4.107

0.12

1.000

0.72

Two

4.100

4.361

0.12

0.974

0.026(1)

0.84

Three

3.977

4.137

4.377

0.09

0.247(2)

0.715

0.038(3)

(1) S.E. = 0.06

(2) S.E. = 0.16

(3) S.E. = 0.04

d. Trait: Diastolic Blood Pressure - Females

Parameters

)..11 4.161 3.890 3.889

)..IZ 4.165 4.160

)..13 4.347

0 0.11 0.11 0.10

CL1 1.000 0.014(1) 0.017(1)

CL Z 0.986 0.953

CL3 0.030(Z)2 0.62 0.32X (Z d.L)

(1) S.E. = 0.03

(2) S.E. = 0.11

85

1.0r-----...-----""T"-:"""'"""":::;::;,..-...

TWODISTRIBUTIONS

1.0 ,..------r------.----:::::;r--t

0.5

O~~__--:,,:- --:,,= ~

4.150 J.715 4.0 4.25 4.50

LN (DIASTOLIC BP)

4.215

ONEDISTRIBUTION

0.5

0.5

0_.::;;; .... &.- ....

J.75 4.0 4.25 4.150

LN (DIASTOLIC BPI

z2~;)lD

a::~

'"a 3.75 4.0w 1.0>~

~ THREEi DISTRI BUTIONS

;)U

Figure 3.2c Trait: Diastolic Blood Pressure - Males

86

0.5

0.5

TWODISTRIBUTIONS

O__-=~""';'---, ....J. :-:'

3.80 4.03 4.26 4.150LN (DIASTOLIC BP)

1.,.r-------r----,..---~-..,

0.15

THREEDISTRIBUTIONS

O~...~~_--:~ --:~:--__--:~

3.BO 4.03 4.28 4.150LN (DIASTOLIC BPI

ONEDISTRIBUTION

1.0..-----.......-----,..--:-:P_~

zoI::>III

a:I-'a 0 ..._:!!::::__..,...,."... --:~:__---~

a 3.80 4.28 4.150

~ 1.0 .-----~-----.----~-~

IotoJ::>~::>(J

Figure 3.2d Trait: Diastolic Blood Pressure - Females

87

Table 3.4

e. Trait: Sperry Cholesterol - Males

Number of DistributionsParameters One Two Three

lJ1 5.124 5.041 4.861

lJ2 5.673 5.149

lJ3 5.697

a 0.29 0.19 0.13

a ' 1.000 0.868 0.3271

a2 0.132 0.546

Cx3 0.127

x2(2 d.f.) 16.46 2.78

Signif. level < .001 >.10

f. Trait: Sperry Cholesterol - Females

Parameters

lJ1 5.091 4.988 4.693

lJ2 5.581 4.999

lJ3 5.584

a 0.28 0.17 0.16

a1 1.000 0.827 0.031 (1)

a2 0.173 0.795

a3 0.1732 23.62 0.84X (2 d.f.)

Signif. level <.001 >.10

(1) S.E. = 0.07

88

6.004.95 5.30 5.65

LN (SPERRY CHOLESTEROL)

1.0~----,-----r---""""--=""

0.5

6.00

0.5

0.5

4.95 5.30 5.65 6.00LN (SPERRY CHOLESTEROL)

zQI::JlD

a:I~C

w r-----.,.-----.,...----r---::::=_~> 1.0le(..J::J~::J(J

Figure 3.2e Trait: Sperry Cholesterol - Males

89

O .....:::::.....:.....__~ -'- --J

4.5 5.0 5.5 6.0LN (SPERRY CHOLESTEROL)

1.0.-------,------r----:;=_-+

0.5

6.05.5

0.5

5.0 5.5 8.0


1.0

zo~

::JCD

a:~

~oW r-----...,...-----,.....-...,.,-o:z:oo-.....> 1.0~c(..J::J~:JU

Figure 3.2£ Trait: Sperry Cholesterol - Females

90

The results are consistent in the two sexes. For each sex, a

mLxture of two distributions fits the Sperry cholesterol data signifi-

cantly better than one distribution. There is almost the same

proportion of each sex in the higher distribution, 13% of males and

17% of females.

3.3.1.4 Zak Cholesterol

The maximum likelihood estimates of the parameters are given in

Table 3.4g for males and Table 3.4h for females, and the empirical and

the theoretical cumulative plots of Zak cholesterol for males and for

females are shown in Figure 3. 2g . As in the case wi th Sperry

cholesterol, the results with Zak cholesterol are consistent in the

two sexes. For each sex, a mixture of two log-normals fits the Zak

cholesterol data significantly better than one distribution, but a

mixture of three distributions does not fit significantly better than

two. In addi tion, there is a.lmos t the same proportion of individuals

of each sex in the higher distribution, 19% of males and 13% of females.

It is reassuring to know that two laboratory procedures whidl are

supposedly measuring the same serum cholesterol levels can produce

such consistent results.

3.3.2 Bivariate Log-Normal Distributions

In examining pairs of traits, analyses were done to determine if

a mixture of two or three bivariate log-normal distributions would fit

the data better than one distribution. Analogous to section 3.3.1,

the Y. are now random vectors of two random variables corresponding

to th:'two traits. Xi" (~~]. The :i' (i = 1•.•.•nl. can be

assumed to be independent and identically distributed random vectors

with p.d.f. gel)' The p.d.f. of a mixture of k distributions

91

Table 3.4

g. Trait: Zak Cholesterol - Males

Number of DistributionsParameters One Two Three

lJ l 5.335 5.229 5.099

lJ2 ·5.774 5.357

lJ3 5.807

a 0.26 0.16 0.10

CL1 1.000 0.806 0.390

CL2 0.194 0.436

CL3 0.1742 .

16.66 5.84X (2 d.L)

Si~nif. level <.001 .05

h. Trait: Zak Cholesterol - Females

Parameters

lJ1 5.219 5.147 5.045

lJZ 5.699 5.239

lJ3 5.718

a 0.24 0.15 0.12

CL1 1.000 0.870 0.407(1)

CLZ 0.130 0.471(1)

CL30.123

2 20.40 1.34X (2 d.L)Signif. level <.001 >.10

(1) S.E. = 0.22

92

MAL.E

THAEEOISTAIBUTIONS

1.0..-------r------.......----'::l"'·

0.5

O-'::::~ """' ~ --I

5.60 5.e5 4.eO 5.25 5.60 5.i5L.N (ZAK CHOL.ESTEAOL.)

1.0

Z TWO0 OISTAIBUTIONS~

:lIII

a::~III

00.5w

>~c(..J:l~;)U

04.90 5.25

FEMAL.E

1.o------.......-----r---~=_ 1.0r------.,.------r-----:"'::::--+

z2~

:lCD

a::~

~ow 0.5>~c(..J:l~:lU

IU5

0.5

0-=;;.-. ....... -"'-- -'

5.55 !l.eo 4.75 !l.1!l 5.55 5.90

L.N (ZAK CHOL.ESTEAOL.l

Figure 3.2g Trait: Zak Cholesterol

93

from a common family of distributions f(l;~) can be written as

g (v) = \~ 1 a· fey; 8)."" L. J= J ...-

[3.3]

There is no difficulty in allowing gel) to be a mixture of bivariate

normal distributions so that

for j = 1,2, and 3,

[3.4]

(~l' Jwhere ,l,I. = JJ ~2j

and

The correlation between Y1 and Y2 is p, and 0'1 and 0'2 are the

standard deviations for Y1 and Y2' respectively. A common variance

covariance matrix L will be assumed for the component distributions.

For a sample of n independent random vectors X with p.d.f.

given in equat ion 3. 3, the likelihood funct ion is wri tten

n kL = n L· 1 a. f (l·; ij., L ) .

i-I J= J 1 J[3.5]

It can be written as a function of 3k + 2 parameters: 2k

means, k-l admixture proportions, the common variance for each variate

along with acoounon correlation between variates. The 3k + 2 para

meters will be estimated simultaneously using MAXL1K (Kaplan and

Elston 1972). Wolfe's test (Wolfe 1971), as described in section

94

3.3.1, can be used to test 110: data consist of k component distri

butions versus HI: k' components (k' > k). Again wi th A = Lk/Lk"

-2C in A is approximately distributed as a x2 with 2m(k'-k)

degrees of freedom.

3.3.2.1 Blood Pressure

After age-adjusting the natural logarithmic-transformed systolic

and diastolic blood pressure, an attempt is made to fit a mixture of

bivariate log-normals to the data. Tables 3. Sa and. 3. Sb show the maxirrn..un

likelihood estimates of the parameters for males and females,

respectively.

For males, a mixture of two bivariate distributions does not fit

the systolic and diastolic blood pressure data significantly better

than one distribution. However, for females a mixture of two distri-

butions fits the data significantly better than one distribution, but

a mixture of three distributions does not fit significantly better than

two. There are about 20% of the females in the higher distribution,

and comparing these means, common standard deviations, and proportions

with those in Tables 3.4b and 3.4d, the local maximum for the bivariate

distributions corresponds more to the systolic blood pressure than to

the diastolic blood pressure univariate values. Therefore, if there

is a genetic polymorphism for blood pressure, evidently it expresses

itself clearly only for systolic blood pressure in females.

3.3.2.2 Serum Cholesterol

After age-adjusting the natural logarithmic transformed Sperry

and Zak cholesterol data, a mixture of bivariate log-normals is fitted

to the data. Tables 3.Sc and 3.Sd give the maximum likelihood estimates

of the parameters for males and females, respectively.

e e e

Table 3.5 ~fuocimum Likelihood Estimates of the Parametersfor a Mixture of Bivariate Log Normal Distributions

a. Trait: Systolic and Diastolic Blood Pressure - Males


P

2X (4 d.f.)

1.000

.564

0.989

0.011 (1)

Three

Srstolic Diastolic

4.567 4.107

4.652 4.107

5.036 4.082

0.10 O.lZ

.343

.646

.011 (1).

.673

0.16

Diastolic

4.107

4.081

0.12

.626

Two

0.11

Systolic

4.622.

5.034

3.58

.12

Diastolic

4.108

0.11

One

Systolic

4.625

a

a l

a Za 3

l!1

l!2

l!3

Parameters

(1) S.E. = .01

\.0trl

Tahle 3.5

b. Trait: Systolic and Diastolic Blood Pressures - Females


Diastolic

4.162

Parameters

l:!l

l:!2

l:!3

(J

al

a 2

a3

p

One

Systolic

4.594

0.10

1.000

0.657

0.11

Two

Systolic

4.566

4.701

0.08

0.796

0.204

0.871

Diastolic

4.169

4.130

0.11

Three

Systolic Diastolic

4.567 3.962

4.567 4.171

4.762 4.210

.08 0.10

0.072

0.792

0.136

0.839

2X (4 d.L)

Signif. level

e

26.46

<.001

7.30

>.10

e e

\DQ\

e e

Table 3.5c. Trait: Sperry and Zak Cholesterol - Males


e

Parameters

1!1

1!2

1!3

a

a 1

a 2

a 3

p

One

Sperry

5.169

0.30

1.000

0.939

Zak

5.340

0.27

Two

Sperry

5.053

5.639

0.18

0.814

0.186

0.830

Zak

5.232

5.784

0.16

Three

Sperry

4.944

5.169

5.669

0.15

0.416

0.411

0.173

0.727

Zak

5.108

5.365

5.809

0.10

2X (4 d.f.)

Signif. level

13.51

.009

7.42

>.10

\D-....J

Table 3.5d. Trait: Sperry and Zak Cholesterol - Females


Parameters

~l

~Z

~3

a

One

Sperry

5.087

0.Z9

Zak

5.223

0.24

Two

Sperry

4.997

5.638

0.19

Zak

5.147

5.700

0.15

Three

Sperry

4.897(1)

5.066(2)

5.651

0.17

Zak

5.035(2)

5.221(2)

5.714

0.12

Ul

Uzu3

p

ZX (4 J.f.)

Signif. level(1) S.E.:: 0.1

(2) S.E.:: O.Z

1.000

0.936

19.56

<.001

.870

.130

0.839

1.58

>.10

.340

.536

.124

.805

\.Q

00

e e e

99

For both the males and the females, a mixture of two bivariate

distributions fits the data better than one, but a mixture of three

distributions does not fi t better than a mixture of two. About 19%

and 13% of the males and females, respectively, belong to the'higher

distribution. Further.more, regardless of whatever initial estimates

are tried for the males and for the females (e.g. first try initial

estimates where there are 2 different means for Sperry cholesterol and

the same means for Zak cholesterol and then where there are 2 different

means for Zak cholesterol and the same means for Sperry cholesterol),

the search of the likelihood surface always converges to the same

maximum likelihood estimates; i.e. there is only one maximum. In

addition, comparing these estimates with those obtained from fitting

mixtures of univariate distributions (Tables 3.4e -3.4h) reveals that

the two sets of means, common variance, and proportions are quite

similar. The estimate of p, the cammon correlation between Sperry

and Zak cholesterol, is similar to the 0.94 in Table 2.7. An inter

pretation of this is that if there is a major gene for Sperry

cholesterol and a major gene for Zak cholesterol, it is the same gene

for both measurements. This case is to be contrasted with the

situation described next.

3.3.2.3 Systolic Blood Pressure and Serum Cholesterol

The univariate analyses have shown that a mixture of two distri

butions fits the data significantly better than one distribution for

Sperry cholesterol, Zak cholesterol, and systolic blood pressure; and

in the case of the latter trait, only for females. There was no

evidence of a genetic polymorphism for diastolic blood pressure in

either sex nor for systolic blood pressure in males.

100

1be bivariate analysis of blood pressure showed that a mixture of

two distributions fits better than one for females only; moreover, the

local maximum in the bivariate case corresponds to the maximum for

systolic blood pressure in the univariate case.

In order to study blood pressure and serum cholesterol jointly,

a mixture of two bivariate log normal distributions is fitted to the

age-adjusted log-transformed systolic blood pressure and either Sperry

cholesterol or Zak cholesterol; diastolic blood pressure is not used

since all attempts in this study to detect a polymorphism have been

unsuccessful.

The purpose of studying systolic blood pressure and serum

cholesterol jointly is to determine whether there is only one maximum,

or if there are two local maxima. The procedure is to start with two

sets of initial estimates. In the first set, the initial estimates of

the means for sys tolic blood pressure are set equal, and the means for

serum cholesterol are different; in the second set, the situation is

reversed. Then, for each set of initial estimates, a search of the

likelihood surface is made for a maximum. In the case of Sperry and

Zak cholesterol in section 3.3.2.2, the estimation procedure found

only one maximum regardless of which set of initial estimates was used.

For males and for females, two local maxima were found for systolic

blood pressure and serum cholesterol. The estimates of the parameters

for the two maxima are shown, for males, in Table 3.6a for Sperry

cholesterol and in Table 3.6b for Zak cholesterol. Tables S.6c and 3.6d

show the estimates for females for systolic blood pressure and Sperry

cholesterol and Zak cholesterol, respectively. In each case, maximum I

corresponds to the maximum for cholesterol while maximum 2 corresponds

~,

101

Table 3.6 Maximum Likelihood Estimates of the Parameters

a. Trait: The Two local Maxima for Systolic Blood Pressure andSperry Cholesterol - Males

Local Maxima

1Parameters Systolic BP

2Sperry Systolic ap Sperry

5.105

5.105

0.29

4.617

4.941

0.10

5.016

5.652

0.19

4.616

4.700

0.11a

ld1

ldz

*L.1

p

0.859

0.141

0.222

96.32

is the likelihood corresponding to the

0.968

0.032

0.378

. th .1 maxllTILDTl.

b. Trait: The Two Local Maxima for Systolic Blood Pressure andZak Cholesterol - ~fuUes

Local Maxima21

Parameters Systolic BP Zal<

ldl 4.628 5.223

B2 4.610 5.774

a 0.11 0.15

<ll 0.792

<lZ 0.208

p 0.295

L/L2 397.50

Systolic BE4.623

4.935

0.10

0.957

0.043

0.368

5.340

5.296

0.27

102

c. Trait: The Two Local Maxima for Systolic Blood Pressure andSperry Cholesterol - Females

Local I\ta.xima

Parameters

~l

1!2

a

1Systolic BP

4.594

4.605

0.10

0.838

0.162

0.368

6616.36

Sperry

4.980

5.579

0.17

2Systolic BP

4.561

4.743

0.07

0.811

0.189

0.131

Sperry

5.044

5.217

0.36

u.. Trait: The Two Local Maxima for Systolic Blood Pressure and ZakCholesterol - Females

Local Maxima

Parameters1

Systolic BP Zal<2

Systolic BP

0.15a

4.594

4.642

0.10

5.134

5.693

4.557

4.732

0.071

5.179

5.324

0.24

0.853

0.147

0.215

2218.55

0.746

0.254

0.112

103

to one for systolic blood pressure. Judging from the likelihoods, the

local maxUnuffi corresponding to cholesterol is much higher. than the

one corresponding to systolic blood pressure. Comparing these

estimates with those obtained from fitting mixtures of univariate

distributions reveals that the two sets of means, cornmon variance, and

proportions are similar. In most cases, the cornmon correlation is

close to the correlation of 0.30 between systolic blood pressure and

Sperry and Zak cholesterol levels (Table 2.7). Estimates of the distri

bution means at these local maxima are plotted in Figure 3.3.

This analysis suggests that, if there is a maJor gene for systolic

blood pressure and a major gene for sennn cholesterol, there are two

separate genes, since fitting a mixture of bivariate normal distribu

tions reveals two local maxima.

104

A. SYSTOLIC BLOOD PRESSURE VS SPERRY CHOLESTEROL

MALE FEMALE

5.5 r------,,...-----,.---.......---.., 5.5

6.0

•5.5

•

5.0

•

•

• • 4_.4.5 l-.__---' --'- --'- ....I 4.5,L-.---'-------'---.........----J

5.0 5.5 6.0 5.0


01XIU.J

g 5.0III>!!!z.J

B. SYSTOLIC BLOOD PRESSURE VS ZAK CHOLESTEROL

MALE

!US r------,----....----r---..,FEMALE

5.5,....-----,.---.....,....---..,.------,

5.0

CL1XI0.J0

5.0~III •>-!!!z.J

• • • •••

•6.05.5

4.5 l-.__---! --'- -L. ....I 4.5L-.---''-------'---.........---~

5.0 5.5 6.0 5.0

LN (ZAK CHOLESTEROL)

KEY: • MEANS FOR LOCAL MAXIMUM CORRESPONDING TO CHOLESTEROL

• MEANS FOR LOCAL MAXIMUM CORRESPONDING TO SYSTOLIC BLOOD PRESSURE

Figure 3.3 Plots of the Estimated Distribution Means forSystolic Blood Pressure and Serum CholesterolSex Corresponding to the Two Local Maxima ofthe Likelihood.

OiAPTER IV

TI1E ~fAJOR GENE HYP01HESIS

4.1 Introduction

The traditional method of analyzing pedigree data has been to

break the pedigree into many two-generational families and then to

look for Mendelian segregation ratios. After the investigator uses

some criterion to classify each individual to a specific genotype,

he will detennine, depending on the mating types of the parents,

whether the genotype distribution in the offspring is consistent wi th

a particular genetic hypothesis. This method of analysis wastes infor

mation in that various relationships (e.g. grandparent-grandchild,

uncle-niece) are ignored. In addition, the families are not indepen

dent, the same individual may be a child in one family and a parent

in another.

Sometimes two-generational data are not sufficient; misleading

conclusions can be drawn. In a classical study by Sewall Wright (1934)

of the inheritance of polydactyly in guinea pigs, he was able to

demonstrate that, for two generations after crossing two inbred strains,

the data mimic simple Mendelian inheritance. However, after further

breeding, it was concluded that p~lygenic inheritance was more appro

priate. MOre recently, Li1ienfeld (1959), also looking at two

generational data, was able to shmv that the inheritance of a trait,

like attending medical school, which is presumed to be determined

principally by socio-cultural factors, is consistent with transmission

106

by an autosomal recessive gene.

Analyzing the whole pedigree intact will yield more genetic infor

mation. Elston and Stewart (1971) have developed a general approach

to the genetic analysis of pedigree data, and their metllod will be

applied to the Bay City pedigree. In this chapter, the approach will

be briefly described, and then the method will be used to seardl for

a major gene for systolic blood pressure, diastolic blood pressure,

and sennn cholesterol separately as well as for the two blood pressures

jointly and for the two measures of serum choles terol jointly. The

next chapter, Chapter V, will be an examination of the polygenic model,

and Chapter VI will consider the mixed model,segregation of a major

gene toge't!~er with polygenic and environmental background.

4.2 The Major Gene Model

By major gene is meant a single gene that can account for a

significant portion of the phenotypic variation. In a one-autosomal

locus, two-allele system, let the two alleles be represented by A

and B, and further, for convenience, let the genotypes be indexed

M = I, AB = 2, and BB = 3.

Consider a pedigree with n individuals and a measure of a

quantitative trait x., (i = 1, ... ,n)1

on each individual. The

mathematical model for x. under a major gene model, given genotype t,1

can be wri tten as

[4.1]

where met) Ct = 1,2,3) is the major gene effect IDld e is the

random environmental effect which is assumed distributed NCO, 02).

107

The major gene effect m is distributed

Genotype

M AB BBIndex l 1 2 3

Effect mCl) 1J1 1J 2 1J3

Frequency I/Jl 1/J1 1/J2 1/J3 Le I/Jl = 1

The mean and variance of mare

ECm) = r3I/Jl mCl) = m

l=l[4.2]

3 2V(m) = r I/Jl m(l)

l=l

-2 2- m = om [4.3]

The quantity 0; is called the variance due to the major gene, and its

estimate will be used to co~ute an estimate of heritability. For

convenience, mCl

) will be written me since there will be no ambiguity

as to what the subscript is referring.

It is assumed in deriving the likelihood that, given the parental

genotypes, the genotypes of the offspring are independent of one another.

Further assume that, conditional on their own genotypes, the

phenotypes of the offspring are independent of one another.

Thus, the likelihood L of observing a sibship of size n with

measures xl' ... ,xn given parental major gene effects ms and mt is

= II f(x·lm , mt )i 1 S

= II r3 f(x·lm) f(m 1m mt )i u=l 1 u u S

=

108

where mu is the major gene effect of the offspring.

Spouses are assumed to be marrying into the pedigree, i. e. they

have no parents who are members of the pedigree. Under the assumption

of random mating, i.e. an individual's phenotype is independent of

his spouse's phenotype and genotype, it is not difficult to consider

spouses in the model. Given an individual's parents have major gene

effect ms and mt , the likelihood of observing him wi th measure

x and his spouse with measure y is

= f(xlms mt ) fey)

3 3I f(x I~) f(m 1m mt ) I f(Ylmy) f(m )~l u s ~l ~

where mv is the major gene effect of the spouse. Therefore, the

likelihood L of observing a sibship and their spouses, given ms ' mt

is

L = IT I3f(x·lm) f(mulms mt ) I 3

f(Y·lm) femv)i ~1 1 u ~l 1 v

[4.4]

For persons with no spouse, the second summation is set equal to one.

This likelihood can be expressed as a function of three quantities

eaCt~ of which will be rewritten to confonn with the notation in the

Elston and Stewart (1971) paper:

1. The genetic mechanism, f(m 1m mt ),u s is denoted simply by

Pstu ' the probability that, conditional on the parents'

genotype being s and t, an individual has genotype u.

For the one-locus, two-allele model these probabilities

109

can be arranged into a 3x3 genetic transition matrix in

which each element is a vector (p$t 1 Pst 2 Pst 3)·

Furthermore, we can express each of the Pst u in the

matrix as a function of three transmission probabilities:

i) T1

= Pr(AA ~ A), probability a parent of genotype

AA will transmit an A allele to his offspring

ii) T2 = Pr(AB ~ A), probability a parent of genotype AB

will transmit an A allele to his offspring, and

iii) T3

= Pr(BB ~ A), probability a parent of genotype BB

will transmit an A allele to his offspring.

The relationship between the pIS and the TIS is:

p = (l-T )(l-T )st 3. s t

s, t = 1,2,3 [4.5]

For the simple Mendelian hypothesis, and T =3

0, and the genetic transition matrix is as shown in Table 4.1.

Table 4.1 The Genetic Transition Matrix for a One-Locus,Two-Allele System

t

s

1 =M

2 = AB

3 = BB

1- M

(1 0 0)

(1 1. 0)2 2

(0 1 0)

2 = AB(1 !. 0)2 2

(1. ! 1)424

(0 t t)

3 = BB(0 1 0)

(0 1. 1.)2 2

(0 0 1)

The phenotype-genotype relationship,2.

is denoted by &u(xi ) or gv(Yi)'

conditional p.d.f., given genotype

110

f(xi!mu) or f(Yi1m),

Thus, gJx) is the

u, of observing x. For

a specific trai t under the one maj or locus bvo -allele model,

there will be three p.J.f. g (x), one for each postulatedugenotype, and the distribution of x given genotype u is

taken to be ~(~u' 02). There is no difficulty in allowing

~(x) to be age and/or sex dependent, or alternatively, x

can be age and/or sex adjusted.

3. The genotypic distribution among persons "external" to the

pedigree, f (m). By "external" is meant that these

individuals have no parents who are themselves members of the

pedigree. Hence, persons "external" to the pedigree include

spouses of individuals in the pedigree and the two original

parents. In most cases, their genotypes will not be known,

and we let fCIDvJ, denoted by Wv ' be the proportion of

individuals "external" to the pedigree with genotype v

(v = 1,2,3).

Under the assumption of random mating and the basic assumption

that, given the genotypes of both parents, the genotypes and phenotypes

of the offspring are independently distributed, Elston and Stewart

(1971) have derived the likelihood of observing a particular set of

pedigree data as a function of the three quantities described above.

The derived likelihood involves a series of summations and products.

There is a Fortran program GENPED (Kaplan and Elston 1975) which will

construct this likelihood.

J

111

4.3 Method of Analysis

For systolic and diastolic blood pressure, as well as Sperry

cholesterol and Zak cholesterol, the analyses done in Chapter II suggest

that a natural logarithmic transformation is appropriate. Therefore,

the pedigree analysis will be done on the natural logarithmic trans-

formed variable, appropriately age-adjusted. If y. is the measure1

of the trait for the i th indiVidual in the pedigree, and x. is his1

age, the variable of interest is the measure of the trait adjusted

to age 30:" "2 2y! = .en y. - 131ex. -30) - 82ex. - 30 )

1 1 1 1

where y' is the log-transformed value adjusted to age 30,

131 is the linear age correction coefficient

82 is the quadratic age correction coefficient.

[4.6]

The quadratic coefficient 82 is used only if necessary, and necessity

is determined by whether, in a regression of .en y against x and x2,

the regression coefficient 132 is statistically significant. Table

2.5 indicates that 82 is needed only for systolic blood pressure and

diastolic blood pressure in males.

The parameters of the model and their interpretation are listed

in Table 4.2. To account for sex differences, there will be one set

of ~'s, 02, and 8' s for males and another set for females. In

this way, the model considers the heterogeneity between sexes, but at

the expense of adding five or six more parameters.

The likelihood of observing a particular set of pedigree data can

be expressed as a function of the parameters listed in Table 4.2. Denote

112

Table 4.2 Parameters of the 010del and Their Interpretation

1- T1: Tile probability that an individual of type AA will transmit A.

2. T2: The probability that an individual of type AB will transmit A.

3. T~: 'Gle probability that an individual of type BB will transmit A..)

4. 1j!1: The relative frequency of type M in "parental" population.

S. 1j!,,: The relative frequency of type AB in "parental" population..:..

6. 1j!3: The relative frequency of type BB in "parental" population.

7. ~l : The mean of the natural log of the trait for individuals of

type AA.

8. ).J2: The mean of the natural log of the trait for individuals of

type AB.

9. ~3: The mean of the natural log of the trait for individuals of

type BB.

10. 0

11. PI:

12. 62:

Tne standard deviation of the natural log of the trait.

The linear age correction coefficient.

The quadratic age correction coefficient (if necessary) .

113

this likelihood as

where ~ is a vector of the parameters. Suppose that there are k

parameters, and write 8 = (8 , 8) where r + s = k. We are- -r-s

interested in testing the hypothesis (r ~ 1) that

HO: ~l" = ~rO agains t HI: ~r ~ ~rO· [4.7]

The test will be done using the Likelihood Ratio (LR) method proposed

by Neyman and Pearson (1928). The method requires the maximum 1ikeli

hood{ML) estimators of (~r' ~s)' giving the uncondi tiona! maximum

of the likelihood function

A '"

L (P 18 , 8 ).-r -s

We will call this the maximum ~f the likelihood under the generoal

unroestT'iated model, i.e. no restrictions have been placed on the

parameters. The LR method also requires finding the ~ln.. estimators of

e when HO holds t giving the condi tional maximum of the LF-s

In most cases, ~s ~ ~s· The test of HO is based on the likelihood

ratio

A =

'"A

L (P I~rO' ~s)

L(PI~r' ~s)[4.8]

114

Wilks (1938) showed that asymptotically, when 1I0

holds, the distribu

tion of -2 to A tends to a x2-distribution with r degrees of

freedom. The term -2 to A, where A is given by equation 4.8, can'" '" '"be rewritten 2 (in L(ple , e ) - to L(ple 0' e))j this is twice the.... r ....s ....r ....s

difference in the two log likelihoods.

There is an alternative to the likelihood ratio test. Under

regularity conditions, the vector of ~~ estimators e is asymptoti-....r

cally rnultinonmally distributed with variance-covariance matrix V

(Kendall and Stuart 1973). Then

is asymptotically distributed as a x2 with r degrees of freedom,

where a consistent estimate of V can be obtained by numerical double

differentiation of the likelihood at its maximum (Kaplan and Elston

1972). It can be shown that the likelihood ratio test and this test

based on the ML estimators of the parameters are asymptotically

equivalent (Kendall and Stuart 1973). Since the distribution of Qr

may approach normality slowly, and since, if r is large, obtaining

a consistent estimate of V by numerical double differentiation can

be quite time-consuming on the computer, hypotheses will be tested

using the likelihood ratio test.

Subject to the constraints 0 s T., ~. s 1 (i = 1,2,3),l. l.

and all the variances being non-negative, the maximum like-l 1jJ. = 1,. l.l.lihood of the pedigree data, maximizing over all the unknown parameters,

can be obtained both under the general unrestricted model and WIder

HO' using the maximum likelihood subroutine package MAXLIK devised by

Kaplan and Elston (1972) which essentially conducts a search of the

115

likelihood surface. MAXLIK allows for various constraints and restric-

tions .

We are interested in testing the following hypothesis:

1. The presence of Harody-Weinberg equi Zibriwn proportions, HO: W2 =

21WlW3' Twice the difference between the likelihoods under the

general unrestricted model and under HO will be compared with a

x2 with 1 d.f.

2. The presence of

l3 = O. Thus,

1simple Mendelian inheritanae, HO: T1 = 1, l2 = 2'

-2 in A will be compared with a x2 with 3 d.f.

3. The presence of a purely environmental hypothesis~ HO: II = l2 =

l3 = T. This is equivalent to saying that the probability of

transmitting the A allele is not dependent on the genotypes of

the parents. Rejection of this hypothesis indicates that there is

transmission from one generation to the next. The resulting

statistic -2 in A will be compared with a x2 with 2 d.f.

4. The presence of a dominant (HO: ~l = ~2) or a reaessive

(HO: ~2 = ~3) hypothesis. In each case, -2 in A will be compared

with a x2 with 1 d.f.

Initial estimates are needed to use the subroutine package ~~IK.

It is not unreasonable to use as initial estimates the final estimates

obtained in fitting the mixture of log nonnal distributions to the

pedigree data (Chapter III). There is no failsafe guarantee that

MAXLIK \vill find the absolute maximum; the program may produce a local

maximum. As a check, it is prudent to enter MAXLIK with various

initial estimates, noting whether it converges to the same maximum

each time.

.' .,'.' '0" . ,.. "

116

There is no problem to extend the Elston and Stewart method to the

bivariate case. Consider a pedigree with n individuals and measures

of two quantitative traits ~ = (xl' x2) on each individual. Eacll

measure in ~ can be adjusted for age using equation 4.6. Analogous

to equation 4.1, the mathematical model for X., ( i =1 , ... ,n)-1

under a

major gene model,given genotype u, can be written as

[4.9]

wllere ID(u) (u = 1,2,3) is the major gene effect and £ is the random

environmental effect which is assumed distributed N(Q, L ), where L

is of the fonn:

L =POIOZ

The numbers 1 and Z refer to the two trait~ and P is the intertrait

correlation.

The major gene effect m is distributed

Genotypes

AA AB BB

Index .e. 1 Z 3

Effect met) ~l ~2 1!3

Frequency WI Wz W3 L Wt = 1.e

The genetic mecruwlism, as expressed by Pst u' is identical to

the univariate case. To express the pllenotype-genotype relationships

under the one major locus, two-allele model, the distribution of ~

given genotype u is taken to be N(Mu ' L)' The genotype distribution

BIOMATHEMATICS TRAINING PROGRAM 117

among persons "external" to the pedigree is again IjJ (v = 1,2,3).v

Although the number of parameters is increased, the method of testing

hypotheses using the likelihood ratio test is the same as for the

univariate case.

It is possible to estimate what portion of the total phenotypic

variance is attributable to the major gene. Assume that the common

variance represents the cOlllllon environmental variance oZ. Thee

variance for the major gene effect 02 is given by equation 4.3.m

Thus, the proportion attributable to the major gene is 02/(02 + 0eZ)m mand can be estimated using the ML estimates in place of the parameters.

Elston, Namboodiri, and Kaplan (1978) suggest that, based on

siJm.Jlation experiments (Go, Elston, and Kaplan 1977), genetic segrega

tion at a major locus can be reasonably inferred if the following

criteria are satisfied during pedigree analysis:

1. The hypothesis of Mendelian inheritance HO: Ll = 1, L2 =i, L3 = 0

cannot be rejected, and the estimated probabilities are close to

these values.

2. The environmental hypothesis, HO: Ll = TZ = T3 = T, is rejected.

3. The likelihood of the pedigree under the dominant hypothesis HO:

ul = Uz is very different from that under the recessive hypo

thesis HO: Uz = u3·

4. The data, ignoring any pedigree structure, fit a mixture of two log

normal distributions significantly better than a single log normal

distribution.

These criteria will be kept in mind in analyzing each of the

traits.

118

For each of the various models (e.g. Mendelian inheritance,

envirorunental hypothesis), the M... estimates of the unlalown parameter

vector .2 of L(P!.2) will be obtained for the Left Pedi1.~ree, the

Right Pedigree, and for Both Pedigrees combined. A likelihood ratio

test can be constructed to test HO: ~L = ~R vs. HI: .2L ~ QR'

where the subscript denotes whether the parameters are for the Left or

the Right Pedigree. The likelihood ratio method requires the uncondi

tional maximum of the likelihood function. Now, since the measures of the

trait in the Left Pedigree and in the Right Pedigree are essentially

independent (there are only four persons common to both pedigrees),

this maximum is approximated by the product of the unconditional

maximum for each pedigree, L(PLPRI.2L.2R) = L(PLI.2L) * L(PR1.2~ .

This is compared with the conditional maximum when HO holds,

L(PLPRI.2L = .2R). The likelihood ratio is

L(PLPRI.2L = .2R)

L(PLI.2L) L(PRI.2R)

The distribution of -2 tn A is asymptotically a x2 - distribution

with k degrees of freedom where, in this case, k is the number of

parameters in ~ •

4.4 Results of Univariate Analyses

4.4.1 Sperry Cholesterol

The maximum likelihood estimates are tabulated in Table 4.3, first

for the Right Pedigree (Table 4.3a) , then for the smaller Left Pedigree

(Table 4.3b) , and finally, for both pedigrees combined assuming that

they are independent (Table 4. 3c) .

119

Table 4.3. Maximum Likelihood Estimates From UnivariatePedigree Analysis of Sperry Cholesterol

a. Right PedigreeHardy-We inberg

Unrestricted Equilibrium ~~nde1ian EnvironmentalTl .631 .555 1.000 .110

T2 .350 .411 .500 .110

T- .0001# .0001# .000 .110.)

WI .051 .001 .000" .026

llJ2.0001# .053 .050 .0001#

llJ .. .949 .946 .950 .974.)

Males

*~l and ~2 5.842 5.837 5.827 5.840

*~ .. 5.216 5.216 5.216 5.219.;)

(J .194 .194 .196 .202

81 .005 .005 .005 .005

Females

*~1 and ~2 5.805 5.807 5.802 5.798

*~3 5.203 5.203 5.203 5.203

(J .163 .162 .163 .161

81 .007 .007 .007 .008

lnL 153.212 152.759 150.136 133.745

2 .906 6.152 38.934X

Signif. level > .10 >.10 <.001

1#Converged to a bound

'IeMeans are adjusted to age 30

120

b. Left PeciigreeHardy-Weinberg

Unrestricted EquilibriLDTl Mendelian Environmental

1 11. 000# 1.000# 1.000 .116

1 2 .423 .469 .500 .116

1 3.008 .005 .000 .116

WI .103 .003 .107 .037

W2 .000# .110 .000# .026

W3 .897 .887 .893 .937

~la1es

*lJ1 and lJ2 5.822 5.820 5.821 5.824

*lJ 35.164 5.164 5.164 5.164

(J .138 .138 .138 .138

61 .007 .007 .007 .007

Females

*lJ1 and lJ 2 5.696 5.698 5.701 5.611

*lJ- 5.121 5.121 5.122 5.099.;)

(J .158 .159 .158 .154

61.005 .005 .005 .007

tnL 52.860 52.218 52.699 49.717

2 1.284 .322 6.286X

Signif. level >.10 >.10 .043

#Converged to a bound

*Means are adjusted to age 30

121

c. Both Pedigrees

Hardy-WeinbergUnrestricted Equilibrium Mendelian Envi roruncnta1

.632 .SS4 1.000 - .10;--Tl

T2 .368 .428 .500 .107

T3.000# .000# .000 .107

1jJ1 .068 .001 .00011 .013

1jJ2 .000# .070 .068 .012

1jJ3 .932 .929 .932 .976

Males.-

lJ1 and lJ 5.830 5.827 5.821 5.841.- 2

lJ35.203 5.203 5.203 5.207

(J .184 .184 .185 .187

81.006 .006 .006 .005

Females.-

lJ1 and lJ2 5.779 5.782 5.777 5.779.-

lJ35.178 5.179 5.179 5.182

(J .168 .168 .169 .172

81 .006 .006 .006 .007

£.nL 199.905 199.249 196.693 177.543

2 1.312 6.424 44.724X

Signif. level >.10 .093 <.001

2~eterog X14 12.334

'Converged to a bound

"Means are adjusted to 30

11.456 12.284 11. 838

122

For each set of data, the first column contains the maximum like li-

hood estimates under the general unrestricted model obtained with the

constraints that the l' S and the lJJ' s could each vary between 0 and

1, and the sum of the ~'s is 1. The second column contains esti-

mates when the likelihood is maximized wi th the above constraints and

the restriction that the ~'s are at Hardy-Weinberg equilibrium. The

third column consists of the estimates when the maximization is done

1with 1 1, 1 2, and 1 3 fixed at 1, 2' and 0, respectively; this

represents the Mendelian hypothesis. Finally, the last column corres

ponds to the maximization being done with all the 1'S set equal

(the environmental hypothesis).

For all three data sets, the test for departure of the ~'s from

Hardy-l~einberg equilibrium is not significant. In the Right Pedigree

and for Both Pedigrees, the environmental hypothesis is emphatically

rejected. In the Left Pedigree, a test of HO: 1 1 = 1 2 = T3 results

in a chi square of 6.286. Ordinarily, for this case, -2 .en A is

distributed asymptotically as a chi square with 2 degrees of freedom.

However, in maximizing the likelihood for the general unrestricted

model, two of the parameters (Tl and ~2) may not be at local maxima

since they have converged to a bound. There is little theory concerning

the distribution of -2 in A in this situation. There have been sug

gestions that the asymptotic distribution is a x2 but with the number

of degrees of freedom being less than two. We will decide whether or

not to reject a hypothesis by relying on a conservative approach.

Whatever the actual cumulative distribution of -2 in A is when certain

parameters have converged to a boundary, it is botmded by the

cumulative distribution of a chi square with 2 degrees of freedom.

123

Using this to approximate the actual distribution of -2 fu A, the

actual significance level Sllould be lower than the nominal one; thus

this is a conservative test. For the test of HO: Tl = TZ : T3 with

a resulting chi square of 6.Z86, the nominal significance level is

0.04, and the actual one is lower. Consequently, this suggests that

there is transmission from one generation to the next.

In the Right Pedigree, the likelihood of the pedigree under the

dominant hypothesis is 13.4 times larger than that under the recessive

hypothesis. In the Left Pedigree, this value is 4.8, and for the two

pedigrees combined, the ratio of the likelihoods is 39.5. These

comparisons show that a dominant hypothesis is preferable for Sperry

cholesterol over a recessive one.

The Mendelian hypothesis cannot be rejected for any of these three

data sets. Again we have to rely on a conservative approach since two

parameters converge to bounds.

In Chapter III, it was shown that a mixture of two log nomal

distributions fits the Sperry cholesterol data significantly better

than one distribution.

Referring to the four criteria listed at the end of section 4.3,

each has been satisfied, and therefore we can conclude from the

analyses that there is a major gene segregating for hypercholesterolemia

in this pedigree. In examining the ~'s, we can see that over 90%

of those "external" to the pedigree have the homozygous recessive

genotype BB, about 7% have genotype AB, and less than 1% have the

homozygous dominant genotype AA. This latter small value is certainly

consistent with the idea that persons homozygous for the hyper

cholesterolemia gene may be subject to selection through premature

mortality.

124

The percent of the Sperry cholesterol variance accounted [or by the

major gene is 42% for males and 44~o for females. The non- significant

heterogeneity ali square values indicate that no heterogeneity was

detected between the estimates for the Left Pedigree and those for the

Right Pedigree.

4.4.2 Zak Cholesterol

The maximum likelihood estimates are tabulated in Table 4.4. The

results of the analyses with Zak cholesterol closely ~irror those with

Sperry cholesterol; t~is is not surprising as the two measures of

cholesterol are highly correlated (correlation of 0.94 in Table 2.7).

In the three data sets, the test for departure of the ~'s from

Hardy-Weinberg equilibrium is not significant. The environmental hypo

thesis is again rejected in the Right Pedigree and in Both Pedigrees,

indicating the existence of vertical transmission. In the Left Pedigree,

the chi square for the environmental hypothesis is 5.104. Since two of

the parameters in the unrestricted case have converged to a bound, the

nominal significance level is about 0.08. On this basis, we cannot

reject the environmental hypothesis. Elston, Namboodiri, and Kaplan

(1978) point out that it is possible for a major gene to be segregating

and find that the environmental hypothesis fits the data. This could

happen if all the parental mating types are the same, in which case the

children's distribution of genotypes would be the same. In this case,

it would be impossible to distinguish between the effect of a major

gene and an environmental effect. For the Left Pedigree in which 90%

of the population are of genotype BB and where the number of individuals

is small, it is not unlikely for all the parental mating types to be

the same.

125

Table 4.4 ~~irnum Likelihood Estimates From Univariate PedigreeAnalysis of Zak Cholesterol

a. Right PedigreeHardy-Weinberg

Unrestricted Equilibrium Mendelian Environmental

(I .635 .488 1.000 .105

(2 .277 .391 .500 .105

(3 .000# .000# .000 .105

4;1 .089 .002 .000# .000#

4;2 .000# .089 .071 .000#

4;- .911 .909 .929 1. 000#.:>

Males

'*\.11 and \.1 2 5.924 5.922 5.930 5.949

'*\.135.388 5.390 5.393 5.417

a .157 .157 .158 .184

81.005 .005 .005 .006

Females

'*\.11 and f.lZ 5.918 5.916 5.883 5.942

*\.1 3 5.379 5.379 5.377 5.383

a .139 .140 .145 .137

81.007 .007 .007 .007

fuL 136.150 135.100 132.934 122.169

2 2.100 6.432 27.962X

Signif. level >.10 .092 <.001

itConverged to a bound

#I~leans are adjusted to age 30

126

b. Left Pedigree

Hardy-HeinbergUnrestricted Equilibrium Mendelian Envi rorunent al

Tl1.000# 1.000 1.000 .096

T2 .374 .418 .500 .096

T3 .007 .004 .000 .096

WI .124 .006 .128 .051

1Ji 2.000# .139 .000# .048

W3 .876 .855 .872 .902

Males

*~1 and l.J 5.922 3.921 5.891 5.923

2I\:

~3 5.327 5.326 5.322 5.327

a .123 .123 .125 .123

61.007 .007 .008 .007

Females

*~1 and l.J2 5.772 5.771 5.772 5.757

*l.J 35.265 5.265 5.265 5.263

a .126 .126 .126 .128

61.006 .006 .006 .006

fuL 46.712 46.032 46.073 44.160

2 1.360 1. 278 5.104X

Signif. level >.10 >.10 .078



127

c. Both Pcdigrces

Hardy-WcinbergUnrestricted J~~Lui libritun ~~ndclian Environmcntal

T1 .659 .521 1.000 .105

T2 .297 .410 .500 .105

T3 .000* .000* .000 .105

1/Ji .103 .003 .000* .000#

1/JZ .000* .104 .096 .033

1/J3 .897 .893 .904 .967

l-1a1es

*1J1 and 1J2 5.928 5.926 5.928 5.940

*1J3 5.375 5.375 5.375 5.381

a .152 .151 .152 .155

61 .005 .005 .005 .005

Females

*1J1 and 1J2 5.854 5.857 5.831 5.914

*1J3 5.343 5.344 5.342 5.353

a .148 .149 .152 .149

61 .006 .006 .006 .006

inL 175.906 174.469 172.444 159.953

2 2.874 6.924 31.906X14Signif. level .090 .074 <.001

Heterog x2 13.912 13.326 13.126 12.75214

IIConverged to a bound


128

In the Right Pedigree as ,~cll as in Left Pedigree, the likelihood

of each pedigree under the dominant hypothesis is more than twice as

large as that under the recessive hypothesis. For Both Pedigrees

together, the ratio of the likelihoods is 11.0. This indicates a

preference for the dominant hypothesis over the recessive hypothesis.

A chi square of 1.278 for the Left Pedigree indicates an adequate

fit of hypercholesterolemia to the major gene hypothesis. The tests

of the Mendelian hypothesis result in a chi square of 6.432 for the

Right Pedigree and a chi square of 6.924 for Both Pedigrees. Ordinarily,

these are chi squares with three degrees of freedom. However, in each

case, two of the parameters under the unrestricted model (T 3 and 1Ji2

)

converged to a bound as did one parameter (lJil) under the Mendelian

model. The nominal significance levels are 0.09 for the Right Pedigree

and 0.07 for both pedigrees combined. On this basis, the genetic

hypothesis cannot be rejected. In Chapter III, it was shown that a

mixture of two log normal distributions fits the Zak cholesterol data

significantly better than one distribution (Table 3.4 g and h) .

As with Sperry cholesterol, the four criteria for inferring that

a major gene for Zak cllolesterol is segregating in this pedigree have

been fUlfilled. The percent of the variance for Zak cholesterol

accounted for by the major gene is 54% for males and 47% for females.

Again, there is no evidence to reject the hypothesis that the estimates

for the Right Pedigree and for the Left Pedigree are homogeneous.

4.4.3 Systolic Blood Pressure

The maximlUll likelihood estimates are tabulated in Table 4.5. In

the Left Pedigree, the likelihood of the pedigree under the recessive

hypothesis (HO: u2 = u3) is twice as large as that under the dominant

129

Table 4.5 ~~imum Likelihood Estimates From UnivariatePedigree Analysis of Systolic Blood Pressure

a. Right Pedigree

Unrestricted Mendelian Envirorunental

T1 .168 1.000 .470

T2 .619 .500 .470

T3.000# .000 .470

WI .225 .234 .194

W2 .775 .766 .697

W3 .000# .000# .109

Males

]JI\: 4.881 4.833 4.7651

*]J2 and ]J3 4.759 4.763 4.786

a .102 .107 .111

81 .006 .006 .006

FemalesI\:

]J1 4.944 4.918 4.936

I\: 4.749 4.738 4.746]J2 and ]J3

.067 .063 .066a

81.007 .007 .007

lIlL 257.948 255.523 255.446

2 4.850 5.004X

Signif. level >.10 .082

it a bound'Converged to


130

b. Left Pedigree

Unrestricted ~lende1ian Envirorunenta1

1 11.000# 1.000 .717

1 2 .659 .500 .717

1 3 .000# .000 .717

~1 .087 .108 .102

~2 .899 .892 .800

~3 .013 .000# .098

Males

*~1 4.988 4.992 4.966

*~2 and ~ .. 4.773 4.775 4.770

.)

a .076 .076 .080

61 .006 .006 .006

Females

*~l 4.913 4.936 4.907

'Ie

~2 and ~3 4.760 4.791 4.744

a .057 .068 .052

61 .008 .007 .009

.fuL 84.916 84.467 84.492

2 .898 .848X

Signif. level >.10 >.10


*Means are adj usted to age 30

c. Both Pedigrees

Unrestricted Mendelian Envirorunental

T11. 000# 1.000 .484

TZ .631 .500 .484

T3 .000" .000 .484

1jJ1 .154 .175 .178

IjJZ .834 .8Z5 .800

1jJ3 .01Z .000" .0Zl

Males1:

]..11 4.803 4.843 4.909

*]..12 and ]..13 4.787 4.773 4.761

0 .115 .111 .099

61 .006 .006 .006

Females1:

]..11 4.910 4.919 4.933

*]..IZ and ]..13 4.741 4.748 4.758

0 .063 .064 .070

61 .008 .008 .007

131

in L 337.875

2X -.-..,.

Signif. levelZHeterog X 9.97814

/#Converged to a bound

1:Means are adjusted to age 30

336.901

1.948

>.106.178

334.539

6.672

.03610.798

132

hypothesis. For the larger Right Pedigree, the ratio of the likelihoods

in favor of the recessive hypothesis is almost 17; for Both Pedigrees

combined, the ratio is 46.2. The evidence shows a clear preference

for the recessive mode of inheritance over the dominant mode. Further-

more, there is no evidence of any departures from Hardy-Weinberg

equilibrium.

For the Right Pedigree, the chi square for testing the environ

mental hypothesis, HO: Ll = L2 = L3' is 5.004. Since two of the

parameters, when the unrestricted likelihood is maximized, converged to

a bound, the nominal significance level is 0.082 with the actual

significance level being somewhat lower. Strictly speaking, the

environmental hypothesis cannot be rejected. Although the Mendelian

hypothesis cannot be rejected (chi square is 4.85), two points must be

noted. Under the unrestricted model, the parameter estimate Ll is

0.168 when theoretically it should be 1.00. Standard errors for the

estimates can be computed numerically by double differentiation of

the likelihood surface at its maximum using ~~XLIK (Kaplan and ElstonA

1972). The computed standard error for Ll is 0.262 which indicates

that Ll is significantly different from 1.00. Secondly, the

difference in the two means for the males is not statistically

significant, suggesting that one distribution will fit the data for

males.

For the Left Pedigree, neither the environmental hypothesis nor

the Mendelian hypothesis is rejected. Since the likelihoods of the

pedigree data under the unrestricted model, under the Mendelian model,

and under the environmental model are very similar, this suggests that

the likelihood surface is very flat; consequently, there is little

133

evidence for a major gene segregating for systolic hypertensio~in the

Left Pedigree.

For Both Pedigrees combined, the environmental hypothesis is

rejected (0.01 < P < 0.05) but the Mendelian hypothesis is not rejected.

There is bimodality in the data for females, but the means of the dis

tributions for males are not significantly different; this is certainly

consistent with the results obtained N'hen mixtures of distributions

were fitted ignoring the pedigree structure (Tables 3.4a and 3.4b).

This is also reflected in the fact that the percentage of the variation

for systolic blood pressure accounted for by a major gene is only 5.4%

for males but 50.6% for females. The test that the estimates of the

Left Pedigree and those of the Right Pedigree are homogeneous is not

rejected.

Although there does not appear to be a ~jor gene for systolic

hypertension segregating in the Left Pedigree the interpretations of

the results for the Right Pedigree cannot be unequivocal. Recalling

the four criteria for inferring genetic segregation at a major locus,

all of them, to some extent, have been satisfied by the Right Pedigree.

For the Right Pedigree, the genetic hypothesis cannot be rejected

although the estimate of 1'1 is significantly different from one.

The environmental hypothesis is not rejected, however the nominal

significance level is 0.082 but the actual one is smaller. There is

clear preference for the recessive hypothesis. A mixture of two

distributions fits the systolic blood pressure data better than a single

distribution for females but not for males. This evidence suggests

that perhaps there may be a major gene for systolic blood pressure

segregating in females in the Right Pedigree. The analysis done on

both pedigrees combined does not disagree with this conclusion.

134

4.4.4 Diastolic Blood Pressure

The maximum likelihood estimates of the parameters are tabulated

in Table 4.6. In both the Left Pedigree and the Right Pedigree, the

dominant and the recess i ve modes of inheri tance are about equally

likely. However, when the two component pedigrees are combined, the

likelihood under the dominant hypothesis is almost 350 times larger

than the likelihood under the recessive hypothesis. This is a

curious and probably a spurious result.

In the Right Pedigree as well as in the Left Pedigree, both the

major gene ID1d the environmental models fit the data; neither hypo

thesis can be rejected. Such a flat likelihood surface indicates

that there is little evidence that there is a major gene segregating

for diastolic blood pressure in this pedigree. Furthermore, recall

that, ignoring the pedigree structure, in neither the males nor the

females does a mixture of two distributions fit the diastolic blood

pressure data significantly better than one distribution (Tables 3.4c

and 3.4d) .

4.5 Results of Bivariate Analyses

The reason for looking at more than one trait at a time is that,

in some instances, doing so may lead to a better separation of groups.

This is not very likely in the case of Sperry cholesterol and Zak

cholesterol as the two traits are so highly correlated (correlation of

0.94). But, in the case of systolic and diastolic blood pressures

(correlation of 0.63), analyzing the traits jointly may produce a

clearer genetic ~1alysis than examining each trait separately. Further

more, blere may be a major gene controlling the correlated portion of

the two traits .

135

Table 4.6 Maximum Likelihood Estimates From UnivariatePedigree Analysis of Diastolic Blood Pressure

a. Right Pedigree

Unrestricted r.1ende1ian Environmental

L11.000# 1.000 .667

L2 .578 .500 .667

T~ .000# .000 .667.)

\til .580 .493 .318

\tI2 .009 .507 .050

\tI3 .411 .000# .632

Males

*].11 4.452 4.446 4.447

* 4.339].12 and ].I~ 4.317 4.302.)

(J .101 .098 .107

61 .015 .015 .016

62 -.0002 -.0001 -.0002

Females

*].11 4.352 4.356 4.356

*f.l2 and f.l~ 4.305 4.298 4.320.)

(J .10S .104 .106

61.007 .007 .007

fuL 243.473 242.323 241. 774

2 2.300 3.398X

Signif. level >.10 >.10



136

b. Left Pedigree

Unrestricted Mendelian Environmental

T1 .651 1.000 .368

TZ .434 .500 .368

T3.000# .000 .368

WI .072 .070 .067

4J2 .927 .829 .833

W3 .001 .100 .100

Males

*~1 4.745 4.593 4.734

*~2 and ~3 4.401 4.392 4.400

a .091 .097 .091

81.004 .004 .004

Females

*~1 4.634 4.634 4.634

*~2 and ~3 4.400 4.400 4.401

a .047 .047 .047

81 .003 .003 .003

fuL2

X

88.411 87.898

1.026

87.972

.878

Signif.1eve1



>.10 >.10

137

There is one addition to the list of parameters in Table 4.2:

p, the conunon correlation between the two traits. Of course, the

means (lJ' s) will now be vectors with two elements, one for each

trait.

4.5.1 Sperry Cholesterol and Zak Cholesterol

The maximum likelihood estimates of the bivariate pedigree

analysis are tabulated in Table 4.7. The dominant mode of inheritance

for both traits which was assl.D11ed in doing the univariate analysis is

retained for the bivariate analysis. For both traits, the means of

the component distributions and the conmm variances in Table 4.7

are almost identical to those obtained by fitting mixtures of bivariate

log normal distributons ignoring the peuigree structure. For the Left

Pedigree ~ld the Right Pedigree, and for both pedigrees combined, there

is no significant departure from Mendelian segregation. The frequency

distribution of the three genotypes obtained in bivariate analyses is

siITlilar to those obtained for both Sperry cholesterol and Zak

cholesterol in univariate analyses. An estimated 8~o of those "external"

to this pedigree have the gene for hypercholesterolemia. These

results only reinforce the conclusion that there is a major gene

segregating for high cholesterol regardless of whether measurement

is made by the Sperry method or by the Zak method.

4.5.2 Systolic and Diastolic Blood Pressures

The maximl.D11 likelihood estimates of the parameters are tabulated

in Table 4.8. When the pedigree relationships are ignored, a mixture

of two bivariate distributions fits the blood pressure data signifr

cantly better than one distribution for females only, and not for

males (Tables 3. 5a and 3. 5b). Even for the females, the estimated means

138

Table 4.7 ~mximum Likelihood Estimates of Bivariate PedigreeAnalysis of Sperry Cholesterol and Zak CholesterolLevels

MendelianSperry Choles Zak Choles

1.000

.500

.000

.000"

.058

.942

a. Right Pedigree

UnrestrictedSperry Choles zak Choles

1"1 .698

1"? .316..1 3

.000#

~1 .058

~2.000#

~3 .942

Males;Ie

].J.1 and ].J.2 5.332 5.939;Ie

].J.- 5.238 5.396.)

(J .183 .160

p .804

81 .005 .005

Females;Ie

].J.l and ].J.2 5.832 5.893;Ie

].J.- 5.196 5.377.)

(J .166 .145

p .843

61 .007 .007

fuL 352.6082

X3


;Ie

Means are adj us ted to age 30

5.832

5.239

.182

.005

5.829

5.197

.166

.007

.804

.845

350.095

5.026

P >.10

5.940

5.396

.159

.005

5.890

5.378

.145

.007

139

b. Left Pedigree

Unrestricted MendelianSperl)" tholes zal< Choles Sperry Cho1es Zak Cho1es

T1 1.000# 1. 000

T2 .374 .500

T3 .007 .000

~1 .125 .128

IV2 .000# .000#

IV- .875 .872.)

t-ta1es

'*J.l1 and J.l 2 5.730 5.892 5.735 5.893

'*J.l3 5.158 5.319 5.156 5.316

CJ .150 .124 .148 .122

p .910 .903

81 .010 .008 .010 .008

Females

"J.ll and J.l2 5.701 5.772 5.701 5.772

*lJ3 5.114 5.265 5.114 5.264

CJ .170 .126 .169 .126

p .730 .730

81 .005 .006 .005 .006

fuL 125.965 125.561

2 .808X_.:> p >.10

#converged to a bound

'*Means are adjusted to age 30

140

c. Both Pedigrees eUnrestricted Mendelian

Sperry Choles Zak Choles Sperry Choles Zak CholesII .678 1.000

l2 .319 .500

l3 .000# .000

1Ji1.082 .000#

1Ji2.000# .080

1Ji3.918 .920

Males'Ie

f..!1 and f..!2 5.823 5.936 5.824 5.935'Ie

f..! ... 5.221 5.377 5.221 5.377.)

a .179 .153 .178 .153

p .823 .824

81 .006 .005 .006 .005

Females

'"f..!1 and f..!2 5.793 5.861 5.786 5.855'Ie

f..!3 5.171 5.346 5.172 5.346

a .174 .150 .175 .151

p .828 .831

81 .006 .006 .006 .006

fuL 466.124 463.044

2 6.160x....) >.10P

2 24.898 25.224Heterog X24


'IeMeans are adjusted to age 30

141

Table 4.8 Maximum Likelihood Estimates of Bivariate PedigreeAnalysis of Systolic and Diastolic Blood PressureLevels

a. Right Pedigree

Unrestricted MendelianSystolic BP Diastolic BP Systolic BP Diastolic BP

Tl .430 1.000

T2 .489 .SOO

T- .248 .000~

WI .190 .163

W2 .810 .785

W3 .000 .052

Males

"~1 4.851 4.246 4.852 4.343

"~2 and ~3 4.769 4.381 4.763 4.365

(J .108 .111 .106 .120

p .735 .591

61 .006 .013 .006 .012

62 -.0001 -.0001

Females

*~1 4.899 4.314 4.920 4.344

"~2 and ~3 4.753 4.336 4.747 4.326

(J .084 .107 .080 .109

p .860 .839

61 .008 .007 .007 .008

tnL 537.102 530.847

2 12.510X3- ---

p = .006

"Means are adjusted to age 30

142

b. Left Pedigree

Unrestricted MendelianSystolic BP Diastolic BP Systolic BP Diastolic BP

1 11.000# 1.000

1 2 .459 .500

1 3.000 .000

tJi1.091 .085

tJi 2 .909 .915

tJi 3.000 .000

Males

'"loll 4.994 4.547 4.993 4.547

*lol2 and lol3 4.776 4.378 4.776 4.377

a .077 .094 .077 .093

p .433 .428

61 .006 .004 .006 .004

Females'it

loll 4.981 4.634 4.981 4.633:'I

lJ2 and lJ3 4.804 4.400 4.804 4.401

a .068 .047 .068 .047

p .407 .407

61 .006 .003 .006 .003

fuL 189.275 189.185

2 0.180X3P >.10


*Means are adj usted to age 30

143

for diastolic blood pressure in 'Table 3.5b are very similar, suggesting

that the bivariate fit is dominated by systolic blood pressure.

For the Right Petiigree (Table 4. Sa), the Mendelian hypothes is is

rejected as the chi square statistic is 12.510. For the males, there

is really only one bivariate distribution for blood pressure; for

females, the means for diastolic blood pressure are similar. This

reflects the results from bivariate curve fitting analysis.

For the Left Pedigree, the major gene hypothesis is not rejected,

but neither is the environmental hypothesis. With a likelihood

surface being so flat, there is no evidence for the existence of a

major gene for systolic anti diastolic blood pressure jointly.

4.4 Conclusions

The univariate pedigree analysis indicates that there is a major

gene segregating in tIris large pedigree for hypercholesterolemia.

This is confinued by the bivariate analysis. The univariate analysis

uncovers little evidence for a major gene segregating for diastolic

blood pressure, but there is a suggestion that perhaps there may be

a major gene for systolic blood pressure segregating among females

in the Right Pedigree.

It should be noted that when Elston and Stewart (1971) derived

the likelihood for doing pedigree analysis, random mating was assumed.

As Table 2.9 showed, there is evidence of assortative mating for these

traits as indicated by the inter-spouse correlations: 0.44 for systolic

blood pressure, 0.41 for Zak Cholesterol, and 0.22 for diastolic blood

pressure and for Sperry cholesterol levels; the first two inter-spouse

correlations are statistically significant.

The question is what is the effect of assortative mating on the

144

analysis. ~~cLean, Morton, and Lffi~ (1975) performed a series of

simulation experiments on nuclear family data to study the power of

segregation analysis of quantitative traits and the robustness when

various assumptions of the model are violated.

To test tIle effect of assortative mating, they generated data

with total genetic correlation between mates. The samples were

generated with no major locus, and their results show that, even in

the presence of an inter-spouse correlation of one, the likelihood

ratio for testing for a major gene never approached significance.

However, it should be noted that they were considering the likelihood

of the phenotypes of a sibship ~onditional on the parental phenotypes

so whether or not assortative mating can simulate a major locus in the

unconditional situation is not yet resolved.

Li (1975) points out that, while the observed correlation between

husband and wife with respect to a trait is the phenotypic correlation,

what is of interest is the genetic correlation between spouses. Under

the assumption that gene effects are additive both intra-locus mld

inter-locus (i.e. no donunance nor epistasis) and that environmental

and genetic effects are uncorrelated, Li states that the relationship

between the phenotypic correlation r and the genetic correlationpp

m can be expressed by

m = h2 r pp

where h2 is the heritability.

Therefore, although the inter-spouse correlations (phenotypic

correlations) for systolic blood pressure is 0.44 and for Zak cholesterol

is 0.41, the genetic correlations, based on estimates of heritability

145

from Chapter VI,are 0.2 and 0.28, respectively. Although the genetic

correlations are smaller than phenotypic correlations, their effect

on pedigree analysis must await fur:her simulation experiments.

Finally, as Table 2.6 shows, for systolic blood pressure and

Sperry and Zak cholesterol, even after taking the log-transfonmation

of the measures of these traits ~ld age-adjusting, the resulting

distributions still are significantly skewed. Is it possible that, in

the presence of skewness, there is a serious danger of detecting

spurious maj or loci? Go, Elston, and Kaplan (1978) use r·1onte Carlo

methods to simulate data with skewness in order to test the robustness

of pedigree segregation analysis. They conclude that "skewness per se

will not lead to the detection of a spurious locus." Their studies

show that only in the presence of polygenic inheritance as well as

sibling environmental correlation is there a possibility that skewness

in the data may falsely detect a major locus.

QIAPTER V

TIlE POLYGENI C I1YFarllESI S

In this chapter, the polygenic model will be considered. Under

this model, we assume that a genotypic value is made up of an "infinite"

number of equal and additive gene effects. Conceptually, we can think

of the phenotype as being determined by the genes at an infini te

number of unlinked loci. At each locus, assume that there are only two

alleles, one which has no effect and the other which increases the

measure of the quantitative trait under study. Further, assume that,

at each locus,the magnitude of the gene effect is equal, and that the

total gene effect at all the loci is simply the sum of the gene effects

at each lOCUS, i.e. there is no interlocus interaction.

Elston and Stewart (1972) have derived the likelihood of a set of

pedigree data under this polygenic model. Their procedure will be

briefly described in the next section, and will be used to determine

the significance of additive genetic effects on serum cholesterol

levels, systolic and diastolic blood pressures under this polygenic

model. In addition, several other variables whose measurements were

obtained on this pedigree will be similarly studied.

5.1 The Polygenic f'.lodel

Consider a pedigree with n individuals and a measure of a

quantitative trait x. ,1

(i = l, ... ,n) on each individual. The

mathematical model for x. under polygenic inheritance can be written1

147

asx.. = ~ + g. + e.

1 1 1[5.1]

where ~ is the overall mean, g.1

is the effect due to a large number

of additive genetic factors, ~ld e. is an environmental effect; the1

two random effects are assumed to act independently. The polygenic

the polygenic effect can be partitioned into two

the additive genetic variance.

ise

Since the

(J2 is calledg

is distributed

are to be estimated.

and the environmental effect

Thus the phenotype x.1

to additive genetic factors,

..,o~),

and 02e

N(O,

20g

is dueg

2 2Ocr + 0e)'

eo

For offspring,

polygenic effect

effect is distributed

distributed N(O, 0;);

independent components,

g. = b + y.1 1

[5.2]

where b is the midparental effect and Yi is the individual deviation

from the midparental effect. Both b and y. are normally distri1

buted with mean zero, and under random mating, b and y. each1

contribute half the variance of g.:1

Thus, if the parental genotypic effects are gM and gF' under

panmixia, the p.d.f. of the genotypic effect within each sibship is

1 2N(2 (gr-.tgF)' °g/2) .

The notation ¢(u, ( 2) will be used to denote the ordinate at u

of the distribution N(O, ( 2):

1

¢(u, ( 2) = (2n0 2) -7 exp [_u2/(202)].

148

The ordinate at u of the distribution N(jJ, 02) is thus denoted by

1? 2 -...,. 2 2

¢(u-jJ, 0-) = (2no) W exp[-(u-jJ) /(20 )]

~ote that ¢(u-lJ, 02) = ¢(jJ-u, 0

2). In light of this, the ordinate at?

u of the distribution ~~(J.I, OW)

2of the distribution ~(u-jJ, 0 ),

is equivalent to the ordinate at zero

and both can be written ¢(u-jJ, 02).

In Chapter IV, it was stated that the likelihood of the set of

pedigree data under the major gene model could be expressed as a

function of three quantities: The genetic mechanism (expressed as a

genetic transition matrix), the phenotype-genotype relationship in the

form of ~(x), the conditional p.d.f. of observing phenotype x

given the uth genotype, and the p.d.f. of the genotypes among those

"external" to the pedigree, i.e. the original parents and individuals

marrying into the pedigree. For the likelihood under the polygenic

model, there are three corresponding quantities.

1. If the parental genotypic effects are gH and gp and the

genotypic effect for their offspring is

the latter being the p.d.f. of the genotypic

the probabilities Pstu' the p.d.f.

02;2) which is the ordinate at g.g 1

gM+gF 20l (2 ,;;gl 2) ,

of the distribution

2.

effect among the offspring given g~1 and gp'

The phenotype-genotypic value relationship is expressed by f(x. Ig·),1 1

the conditional p.d.f. of Xi given genotypic effect gi' The

distribution of x. conditional on g. is taken to be N(jJ + g.,1 1 1

and thus for an individual with measure x. ,1

f(x·lg·) =1 1

149

3. For individuals "external" to the pedigree in that they have no

parents in the pedigree,

is NCO, O~), and thus

the p.d.f. of their genotypic effect h

Zfeh) = ¢(h, 0g)'


genotypic values, the genotypic values and the phenotypes of the off

spring are independent of one another. Thus, the likelihood L of

observing a sibship of size n with measures

n= IT fCxi!gr-t' gF)

i=l

X., xz' ... , x given1 n

=(

IT J f Cx. Ig .) f (g. IgAP gp). 1 1 1 1'.1 g.

1

where f means that everything following it is to be integrated withgi

respect to cr. from -00 to + 00.°1

With the assumption of random mating, i.e. an individual's pheno-

type is independent of the phenotype and genotypic value of his

spouse, it is not difficult to consider spouses in the model. Given

his parents' genotypic effects are g~1 and gp' the likelihood of

observu1g an individual with measure x and his spouse with measure

Y is

= fex Ig~l' gp) fCy)

150

Eence, the likelihood of a sibship and their spouses, given parental

genotypic effects g~l and gF' is

L = ~ f f (x. Ig.) f (g. I g~l gF) f f (y. Ih.) f (h.) .. 1 1 1 1. h. 1 1 11= gi 1

For persons with no spouse, the second integral is set equal to one.

Under the random mating assumption and the implicit assumption

that there is no separable common sibling environmental effect nor

common parent-offspring environmental effect, Elston and Stewart (1972)

derived the likelihood of observing a set of pedigree data under the

polygenic model as a series of products and integrals and as a function

of the three quantities f(g.lgM, gF)' f(x·lg·),1 • 1 1

and f(h.),1

as

defined above. The integrations are performed over all possible

genotypic values, and hence the limits of integration are from -00 to

+ 00. There is a Fortrml program (Green 1972) which constructs the

likelihood for simple pedigrees, i.e. pedigrees that originate from

a single pair of individuals.

2 2There are three parameters to the model: 0g' 0e' and ~. Subject

to the constraints that the variances are non-negative, the maximum

likelihood of the pedigree data can be obtained, maximizing over all

the unknown parameters.

1\e are also interested in estimating the polygenic heritability,

defined as the proportion of the total variance accounted for by the

222additive genetic variance, 0g/Cog + 0e); this estimate is obtained

by replacing the parameters in this expression by their maximum likeli

hood estimates. The likelihood ratio test will be used to test HO:

against H :a2o > O.g

151

As in Chapter IV, the logarithmic-transformed values of the traits

(properly age- and sex-adjusted) are used.

formed value of a trait for the

Let y. be the log-trans1

i th person in the pedigree and let

First, the values on both males and females

using sex-dependent regression coefficients. If

are

y. - bl

(w. - 30)1 m 1

adjusted value is

value for a female is y. 1

(blm

, blf) and (b 2m , b2f)

individual is a male, the

- 302); the corresponding

30) - b7f(w~ - 302) where_ 1

2-b 2m (Wi

blf(wi -

w. be his or her age.1

are adjusted to age 30

the i th

the least squares estimates of the regression coefficients for age and

age2, respectively. Then the values on females are further adjusted

to have the same mean and variance as males at age 30, using the

formula:

where Yf is the final adjusted value for females,

Yf,30 and Ym,30 are the female and male means,

respectively, at age 30,

Sf and sm are the standard deviations for females

and males, respectively, and

Yf,30 is the individual female value adjusted to

age 30.

5.1.1 Sperry and Zak Cholesterol

The ~~ estimates of the parameters for the polygenic model are

tabulated in Table 5.1. For Sperry cholesterol levels, the hypothesis

02 = a is rejected (P < 0.01). The estimate of polygenicg

152

Table 5..! Maximum Likelihood Estimates of the Parametersfor the Polygenic ~1odel for Sperry and ZakCholesterol by Pedigree

PedigreeTrait Left Both Right Ileterog 2

X-.)

Sperry Cholesterol

02 .074* .071 .068g2 .009* .028 .032°e

J..I 5.252* 5.290 5.304

Heritability (%) 89.1 72.1 67.8 3.44

.fuL 45.121 170.671 126.923

2 to test HO: 2 0 30.64 20.90Xl ° =g

Zak Cholesterol

2 .056 .050 .045°g

02 .020 .035 .040e

J..I 5.422 5.478 5.497

Heritability (%) 74.0 58.8 53.4

.fuL 37.257 135.832 100.156 3.16

2 HO

: 02 = 0 4.60 13.48 7.56Xl to test g

Signif. level .032 <.001 .006

*Obtained tmder constraint 2 + 2 20g °e =sT

.e153

heritability is 67.8% for the Right Pedigree, 89.1~ for the Left

Pedigree, and 72.1% for Both Pedigrees combined.

NWllerical problems were encowltered in obtaining the ML estimates

of the parameters in the Left Pedigree. The estimate for the environ-

mental variance 2oe converged to zero, which largely contributed to

obtaining overflmv-errors on the computer. In order to circumvent

these difficulties, the maximization was redone with the constraint

respectively, and the estimate for the

Under the assumption that the measures in the

that the estimates of the additive genetic variance and the

mental variance should sum to the sample total variance si,

2 2 U d h' . h .0e = sT' n er t IS constraInt, t e estlmates

2 25.252 for 0 , 0, and ~,g e

heritability is 89.1%.

are

environ-

. 2I.e. 0 +g

0.74, 0.009, and

Left Pedigree and in the Right Pedigree are independent, using the

method as outlined in Chapter IV, the heterogeneity x2 was computed

to be 3.44; thus, there is no indication that the estimates of the

parameters for the two pedigrees are not homogeneous.

The results for Zak cholesterol are almost identical to those

for Sperry cholesterol except that the estimate of heritability for

Sperry cholesterol is about 14% higher than the one for Zak cholesterol.

The hypothesis of no additive genetic variance is rejected (5% level

for the Left Pedigree and 1% level for the Right Pedigree and for Both

Pedigree combined). Again, there is no indication that the estimates

for the separate pedigrees are heterogenous.

5.1.2 Systolic and Diastolic Blood Pressure

The ML estimates of the parameters are shown in Table 5.2. For

systolic blood pressure, it is estimated that additive genetic effects

account for only 17-21% of the phenotypic variation, while for diastolic

2Heterog x

::J

Table 5.2

Trait

154

Maximum Likelihood Estimates of the Parametersfor the Polygenic ~b<1el for Systolic andDiastolic Blood Pressure by Pedigree

P d'. e 19reeLeft Both Right

Systolic BP2

0g

0;1..l

Heritability (%)

xi to test HO:

.002 .003 .003

.011 .010 .010

4.852 4.822 4.811

16.8 21.2 20.8

78.061 324.992 251.528 9.19(p=.027)

0.26 2.84 2.14

Diastolic BP2 .000# .004 .0030g...,

0" .012 .012 .012e

1..l 4.451 4.421 4.386

Heritability (%) 0.0 27.1 21.8

lIlL 80.548 308.311 234.941 14.36(p=. 002)

2 . 2 0 0.00 3.62 1.96Xl to test HO' o =g

DEstimate converged to a boundary value, zero.

155

blood pressure, the estimate is 22-27%. For both systolic and diastolic

blood pressures, the hypothesis HO: a~ = 0 cannot be rejected. This

suggests that, for these two traits, any familial correlation is not

significantly due to additive gene action. Furthermore, the hetero

geneity x2 of 14.36 for diastolic blood pressure indicates that the

estimates for the separate pedigree are not homogeneous; in particular,

the estimated mean is higher in the Left Pedigree.

5.1.3 Other Traits

Since the polygenic model can be expressed as a function of only

three parameters and since the Fortran program to obtain maximum like

lihood estimates of these three parameters is fairly efficient in terms

of computer time needed, it is interesting to investigate the herit-

ability of the other traits for '''hich data are available. The ML

estimates of the parameters are tabulated in Table 5.3.

Polygenic heritability is nearly non-existent in this pedigree for

height and for weight since the estimate for additive genetic variance

has either converged to zero or is very nearly zero. There is no

heterogeneity in the estimates between the component pedigrees for

either height or weight.

These results for height and weight are unexpected because other

studies (a recent example is Rao, Maclean, Morton and Yee 1975) have

found a significant additive gene effect for height and weight. In

fact, Rao et al (1975), in their segregation analysis of nuclear

families, fitted a mixed.model (segregation of a major gene together

with a polygenic background) to height and to cube root of weight and

concluded that there is no significant major locus for height or for

weight and that the polygenic heritability is significant. Their

156

Table 5.3 MaxirllUJil Likelihood Estimates of the Parametersfor the Polygenic Model for Other Traits byPedigree

Pedi£rees,

H .. 2Trait Left Beth Right e ...erog X7....

Height2 0.000# 0.0004 0.000#0a

'"2 0.006 0.0068 0.007ae

\.I 4.244 4.229 4.223


.en L 95.493 428.308 332.925 0.22

2 ..,.. 0 0.00 0.52 0.00Xl to test HO: a =g

Weight..,

er- 0.000# 0.0001 0.001g

02 0.047 0.048 0.048e

J.l 5.109 5.116 5.121


.fuL 50.362 239.667 189.683 0.76

2 2 = 0 0.00 0.00 0.12Xl to test HO: ag

Uric Acid2 .058 .094 .0940g

a~ .026 .029 .032

J.l 1.416 1.295 1.245

Heritability (%) 69.6 76.2 74.4

tnL 40.414 142.247 106.755 9.84..,

02 =(P = O. 02)

xi to test HO: a 9.52 37.14 29.72g

Signif. level .002 <.001 <.001

IEsttffiate converged to a boundary value, zero.

157

PedigreeTrait Left Both Right 2Heterog X3

Alpha-Lipoprotein,cr .. .020 .008 .006g

cr2 .001 .012 .013e

j.l 4.295 4.277 4.266

Heritability (%) 93.5 42.0 31. 7

.en L 64.392 236.569 176.133 7.91(P '"' .048)

2 2 0 4.82 3.46 1. 54Xl to test HO: crg =

Signif. level .028 .063 >.10

Beta-Lipoprotein2 .136 .138 .137ag2 .039 .071 .081cre

j.l 4.800 4.886 4.913

Heritability (%) 77 .6 66.0 62.6

tnL 21.822 66.399 45.795 2.44(P > .10)

2 .,Xl to test HO: cr" '"' 0 5.05 14.75 8.95

g

Signif. 1eve1 .025 <.001 .001

Cholesterol Esters2 .100 .073 .073crg

cr2 .003 .028 .031ej.l 4.940 4.983 4.994

Heritability (%) 97.6 72.5 70.1

tnL 45.255 167.972 123.926 2.42(P > .10)

2 . 2 0 8.40 29.90 22.50Xl to test HO' crg '"'

Signif. level .004 <.001 <.001

158

PedigreeTrait Left Both Right 2Heterog X...

.)

Prebeta-Lipoprotein2 .062 .123 .149crg

cr2 .084 .174 .197e

J.1 2.993 3.048 3.067

Heritabili ty (%) 42.3 41.4 43.0

.tIlL 21.444 29.176 12.791 10.12(P '" . 018)

2 2 0 1.92 4.72 2.90Xl to test HO: crg =

Signif. level >.10 .030 .089

Phospholipid2 .005 .005 .0060:cr0

cr2 .014 .014 .013e

J.1 5.701 5.720 5.724

Heritability (%) 26.8 28.1 31.2

.tIlL 72.930 305.222 234.172 3.76(P • .053)

2 2 0 1.02 3.90 3.96Xl to test HO: cr =g

Signif. level >.10 .047 .045

159

estimates of the heritabilities are 0.396 + 0.026 for height and 0.363

~ 0.027 for cube root of weight.

However, it must be emphasized that the results obtained in the

present investigation are specific to this pedigree; they should not be

generalized to apply to other pedigrees or other families. In this2pedigree, as shown in Table 2.5, age and age account for about 75%

and 85% of the total variation for height and weight, respectively.

Hence, once age is accolDlted for, the residual variation for height

and for weight is very small, and in light of this small residual

variance, it may not be surprising that efforts to further partition

the variance have been frustrated.

For uric acid, the estimated heritability is 70-76%. The x2

value of 9.84 indicates that there is significant heterogeneity (0.01 <

P < 0.05) in the estimates between the Left Pedigree and the Right

Pedigree, and a comparison shows that the estimate of the overall mean2 2is larger for the Left Pedigree while the estimates of 0g and 0e

are larger for the Right Pedigree.

The analysis of alpha-lipoprotein in the Right Pedigree shows that

the additive genetic variance is not significantly different from zero.

The chi square of 7.91 indicates significant heterogeneity (5% level)

in the estimates between the two sides of the pedigree. In fact, o~

is larger and 0; is smaller in the Left Pedigree. However, in the

Left Pedigree, the estimation of the variance components is based on

40 persons with available values while there are almost three times

that many in the Right Pedigree. Thus, the preponderant weight of the

evidence favors the conclusion that there is no significant additive

160

genetic variance in the inheritance of alpha-lipoprotein.

There is a significant additive genetic variance in the inheritance

of beta-lipoproteins, with the estimated heritability of about 66%.

These results for beta-lipoproteins mirror those obtained for Sperry

and Zak cholesterol, as the correlation between beta-lipoproteins and

either of the two measurements of cholesterol is more than 0.90

(Table 2.7).

Table 2.7 also shows that cholesterol ester levels and Sperry

cholesterol are almost completely correlated; there is a correlation

between cholesterol ester and either Zak cholesterol or beta-lipoprotein

of about 0.90. The estimated polygenic heritability for cholesterol

ester is 97.6% in the smaller Left Pedigree and 70.1% in the Right

Pedigree. Although there is heterogeneity in the heritability estimates,

there is no doubt that the additive genetic variance for cholesterol

ester is significantly larger than zero.

For prebeta-1ipoprotein, the chi square of 10.12 indicates some

heterogeneity between the estimates of the Left Pedigree and the Right

Pedigree. Indeed, the variance component estimates for the Right Pedigree

are more than twice as large_as those for the Left Pedigree. However,

there is little heterogeneity in the proportion of the total variance

accounted for by additive gene action - more than 40%, but the test of

2HO: 0g = a does not attain statistical significance until both

pedigrees are combined.

There is no evidence of heterogeneity in the ~~ estimates for the

phospholipid data. The estimated heritability is about 30%. The

hypothesis that the additive genetic variance is zero is rejected in

the Right Pedigree (5% level) but not in the Left Pedigree due possibly

161

to the small number of individuals with phospholipid data. Tllercfore,

the weight of the evidence suggests that there is an additive genetic

component in the inheritance of phospholipid.

5.2 Conclusions

The analyses in this chapter indicate that, under this polygenic

model, additive genetic effects are significant in this pedigree for

Sperry cholesterol, Zak cholesterol, uric acid, beta-lipoprotein, and

cholesterol ester. Since the model does not specifically allow for

environmental correlations between relatives, this must be interpreted

with caution. In this analysis, additive genetic effects are largely

confounded with environmental correlations. Under these same reserva

tions, for prebeta-lipoprotein and phospholipid levels, the analyses

suggest that there is a significant additive genetic component, but the

results are not as clear-cut as for the variables enumerated above. For

height, weight, alpha-lipoprotein, systolic blood pressure, and diastolic

blood pressure, the estimated additive genetic variance is either very

small or has converged to the bound of zero. These results are

specific to this pedigree.

The analyses in Chapter IV indicated that, while there is little

evidence for a major gene for diastolic blood pressure, there is a

suggestion that perhaps a major gene is segregating for systolic blood

pressure, at least in the Right Pedigree. Also, the evidence of

Chapter IV is rather persuasive that there is a major gene segregating

in this pedigree for hypercholesterolemia. In this chapter, it was

shown that there is a significant additive genetic variance for

cholesterol, and for systolic blood pressure, the estimated herit

ability is about 20%. The question is: Can the genetic variance be

162

apportioned into two components, the major gene component and the poly

genic component?

In the next chapter, we will try to answer this question by

attempting to analyze the cholesterol data as well as the blood pressure

data using a Inixed model. In the preSCJ1Ce of a polygenic background,

are the data consistent \~ith the presence of a segregating major gene?

If there is a major gene segregating, is there a significant additive

genetic variance in the residual variance?

Q-iAPTER VI

THE MIXED H)DEL

The mixed model will be considered ill this chapter. This is

a model that allows for segregation of a major gene together with a

polygenic and environmental background. In a sense, this model combines

features of the major gene model from Chapter IV with features of the

polygenic model from Chapter V.

Consider an autosomal locus with two alleles, A and B, and for

convenience, let the genotypes be numbered M = 1, AB = 2, and

BB = 3. Assume that each of these genotypes makes a specific contri

bution to the measure of the trait x. of the i th individual in the1

pedigree. Labeling this major genotypic effect m, the mathematical

model for the phenotype Xi under the mixed model can be written as

[6.1]

\vhere g. is the polygenic effect and e. the environmental effect1 1

considered in Chapter V. In this model, m has a discrete distri-

bution (see section 4.2), and g. and e. have continuous distributions.1 1

The three variables are assumed to be mutually independent. Each m

takes on the value of one of the three means ~l' ~2' or ~3'

corresponding to genotypes M, An, and BB, respectively, around \1hich

there is random polygenic and environmental variation. The polygenic

effect g is distributed N(O, a~), and e is distributed NCO, a~).

x. ,1

conditional on a particular ffi,

164

is distributed ~(m, o~ +

For offspring, g. can be partitioned into two independent1

components, as in Chapter V,

As before, b and y.1

g. = b + y ..1 1

are each distributed NCO, 02/2)g

[6.2]

Therefore,

as individual's genotypic value consists of two parts: a major geno-

typic effect mu(u = 1,2,3) and a polygenic effect gi'


genotypic values, the p.d.f. of the genotypic effect and the phenotypes

of the offspring are independent. Let ms and mt (s,t = 1,2,3)

refer to the parental major genotypic effects. Thus, the likelihood

of observing a sibship of size n with measures Xl" ",xn given

ms ' mt , and the mid-parental effect b is

= IT f (x. Ib, m , mt

) .i 1 S

Let mu (u=1,2,3) refer to the major genotypic effect of a child. Now,

the probability that the ith child of a sibship has measure x.1

3given ms ' illt , and b is 2 1 f(x. 1m , b) fCm 1m , mt ). The secondu= 1 U U 5

qumltity of this eA~ression ca~ be rewritten as the more familiar

P t of C~apter IV.s u Thus,

LCxlm , mt , b)- s

ihe polygenic effect of the ith offspring is g ..1

165

L(xlm , mt , b) = 11 I p t r f(x·lm, g.) f(e·lb) [6.3]- s ius u )g. 1 U 1 1

1

where Jgi is used to mean that everything following it is to be inte

grated with respect to gi from - ~ to +~. lVith minor modification,

these p.d.f. 's are similar to those in Chapter V.

The first p.d.f.

value relationship.

f(x.lm , g.) expresses the phenotype-genotypic1 u 1

The distribution of xi conditional on the

pOlygenic -effect &i and the major genotypic effect ffiu2be N em + g., a) and thus f (x. Ig., m ) = ¢(x. - m -u 1 ell u· 1 u

distribution of g. given b is ~I(b, ii/2), and thus1 g

is taken to

2gi' ae)· The

f(gi 1b) =

2Hg. - b, a /2) .1 g

Under the assumption of random mating, i.e. an individual's phcno-

type is independent of the phenotype and genotypic value of his

spouse, the likelihood of observing a spouse with measure y( major

genotypic effect mv and polygenic effect h) is

L(y) = 13 ~v f f(Ylh, mv) f(h) [6.4]v=1 h

where f(Ylh, mv) = ¢(y - mv - h, a;), f(h) = ¢(h, a~), and ~v

is the proportion with major genotypic value mv (v = 1,2,3). For

individuals in ~he pedigree with no spouse, the likelihood in equation

6.4 is set equal to one.

Furthermore,implicit1y assuming no separable cornmon sibling

environmental effect nor common parent-offspring environmental effect,

Elston and Stewart (1972) indicated how the likelihood of observing a

set of pedigree data could be expressed under the mixed model as a

function of Pst u' f(xlg, ffiu)' f(glb), ~v' f(Ylh, mv)' and f(h)

166

where s, t, u, x, g, and b refer to persons in the pedigree, and

v, y, and h refer to persons "external" to the pedigree. The

resulting likelihood involves prouucts, summations, and integrations

and contains five parameters: 3 means, \.ll' \.lZ' and \.l3' and Z variance

1"\"2 and 1"\"2components , v vg e

6.1 ~~thod of Analysis

Consider a pedigree with n individuals numbered from 1 to n.

Let xi:: phenotype of i th individual, and let ;S = (xl', ... ,xn) be

the vector of phenotypes. t!ith 2 alleles at an autosomal locus, there

are three possible major genotypes for each individual. Let m be an

1 h ·th I . h· f h . thn x vector w ose 1 e ement IS t e maJor genoty'pe 0 tel

individual. The vector m will be called the genotypic configuration

for the pedigree. Since each person can be assigned any of 3 genotypes,

there are 3n genotypic configurations possible. The likelihood of

observing the pedigree, ,under the mixed model, can be written

L :: L f(~ Im) P (m)m

[6.5]

where the (multiple) summation is over all genotypic configurations.

~lany of the 3n genotypic configurations will not be compatible \vith

the pedigree structure, so for them, P(m) :: O. Even so, with any

moderate size pedigree, the number of terms soon exceeds the capacity

of modern day computers. Therefore, until the likelihood can be

computed, perhaps using numerical integration, methods to approximate

the likelihood ''lill be sougllt.

Ott (1978) suggests t~~ing a random sample of the genotypic con-

figurations. Suppose ng vectors were sampled. Since these vectors

were sampled at random, they are independent, and each has the same

probabili ty of occurring

is approximated by

167

ling' Thus, the likelihood in equation 6.5\' k k thL f ~ 1m) where m is the k sampledk ,..,

configuration, and the summation is over all the sampled vectors. l1ith

this random sampling scheme, some of the mk ~ay have ?(mk) close tokzero, and for others, P(E) may be relatively large.

An apparent improvement to random sampling may be to stratify the

3n vectors of m into those m for which P(m) is relatively large

and those for which P(m) is relatively small. Although this

s tratifieci sampling may be <l reasonable al ternative to random sampling,

there is no simple algorithm yet known to select, say the l most

probable genotypic configurations. Therefore, as a first look at the

mixeci model, we have decided to approximate the likelihood of observing

a set of pedigree data by computing the likelihood conditional on the

"most probable genotypic configuration".

6.1.1 Genotypic Classification of Individuals

Under a specific genetic hypothesis HO' for example Mendelian

inheritance, the likelihood of observing a set of pedigree data can be

computed and is denoted by L(PIHO' 2), where e,.., are the other para-

meters of the model. Specifically for the w~jor gene model, the

likelihood can be written L (P IT1=1, T 2= i, T 3""0, Q) where Q includes

the means, variances, regression coefficients, etc. Suppose the .th1

member in the pedigree has phenotype x ..1

Then the likelihood can be

expressed as the sum of 3 conditional likelihoods, each being the

likelihood of the pedigree where the i th individual has genotype

5 = I, 2,3 (.-\A, AB, BB).

[6.6]

168

The posterior probability that the i th individual should have genotype

t, given what is known about his relatives, can thus be estir;Klted as

"q (t IP, ~) = [6.7]

In this way,individual.

for the.th1

is the ~ estimates of the lUlknOwn parameters Q. Thus if

i th individual, then

where ~

q(uIP,~) is the max q(tIP, 2)t=1,2,3

u is the most probable genotype for the

the most probable genotype can be obtained for all members of the

pedigree, and let m* be the nxl vector of the most probable eeno

types. If m* is not the most probable genotypic configuration, it

should nevertheless be reasonabJy close to it. The vector m* should-be checked for consistency of genotypes within the pedigree, i.e. we

must not have P(m*) = o. The vector of phenotypes x and the vector-m* are used in the mixed model analysis.

The phenotypes used in the following analysis are the age and sex-

adjusted values described in Section 5.1. The Fortran program used in

the polygenic analysis has been modified to allow for the existence of

three means corresponding to the three major genotypic effects. Thus,

conditional on the estimated genotypic configuration, the likelihood

of the pedigree under the mixed model can be computed and maximtnn

likelihood estimates of the parameters obtained. The likelihood ratio

test will be used to test biO hypotheses: HO: ~l = ~2 = ~3 vs.

HI : ~l" ~2 or ~2" ~3' and HO: <1~ = 0 vs. HI : <1~ > O.

Rejection of the first hypothesis indicates that, conditional on the

estimated genotypic configuratiol1, more than one distribution needs to

be fitted to the phenotypes; evidence of this alone is not sufficient

169

to prove the existence of a major gene, but it is certainly consistent

with it. Rejection of the second hypothesis inJicates that, after

allo\~ing for the presence of a major gene effect, there is a signifi

cant additive genetic component to the residual variance.

6.2 Sperry and Zak Cholesterol

under the ~lendelian hypothesis C'r1 = 1, T 2 = ~, T 3 = 0) and

Hardy-Weinberg equilibritD11, the likelihood of the pedigree was used

to assign the most probable genotypes to all the individuals. The

resulting genotypic configuration for both measures of cholesterol may

well be the most probable one as, with very fe\v exceptions, the prob

ability of the most probable genotype for each individual, q(uIP, 8)from equation 6.7, is at least 0.90.

Using this configuration, the maximum likelihood estimates of the

parameters under the mixed model for Sperry and Zak cholesterol are

shm~ in Table 6.1. The estimates of total heritability are tabulated

in Table 6.2.

For Sperry cholesterol, the hypothesis HO: ~l = ~2 = ~3 is

decisively rejected indicating that the data are consistent with the

existence of a major gene segregating for hypercholesterolemia. In

the presence of tIns major locus, the hypothesis HO: a~ = 0 is also

rejected indicating a significant additive genetic component to the

residual variance. About 70% of the total variability in Sperry

cholesterol is accounted for by heritable effects, if environmental

correlations between relatives do not affect the trait. It should be

pointed out that for Sperry cholesterol in the Left Pedigree, after

the variability due to the major gene and the additive genes has been

accounted for, there is very little variability due to random environ-

170

mental effects. This is also true of Zak cholesterol in the Left

Pedigree.

For Zak cholesterol, the hypothesis HO: ~l = ~2 = ~3 is also

decisively rejected. There is signific~~t heterogeneity between the

estimates for the Left Pedigree and those for the Right Pedigree. In

addition to the much smaller estimate of 02 in the Left Pedigree,e

the estimated means are a little larger in the Right Pedigree. For

the Right Pedigree, in the presence of the major gene, there is an

additive gene component but it does not attain statistical significance.

As with Sperry cholesterol, about 70% of the total variability in

Zak cholesterol values can be accounted for by heritable effects.

Figure 6.1 shows a plot of the density functions for Sperry

cholesterol (Figure 6.la) and for Zak cholesterol (Figure 6.lb)

illustrating the mixture of two normal distributions. These plots are

of the cholesterol values for individuals in the larger Right Pedigree;

tIle log-transformed values have been adjusted to age 30. The best

cutoff point is tIle point on the abscissa at which the two component

p.d.f. 's intersect. As one C~l see, the area of overlap is small; the

two components are fairly clearly separated. About 5% of the Sperry

cholesterol values and about 9% of the Zak cholesterol values in this

pedigree are in the higher distribution.

6.3 Systolic Blood Pressure

The assigning of the most probable genotypes to all individuals

in the pedigree in the case of systolic blood pressure was not as

unequivocal as was the case for Sperry and Zak cholesterol, since the

difference in the probability between the most probable genotype and

the next probable genotype is small for many individuals. This suggests

171

Table 6.1 ~~imum Likelihood Estimates of the Parametersfor the r,Iixed Model for Sperry and ZakCholesterol by Pedigree

Pedigree ..,Trait Left Both Right l-ieterog X4

Sperry Cholesterol2 .030 .020 .0180g

02 .002 .014 .017e

fll and fl*Z 5.832 5.833 5.834

fl* 5.188 5.209 5.2153fuL 72 .231 268.574 199.682 6.68

(P > .10)2 .. tX -stat1st1c to tes :

i) HO: ].J.l=fl2=].J.3 53.53 195.81 145.52

•• ) t.: 2 9.17 18.70 10.5911. 1iO: 0=0g

Zak Cholesterol

02 .017 .018 .013g

02 .004 .012 .017e

fl1 and fl~ 5.945 5.963 5.976

fl* 5.349 5.385 5.4003

fuL 63.450 215.522 159.052 13.96(P = .007)

2 .. t tX -stat1st1c 0 tes :

i) EO: fll=fl2=fl3 52.39 159.38 117.79

ii) nO: o~ = 0 9.08 10.92 2.60


172

Table 6.2 Variance Component Estimates, Proportion of theTotal Variance, and Total Heritability Estimatesfor the Various Traits by Pedigree

2 2 2 TotalTrait Pedigree °mg 0g °e Heritability %

Sperry Cliol. Left .042 .030 .002 97.556.5% 41.0% 2.6%

Right .020 .018 .017 68.335.9% 32.4% 31.8%

Both .026 .020 .014 76.542.7% 33.8% 23.5%

Zak Cho1. Left .044 .017 .004 94.668.3% 26.3% 5.4%

Right .027 .013 .017 71.147.7% 23.4% 28.9%

Both .034 .018 .012 80.552.4% 28.1% 19.4%

Systolic BP Left .007 .000 .005 57.457.4% 0.0% 42.6%

Right .005 .000 .006 45.045.0% 0.0% 55.0%

Both .006 .000 .007 44.344.3% 0.0% 55.7%

2 variance attributable to a major gene°rog2 additive genetic variance0g

2 environmental variance°e

Figure 6.1 Cor.~onent and Total Theoretical Density FWlctions

173

a. Sperry Cholesterol

Estimated • I

• I,

t t• ,• • I

2 i percentile• I , I

128 237,

I,II I

97 i percentileI I266 I 493

Mean 184 342

Best Cutoff Point 295

b. Zak Cholesterol

Estimated II I I,I I I

2 ~ percentileI• I I158

I I97 ~ percentile 311 I 553IHean 2 1 394


174

that there may be many genotypic configurations for systolic blood

pressure clustering close to the most probable configuration.

Table 6.3 tabulates the maximum likelihood estimates of the para

meters for the mixed model for systolic blood pressure. The total

heritability estimates are listed in Table 6.2. The hypothesis 110:

~l = ~2 = ~3 is rejected indicating that the data are consistent

witll a major locus for systolic blood pressure. Furthermore, all the

genetic variance in this pedigree, which constitutes about 45% of the

total trait variability, is accounted for by this major locus;

the estimate for cr~ converged to a bound, zero. However, it is

conceivable that if the sibling correlation is much larger than the

parent-offspring correlation, then this situation could mimic genetic

dominance. The large heterogeneity x2 is due to the fact that

the estimates for the means are larger in the Left Pedigree; however

note that the displacement (~l - ~3) is about the same in the Left

Pedigree and the Right Pedigree.

Figure 6.lc shows a plot of the density functions for systolic

blood pressure values in the Right Pedigree. Note that the area of

overlap for systolic blood pressure is larger than for serum choles

terol. Presuming a major locus for essential hypertension, figure 6.lc

suggests that blood pressure, as measured indirectly by the portable

sphygmomanometer, is not a good discriminator. Recall that Hall (1966)

suggest that measuring the blood pressure directly may have the effect

of more clearly separating subgroups. Also note that the estimated best

cutoff point for the Right Pedigree is 135 rom Hg , very similar to

the value of 140 rom Hg used by many clinicians and investigators.

175

Table 6. 3 ~1aximum Likelihood Estimates of the Parametersfor the !'fuced ~1ode1 for Systolic Blood Pressureby Pedigree

Systolic BP2

0g

02e

~i

~2 and ~3

lnL

2 . to tx-stat1s 1C 0

Pedigree

Left Both Right

.000# .000# .000#

.005 .007 .006

5.012 4.951 4.950

4.803 4.780 4.777

99.253 378.414 290.809

Heterog x24

23.30(P < .001)

42.38 106.84 78.56

#Converged to a boundary value, zero


Figure 6.1

c. Systolic Blood Pre5sure

176

,Estimated

,I, fI

2 -i percentile 101I II

I! I97 1percentile

I1391 165

II

Mean 119 141


177

No attempt 'vas made to fit the diastolic blood pressure data to

the mixed model. In Chapter IV, there was no evidence of a major gene

segregating for diastolic blood pressure, and the polygenic analysis

in Chapter V failed to detect any significant additive genetic effect.

In addition, there was manifest heterogeneity between the Left and

Right Pedigree so that results from further analysis, if not meaning

less, would be difficult to interpret.

6.4 Conclusions

For both Sperry and Zak cholesterol, the genetic variance can be

apportioned into two components, a maj or gene component and, with

the possible exception of Zak cholesterol in the Right Pedigree, an

additive genetic component. However, it should be kept in mind that

the model does not include any effects to specifically allow for

environmental correlations between relatives so that the additive

genetic effects in this analysis may be inflated in the presence of

any envirop~ental correlations. The systolic blood pressure data are

consistent with the existence of a major gene.

rnAPTER VI I

S1J1-MARY AND CONCLUSIONS

Data from a five-generational pedigree with 235 individuals from

Bay City, Hichigan were analyzed without having to rely on arbitrary

cut-off points to dichotomize or trichotomize the quantitative data.

MacLean, l\forton, and Lew' (1975) have shown that, based on simulation

studies of nuclear families, there is an appreciable loss of infonna

tion when quantitative data are converted to a dichotomy or trichotomy.

The models used in the analyses are all multifactorial in the

sense that both genetic and environmental influences are considered to

be involved in detennining the phenotype. That is, regardless of

whether the genetic mechanism is a segregating major gene or segre

gation of genes at many loci or a mLxture of a segregating major gene

together with a polygenic background, environmental effects are also

included in the model.

Preliminary to doing any pedigree analysis, initial analyses

,vere done ignoring the familial structure of the data. Mixtures of two

or more univariate normal distributions were fitted to the serum

cholesterol and blood pressure data by maxirrn.Im likelihood methods in

order to test, using the likelihood ratio criterion, for significant

departure from single nonnal distributions. Significant departures

were found for serum cholesterol and for systolic blood pressure in

females, but not for systolic blood pressure in males nor for diastolic

blood pressure.

179

This dissertation considers three specific underlying genetic

models in performing pedigree analyses on quantitative traits. The

first model is the major gene moJel by which is meant a single gene

that can account for a significant portion of the phenotypic variance.

As developed in Chapter IV, for each individual in the pedigree, the

age-adjusted quantitative trait was assumed to come from one of three

lognormal distributions with a common variance, with the three pheno

typic distributions corresponding to the three genotypes of a two

allele locus. The likelihood of observing the phenotypes in the

pedigree was expressed as a function, among other parameters, of

three transmission probabilities (1'S), and goodness of fit of the

Mendelian hypothesis to the data was tested by comparing the maximum

likelihood obtained when the 1'S were allowed to vary between zero

and one with the maximum likelihood obtained when the 1'S were fixed

at their Menuelian values (11 = 1, 1 2:= ~, 1 3 = 0).

In Chapter V, the polygenic model was considered. Under this

r:lodel, the phenotype was determined by a large number of equal and

additive gene effects. There are three parameters to the model: an

overall r.l.ean j.J, an additive genetic variance 02 , and an environ-g

mental variance 02• The heritability, which is the portion of thee .

total variability of the trait in the pedigree that is due to heritable

effects, can be estimated assuming that environmental effects common

to relatives do not affect the trait. The likelihood ratio test was

used to test for a significant additive genetic variance.

Finally, in Chapter Vl, the mixed model was considered; this

model allrn~s for segregation of a major gene together with a polygenic

and environmental background. The importance of this model is that it

180

simultaneously considers the two extreme genetic hypotheses - major gene

on the one hanJ and polygenic inheritance on the other. Using this

model, we can analyze the data to determine if most of the genetic

variation is due to one locus or if many loci are involved.

iiitherto, analyses of pedigree data using the mixed model have

not been attempted for lack of an efficient algorithm to calculate the

likelihood of a pedigree under the mixed model. The problem is, with

n individuals in the pedigree and with 2 alleles at an autosomal locus,

there are three possible major genotypes for each individual and 3n

genotypic configurations possible for the pedigree. IIi th any moderate

size pedigree, the number of terms soon exceeds the limits of modern

day computers. Therefore, as a first look at the mixed model, we

decided to approximate the likelihood function by computing the likeli

llood conditional on one genotypic configuration, namely the most

probable configuration.

The most probable genotypic configuration for the pedigree is

approximated by obtaining tile most probable genotype for each

individual in the pedigree. The most probable genotype is the one for

which the posterior probability that an individual should have that

genotype is maximized. The result is an n x 1 vector of the most

probable genotypes for all individuals in the pedigree. If this vector

is not the most probable genotypic configuration for this pedigree,

it should nevertheless be reasonably close to it. Thus, conditional on

this estimated genotypic configuration, the likelihood of the pedigree

under the mixed model was computed and maximum likelihood estimates

of the parameters, three means and two variance components, obtained.

The likelihood ratio test was used to test whether, conditional on the

181

estimated genotypic configuration, more than one distribution needed

to be fitted to the phenotypes; a significant departure from a single

distribution is consistent with the existence of a major gene. A test

of the hypothesis that the additive genetic variance is zero was done;

rejection of the hypothesis indicates that, after allowing for the

presence of a major gene, there remains a significant additive genetic

component to the residual variance.

The analyses indicated an autosomal dominant gene for hyper

cholesterolemia segregating in this pedigree, regardless of whether

the serum cholesterol levels are measured by the Sperry method ~r the

Zak method. It is estimated that about 70% of the total variability

in serum cholesterol in this pedigree can be accounted for by heritable

effects. The existence of the dominant gene for hypercholesterolemia

in this pedigree agrees with the results of studies by Elston,

Namboodiri et a1 (1975) and Schrott, Goldstein et a1 (1972). This

result has been confirmed by the discovery of the biochemical

mechanism (Goldstein and Brown 1974, 1975) and the reported linkage

between a hypercholesterolemia locus and the C3 locus (Ott, Schrott

et al 1974; Elston, Namboodiri et al 1976; Berg and Heiberg 1977).

The same methods which detected the dominant gene for hyper

cholesterolemia in this pedigree were used to analyze the blood

pressure data. Apriori, since this pedigree was ascertained because

of hypercholesterolemia, the chances of also finding a major locus

segregating for higIl blood pressure is relatively low. Despite this,

the data are consistent with an autosomal recessive gene segregating

for high systolic blood pressure, at least in the larger Right Pedigree.

It is estimated that under the mixed model, about 45% of the total

182

variability in systolic blood pressure in the Right Pedigree can be

accounted for by heritable effects. The mixed analysis indicates

that, in the presence of the major gene, there is no additive genetic

crnlvonent to the residual variance; the residual variance is all

accounted for by environmental effects. The major gene analysis

estimates that about 23% of those marrying into the Right Pedigree

have this gene in the homozygous form. In an analysis which ignored

the pedigree structure, two distributions fit the systolic blood

pressure data significantly better than one in females, but not in

males.

Analogous analyses show that in the Left Pedigree neither the

Mendelian hypothesis or the environmental hypothesis could be rejected,

suggesting that the likelillood surface is flat and that there is little

evidence for a major gene for systolic blood pressure segregating in

the Left Pedigree.Little evidence for a major gene segregating for diastolic blood

pressure is detected in this large pedigree. In addition to the mani

fest heterogeneity between the Left and Right Pedigrees, the environ

mental hypotllesis and the Mendelian hypothesis have similar likeli

hoods, again suggesting a flat likelihood surface.

Various studies have reported an association between weight (or

obesity) and blood pressure. Consequently, the possibility that the

major gene detected for high systolic blood pressure in the Right

Pedigree may be due to an association of systolic blood pressure with

weight must be considered. In our analyses, weight is adjusted for

when adjustments are made for age. In this pedigree, age and age2

account for over 80% of the variability in weight. Although tlle

183

ordinary product moment correlation between systolic blood pressure

and weight is 0.65, the partial correlation coefficient, partialling

age and age2, is reduced to 0.24. Hence, in this pedigree, once

adjustments are made for age, it is unlikely for weight to have much

influence on the results.

Since a major gene is detected for high serum cholesterol, and

one is detected for high systolic blood pressure, and since there is

a correlation between systolic blood pressure and serum G~olesterol,

is it possible that we are observing the pleiotropic effects of only

one gene? In light of the analyses at the end of Chapter III, this

possibility is unlikely. tVhen mixtures of bivariate lognormal dis

tributions were fitted to the systolic blood pressure and serum

cholesterol data (regardless of whether measured by the Sperry method

or the Zak method), two local maxima were found, one corresponding to

serum c11olesterol and the other corresponding to systolic blood

pressure. TIlis result suggests that, if tllere is a major gene for

systolic blood pressure and a major gene for serum cholesterol, there

are two separate genes.

For several reasons, one must be cautious in interpreting the

results of the mixed model analysis. First, the analysis was done

conditional on just one genotypic configuration. The configuration

obtained for Sperry and Zak cholesterol, if they are not the most

probable vector of genotypes, should be very close to them. For

systolic blood pressure, the best that can be said is that the genotypic

configuration used in the ~lalysis is just one of many; hopefully all

of them are clustered near the most probable one.

184

Second, since the configuration was generated using a likelihood

computed assuming an underlying ~1endelian model, the resulting analysis

is biased toward detecting a major gene. It is difficult to determine

to what extent the results are affected by su~~ bias. Comparisons may

have to await more sophisticated numerical methods that will allow the

evaluation of the complete likelihood; the present conditional analysis

could then be replaced by a more general unconditior~l one.

Future research can be focussed in two broad areas: methodologic

and genetic. One methodolic problem has been mentioned above, to

numerically evaluate the appropriate likelihood to perform an uncon

ditional mixed model analysis. Another is to enrich the polygenic and

mixed model by including terms for environmental effects cornmon within

a family. Boyle (1978) has shown in theory how to incorporate an

effect due to common sibling environment, assortative matin~, and an

effect due to common family (parent-offspring) environment. The

challenge is to incorporate the theory intc existing computer

algorithms.

On a more theoretical level, work needs to be done to determine

the asymptotic distribution of the likelihood ratio when some of the

parameters have converged to a boundary value. In addition, the theory

behind testing for a significant fit of a mixture of two or more

univariate (or multivariate) normal distributions over a single normal

distribution should be examined. \volfe (1971) suggests, based on his

Monte-Carlo investigations, that likelihood ratios for mixture problems

are not distributed as a chi square distribution with the degrees of

freedom equal to the number of variables; his studies indicate that

doubling the number of degrees of freedom gives a better fit to the

sampling distribution.

185

In the area of genetics, it would be interesting to compare the

present results with a similar analysis on a pedigree in which it is

more likely for hypertension to be segregating or at least where the

prevalence of hypertension is higher. This might result in a clearer

genetic analysis. At the same time, the search for a biochemical defect

should be continued. However, in the absence of any demonstrable

biochemical defect, the most forceful evidence for a major locus

existing may be derived from linkage analysis. If the presumed locus

for hypertension could be linked, with high probability, to one of the

existing polymorphic genetic markers, then this would be considered

conclusive evidence for a major gene for hypertension.

Appendix 1

Secondary Hypertension: rlypertension OccurringAs a t'<lanifestation of a Known Disease.

1. Disease of the kidneys and urinary tract.

(a) Nephritis; chronic or acute glomerulonephritis.(b) Chronic pyelonephritis

(c) Coarctation of the renal arteries(d) Polycystic kidneys(e) Diabetic glomerulosclerosis

(f) Connective tissue diseases(g) Amyloid contracted kidney(h) Certain tumors

2. Adrenal cortical hyperplasia or tumor

(a) Cushing's syndrome

(b) Aldosteronism

3. Pheochromocytoma

4. Coarctation of the aorta

S. Pre-eclamptic toxemia of pregnancy

6. Post-toxemic hypertension

7. Miscellaneous conditions affecting the nervous system.

8. Oral Contraceptives

Sources: Mend10witz (1961)

Pickering (1968)

186

Appendix 2

List of Variables Observed in 1947

1. Sex

2. Age

3. Total Cholesterol (Bloor and Sperry Method)

4. Cholesterol Esters

5. Phospholipids

6. Total Lipids

7. Systolic Blood Pressure

8. Diastolic Blood Pressure

9. ABO Blood Group

10. !-N Blood Group

11. Rh Blood Group

187

Appendix 3

List of Variables Observed in 1958

l. Sex

2. Age

3. Weight

4. Height

5. Systolic Blood Pressure

6. Diastolic Blood Pressure

7. Pulse

8. Hemoglobin

9. Total Cholesterol Level (Sperry and Zak Hethods)

10. Free Cholesterol

11. Cholesterol Esters

12. Phospholipids

13. Total Fatty Acids

14. Serum Magnesium

15. Uric Acid

16. Ultracentrifuge

Alpha-lipoprotein

Beta-lipoprotein

Prebeta-1ipoprotein

17. ABO Blood Group

18. M~ Blood Group

19 Rh Blood Group

188

189

Appendix 4

Sex. ~e, Hei2ht. Wei¢ht. Systolic Blood Pressure, Diastolic Blood Pressure.Sperry and :&1; Ololesterol, Beta and Prebeulipoprotein Values for ~1el!t>ers

of the PediiTee Observed in 1958. C-l si~ifies data unavailable)

IIGl !It .';,.1 lIE IGH 'I SYS~ liP OIlS BP S PE IiIl Y ZU: BE'U PRE-S!:'! 1i'!F.SC::1 S~X (YPSI (UI (L8S) (~~ H"I (P1~ ilG) (l'lGJ) (''''1 (IlGIlI ('!Glll

L III :: F 73 64.5 172 160 100 207 242 138 213 1;" ~8.V 10; 0 150 80 1<:19 221 118 394 F 63 61.5 146 111J 84 199 229 113 49

L III 7,RlIIlb F' 'is 0;3.0 128 I'll) 110 2214 263 153 30L III 11 " 53 71.0 I~S 165 100 250 27e 1714 29

1.2 F ~6 0.5 157 130 90 300 325 1'77 3713 51 71. v 1<15 120 '10 199 -1 -1 -114 r 46 62.5 172 130 90 189 -1 - 1 -115 w 14 I'! 'i9.5 183 150 95 318 4~J 311 2916 ! 46 65.5 146 132 84 176 -1 -1 -11~

.."'~ 1'';.0 1149 195 1 10 14 18 4 JR 31 Q 38

L III 18.RlII 4 r. 59 6'.5 1'13 130 QO 4 I 8 435 31'1 44RIII ~ ,LIII 19 F 60 61.5 146 134 90 1453 464 JII9 :.16R III i ~ 61 6'.5 1138 130 90 160 21J 105 JIj

8 ,. 59 66.0 145 150 90 222 260 160 209 .. 53 611.~ 175 1110 100 15 J 191 97 20

10 F 52 -l.J 126 124 80 1'19 232 114 3911 F 6" 6 1.0 134 151' 914 200 21,~ 135 2~

1:: " 67 11<:1.0 163 130 05 189 201 11 1 813 ~ 65 lit-.v 195 120 80 165 -1 - 1 -1H F 65 62.5 164 140 80 266 -1 -1 -1

R III lS,LIII 8 w 67 66.0 161 140 85 241 269 148 411RIll 19 r. 50 66.v 155 1140 H 38e 411 280 31

:0 ! 147 614.5 15'\ 130 88 231 323 20 ~ 21\21 :1 69 67.5 194 170 110 19b 222 101 4822 F t9 60.0 leI 190 110 23 I 328 220 31

L IV 1 r 311 1\4.0 135 120 80 163 183 95 12~ 40 71.0 165 130 80 191 -1 -1 -I

3 , III 7l.v 200 130 ~o 1159 1<;5 99 74 F 38 60.5 1114 120 90 119 156 57 105 140 f:7. \) Hl0 1140 100 3106 414 289 246 P' 31; 64.5 173 132 84 196 240 1119 127 F li2 1i"'.0 115 1 HI 78 15 .. 1111 98 7S n 14 I ".5 ,qll 120 90 20 I 231 130 ,.,9 F 39 63.0 h2 140 lOS 191 231 127 13

10 ~ 3<:1 7 0.0 208 130 84 245 231 126 2712 ~: 37 1;9.0 1714 I 10 85 188 209 107 3113 i' 29 65.0 126 114 BO 19 I -, -1 -115 :- 32 62.0 I IS 120 SO 137 176 87 '216 .. 3.. ., 1. 5 175 170 120 212 252 136 26

R IV ~S F ~, 60.5 155 140 104 235 -I -, -I49 w uS '" 8.0 14E: 110 ~O 237 -1 -1 - I50 :1 J9 6e.~ 202 140 90 2145 263 lSI 41151 r 37 1>14.0 173 1214 all 153 168 100 10~. !1 H 6'.0 190 135 '00 2145 286 149 6033 .,. 34 - 1.0 131 1 18 BO 219 - 1 - I -154 I 30 61.0 130 130 90 217 2115 1119 12

L IV 15 F 3 I -1.0 - I lila loa 1111 215 i 1 5124 F 24 6 1.5 1'4 130 ao 186 205 10'! 19~5 ~ 110 67.v 114 1140 90 158 193 93 2126 F 27 67.5 143 130 88 2113 293 198 37,-

~ 2q - 1.0 ,- I 120 !l0 2'9 260 155 20.,28 ~ 25 7'.5 18' 120 75 114 7 193 92 1829 .,. 23 611.0

" ;2120 80 160 -1 -1 -I

30 - 21> E:9.0 IUS 130 80 11414 - I -, - I31 F 19 - 1.0 -1 120 80 222 161 71 20-, :- 1!\ - 1.0 -I 120 80 , 14 2 178 93 II~-33 H f>6.0 1)5 14;) SO 1144 175 86 Ie3~ ~ III 62.5 98 100 60 147 -, -1 -I

190

.~ u! H!lGi:T W!!CH": SIS! S? :H~S Bll SC'E:!lRY ZAK BlTA PRE-IlET AP!:i'SC~ S:X (ns) (IN) (L1:lS) (~!I HG) ('~ MG) ( ~G') llllO') (l~(" ) (llG\)

L IV 35" 20 7:1. V 200 1110 90 )110 - 1 - 1 - 1

R rv ~ 110 6).v 17~ Ill; SO 4)11 1162 296 9)l' 36 f, ~. 0 175 120 '311 207 260 156 10

3 l' )q 62.0 171 11111 911 388 11118 - I 224 ~ 112 66.0 160 1115 110 2::5 249 1211 30S !' )11 63.5 1118 111l 80 ) I II 260 - 1 -I9 ~ )7 6°.5 175 130 90 227 JOEl -1 -1

10 F )2 61.0 1)3 1211 80 230 273 '" III11 .. ):1 F!lI.~ 160 120 80 2)7 J 03 203 912 .. 31 (d.O 1115 11111 90 4Cl6 1176 348 6213 .1) 77.0 :ll~ 18~ 80 230 262 111'3 2714 2~ "71. V 172 120 80 1111 2011 106 1515 r )6 611.0 135 118 7u 340 366 28 J 716 ~ 37 71.0 16B luO 100 261l 280 Ie 3 3217 ~ 25 C,6.0 157 11 R 76 312 361 270 2618 ~ 2) 1'14.5 165 140 90 1128 42d 34 ) 17:.: ~ 13 - 1.0 - 1 120 811 4)5 427 307 28:3 ~ 39 !>'l.5 165 1)0 8 e 212 259 143 3624 l' 36 f,6.U 135 120 qo 222 /.77 158 18o· r 36 611.0 131 120 ~o 214 2113 127 12-~26 ;\ 3:1 71.0 235 130 80 19 I 224 10 I 2627 r. 3!- 67.0 172 1/.5 85 22~ 251 15 I 3028 1" 30 63.5 -1 120 70 189 217 124 1831 ~ )0 69.0 200 135 8~ 189 2) 1 12 :I J332 F 2~ 64.5 '''~ 120 '30 168 200 109 1633 f 20 611.5 1II 3 140 80 179 193 107 834 22 71.5 162 1::0 "711 207 22A 112 3036 r 15 E-5.5 1111 100 70 Cl6 130 52 e37 f 13 e2.-: 1111 11 U 60 150 178 911 11139 ~ .16 67.0 1f,1I 120 80 207 2::9 147 11140 !' 32 63.0 179 1311 90 171 198 102 1141 ~ B fl6.v 166 120 P5 12q 256 129 3P43 ~ 40 -loU -1 120 eo 253 -1 -1 -144 '" 3'7 6 1. G 97 120 80 2117 -1 -1 - I46 ~ 3S H.O 10 I 10 80 24~ 279 11\"7 I I47 :- 3 I flll.1i 1) 1 104 70 153 lP. lOll 1057 ~ 112 67 .0 156 100 75 191 -, - 1 - I58 f 36 ~11.0 13A I 10 70 140 -1 -I -I59 F 40 "3.5 1)8 120 80 382 392 299 1960 II) 71. v 16) 130 AS !tll -I -1 -161 ~ 3d 68.C 182 140 90 305 359 211~ )062 ~ 33 (,3.0 150 1 10 70 163 2)7 132 1663 f 35 60.v 1)) 110 70 297 - 1 -1 -165 ~ 3U {''i.u If>2 130 70 144 -1 -I -167 ~ 32 6~.0 1911 1110 80 289 3117 2f,e J568 r 2q '=' 2 .(1 111 0 I U0 eo 2011 2119 116 5169 ~ J I £-7. S 1~0 lOll 70 201 20 I 11 : 1771 '" 29 63.5 1UO 120 80 196 2011 11: 13I.

~ 33 6~.u 1"5 130 90 't 0 175 96 1573 r 1111 - 1.1,) 135 128 80 2 I 2 2111 130 1674 42 '- 1.u 125 1211 H 17) - I -, - I76 ~ 1111 li8.0 1'70 ISO 90 217 0 211'l 117 57I, 39 "1I.1! 120 120 ~O 153 215 8e 4778 ~ III 73. U IllS 110 70 189 185 92 1779 f )~ 611.5 21111 1211 84 lS3 197 105 1380 Of 110 7).5 161 120 70 240 277 202 21181 )':> bB.O :nl> 130 95 156 222 107 2582 F 33 611.5 129 126 80 181 2211

1'"13

83 )) -1. v I If, lOU 611 lQ .. 21 Q 125 10

191

AuE li£I" .. 1 >:c: Hili '! SYS-r Ei' DIAS BP ::iP!ilFY ZH. etTA PIlE-BETAi'!IlS( N s:r. IYFS) (!:Il (Lh·<n (~~ HG! I:I~ HG) I ~"") (:lG") (!'Ie,;) (I'G';)

R n' 84 36 7].5 1104 120 70 199 23R 14 ~ 986 :- 2" '51.5 126 114 7f:. 110:< 170 8'1 1687 ] 1 73.0 20~ 115 85 212 245 142 ] 1

L V 1 15 611.5 124 -1 -1 196 223 12 " 152 F 12 ~O.O 97 120 80 163 185 9~ 173 1S 66.0 11] 110 80 1)7 180 77 74 1 - 1. a -1 -1 -1 -1 -1 - 1 - 15 I'; ';4.5 In 100 70 122 151 65 156 · 1 1 56.0 81 110 78 117 141 02 9

, 3 61.0 90 100 80 168 199 103 1108 ,~ 13 61. a 90 100 70 lb) 19] 92 159 f 15 ('5.0 119 1111 80 158 1!!9 CJC 1.)

10 6 46.0 1I1i -1 -1 -1 -1 -1 -111 ~ II 102.5 38 -1 - 1 -1 -1 - 1 -1

RV ~6 : 21 61. Ii 1 1~ 11~ '\0 1ll 10 -I - 1 - 1~7 :l Iii "8.5 128 100 70 147 -1 - I - 148 ~ 7 50.0 ~ 1 100 "0 1R1 -1 - 1 -149 F 1] ti4.0 I I I 118 70 1(.) 1910 121 9

SO 'f ..,~7.5 52 - I - I - 1 -1 - 1 -1

51 F ) - 1.'; )2 -1 - 1 - 1 - 1 -1 -15: :- 10 53.5 60 100 60 - 1 -1 -, - 153 :- 8 48.5 60 110 70 - 1 - 1 - 1 -154 : 4 4:<. \) 40 - 1 - I - 1 -1 - I -1

L V 12 ," ) 37.0 ]) -1 - 1 lEd - I -1 - 113 • 1 - 1.0 -1 -1 -1 - 1 - 1 - 1 -114 · 0 -I.e - I - I -1 - 1 - 1 - 1 -115 · 1 - 1.0 25 - 1 - 1 - 1 -, -1 -,

R V 1 1 I 55.5 74 110 70 266 - 1 - 1 -1~ : It> 5'l.0 '14 100 50 lP 155 78 9~

3 F 14 - 1.0 -, -1 - 1 356 - 1 - 1 -14 F 13 /'0.0 92 124 64 207 224 142 21S ~ 7 4'1.5 64 100 70 <03 302 219 86 f 2 - 1.0 2'3 -1 -1 -1 - 1 - I -1

F 17 67.0 160 114 ilO , 5] 193 100 68 F 14 .. O. 0 111 102 54 - 1 -I - 1 - 19 '0 58.0 80 1 14 60 )1:< 355 211 1 18

10 ~ " I",. :, 45 - I - 1 173 -1 -1 -112 f 111 b2.~ 1201 140 80 260 301 213 913 : 11 "6.1l 85 lH 84 140 -1 - 1 - 114 ~ 10 51.1i 90 110 70 245 JH 205 815 .. 11 Su.S 7 I 120 70 191 231 129 1216 f 1 S 1.0 Gl -, - 1 155 211 98 2217 · 8 53.0 8.. 120 liO 201 -1 -1 -118 " 5 45.5 59 110 50 184 202 111 16

19 r 9 ., 3. ~) '" 104 70 271 311 221 2020 ". ~ 51.0 64 - 1 - 1 281 310 220 1321 r 5 - 1.0 47 -1 ·1 176 202 Ill! 1022 F 3 37.0 36 - 1 - 1 -1 -1 -1 -123 ; 10 5 J. L' &0 110 70 158 204 101 26;4 10 <:3.0 &2 110 eo -1 -1 -1 -12S 3 U1.5 ]f, 92 bO - 1 -1 - 1 -116 ; 51,;.0 .,., 100 bO 181 215 @6 22, ..

F 5 u5.5 u2 100 60 147 167 82 11;"'18 r ~ 39.0 36 - 1 -1 - 1 ·1 - 1 -129 2 31.0 30 -1 - 1 -1 -1 -1 -130 10 5).e 60 Ion 70 P4 18b 105 2131 .. 7 u8.0 51 - 1 -1 180 210 121 12.,

~ 5 4C.V C;f, 100 60 IcC; 206 108 20,).

33 /; 41. C 15 ·1 - 1 - 1 - 1 -1 - 1

192

AG: H!!GHr W!IGf!: SYST SF ens BF SP!':FFY ZlK BETA PRE-eETAPFP.SC ~ sr:x (YIiS) (;; ~) (US) (!'!~ IIG) (~~ Hr.) ( !lu~) (!lG~) (~G ~) (!l(il)

R V 34 v 2 -1.u - I - I - I -I -I - 1 - 135 :- (, ~ e. c 40 - I - 1 - I - I -, -,36 ~ 4 - 1.0 41 - 1 -I - I -I - I - I-.

.~ 1 -1.0 24 -I -1 - I - I - I -1;"

39 .. '7 61.5 120 110 ao 153 19& 104 lli40 ;' III 57.0 7e, 1110 70 ISS - I - I - I41 .. 9 4'1.0 55 'It) 60 - I -I - I -142 ~ 2 - 1.0 - I - I - I - 1 - I - 1 - I43 ~ 1 U .. f>. 0 130 1;:'0 70 160 I ali '06 444 ., 9 50.1.1 62 '10 60 196 20 13 I 945 F 6 1I7.1l 112 -, -, Hl6 214 1)0

"59 ,. 18 6 1.5 I It) lOa 60 160 167 e~ 360 l' 64.0 140 100 60 117 - I -I -I61 III 5€.0 95 100 75 127 -I - I - I62 9 4 R. 5 53 120 80 15P. - I - I -I63 f 9 119.\l 48 100 56 150 - I - 1 -I64 5 " 3. \) 37 - I - I I I 9 - I - I -I65 3 Ji.O 35 - I - I 140 -1 - I - I06 ," 12 52.5 100 100 60 11111 160 5& 21167 ,. 13 62.0 119 100 70 201 23li 119 3068 F 13 62.0 92 130 80 137 166 &7 2069 .. 10 50.0 55 '15 &5 150 180 7ll 3270 r 7 lJij.5 110 -, -, 243 - I -1 -,il ~ 4 II 1.5 ~3 - 1 - 1 222 258 173 1&-' f I 33.0 22 -1 -1 - 1 - I -1 -173 14 Ft4.0 109 120 70 135 157 6li 1074 f 10 53. \l 67 100 70 lli7 170 74 1675 F f' 53. v 68 100 70 27 I - I - 1 -176 f II 39.0 5li -I -I ISS - I - I -177 ~ I - 1. 0 32 - I - I -1 - I - I -Ii8 !l 8 119.0 53 100 70 150 - I - I -179 .. 1 u5.0 42 -1 - I 140 -1 - I -I80 . 3 .n.0 26 -I - I 186 -I - I - I81 .. 10 511.5 e2 - I -1 122 171 62 2li82 f 9 liP..O 50 90 60 1;:>2 '66 77 2483 ,. 6 1I".1l 50 87 &0 127 154 73 lli84 ,. II 16.5 30 - I - I - I - I - 1 - I85 v A 50.0 6 I 90 60 163 1611 7 I lli86 'I 5 li7.0 50 - I - I - I -1 - 1 -I87 e 53.5 7: 100 60 It> a ,n 76 IeSo 6 115.5 51 90 60 It> 5 -1 -1 - I39 3 II 1. Ii 33 -, -1 20'1 '8' c; I 290 15 I'> I;. 5 1lit' 100 60 122 16& 80 1591 III 6 J. 5 125 laO 60 132 , a 1 82 199: 13 "<I. v "'I lOU 70 163 209 , 12 1993 8 50.1.1 &7 9li &0 219 217 106 1994 II 39.0 30 -1 - I -, -1 -, -195 6 118.0 56 '10 60 178 2 I I 101 2096 10 54.\) 18 '10 50 ISO '" ~q 1297 ~ 52.5 70 100 60 lb3 193 100 3098 7 14<1.0 .,~ lOu 70 11414 176 88 2099 6 41'>.5 52 - I - I 153 177 88 17

100 3 38.0 36 -1 - I -, - I - 1 -,101 Ii) 55.0 95 110 711 1~9 227 123 1110: 6 53.0 f,0 I 10 70 -1 - I -1 -1103 3 39.0 2'1 -I -1 -1 -1 -1 -I104 2 .17.0 31 -1 - I - I -, - I -1105 0 - 1. C 27 -, - I -1 -, - I - I

BIBLIOGRAPHY

Acheson, R.H. and G.B. Fowler (1967). On the Inheritance of Statureand Blood Pressure. Journal of Chronic Diseases 20:731-745.

Adlersberg, D., A. Parets, E. Boas (1949). Genetics of Atherosclerosis. Studies of Families with Coronary Artery Disease Under theAge of Fifty Years. Journal of the American Medical Association.141:246-253.

Alvord, R.M. (1949). Coronary Heart Disease and Xanthoma TuberosumAssociated with Hereditary tlyperlipemia. Archives of InternalMedicine 84: 1002-1019.

Berg, K. and A. Heiberg (1977). Confirmation of Linkage Between Familial Hypercholesterolemia with Xanthomatosis and the C3 Polymorphism.IVth International Workshop on Human Gene Mapping~ Winnipeg, ~·1ani

toba, August 14-l~.

Borhani, N.O., M. Feinlieb, R.J. Garrison, J .C. Christian, R.H. Rosenman (1976). Genetic Variance in Blood Pressure. Acta GeneticaeMedicaeet GemeUologiae 25:137-144.

Boyle, C.R. (1978). The Likelihood of a Sibship Under Some PolygenicM>dels with Familial Correlation. Unpublished Hanuscript.

Burns, F.S. (1920). A Contribution to the Study of the Etiology ofXanthoma Multiplex. Archives of Dermatology and Syphilogy 2:415.

Cassel, J.C. (ed.) (1971). Evans COlUlty Cardiovascular and Cerebrovascular Epidemiologic Study. Archives of Internal Medicine 128:883-986.

Chakraborty, R., W.J. Schull, E. Harburg, H.A. Schork, P. Roeper (1977).Heredity, Stress, and Blood Pressure, A Family Set Method. V. Heritability Estimates. Journal of Chronic Diseases 30:683-699.

Chauffard, A. and G. LaRoche (1910). Pathogenie du Xanthelasma. SerrrlineMed 30:241.

Cruz-Coke, R. (1959). The Hereditary Factor in Hypertension. ActaGenetica et Statistica Medica (Basel) 9:207-212.

Cruz-Coke, R. (1960). Environmental Influences and Arterial BloodPressure. Lancet ii:885-886 .

194

Downie, W.W., J.A. Boyle, \'I.R. Grieg, W.W. Buchanan, F.P. A1epa (1969).Relative Roles of Genetic and EnviroI1r.1ental Factors in Control ofBlood Pressure in Normotensive Subjects. British Heart Journal 31:21-25.

Elston, R.C. and C.E. Bok1age (1978). An Examination of FundamentalAssumptions of the Twin Method. in International Twin Conference2nd, 1977, Washington, D.C. New York: Alan Liss, in press.

Elston, R.C., K.K. Namboodiri, C.J. Glueck, R. Fa11at, R. Tsang,V. Leuba (1975). Study of the Genetic Transmission of Hypercholesterolemia and liypertrig1yceridemia in a 195 Member Kindred. Annalsof Human Genetics 39:67-87.

Elston, R.C., K.K. Namboodiri, R.C.P. Go, R.H. Siervoge1, C.J. Glueck(1976) . Probable Linkage Between Familial Hypercholesterolemia andThird Complement Component (C3). Baltimore Conference (1975): ThirdInternational Workshop on Human Gene ~~apping. Birth Defects: Original Article Series, XII, 7, 1976, pp. 294-297, The National Foundation, New York.

Elston, R.C., K.K. Namboodiri, E.B. Kaplan (1978). Resolution of HajorLoci for Quantitative Traits. Submitted for Publication.

Elston, R.C. and J. Stewart (1971). A General ~bde1 for the GeneticAnalysis of Pedigree Data. Human Heredity 21:523-543.

Epstein, F.H., W.D. Block, E.A. Hand, T. Francis, Jr. (1959). FamilialHypercholesterolemia, Xanthomatosis, and Coronary Heart Disease.American Journal of Medicine 26: 39-53.

Falconer, D.S. (1960). Introduction to Quantitative Genetics. NewYork: The Ronald Press Company.

Feinlieb, ~1., M. Halperin, R. Garrison (1969). Relationship BetweenBlood Pressure and Age: Regression Analysis of Longitudinal Data.Presented at the Ninety-seventh Annual Meeting of the AmericanPublic Health Association, Philadelphia, Pennsylvania, November1969.

Frederickson, D.S. and R.S. Lees (1965). A System for PhenotypingHyperlipoproteinemia. Circulation 31: 321- 327 .

Frederickson, D.S. and R.S. Lees (1966). Familial Hyper1ipoproteinemia. In The Metabolic Basis of Inherited Disease, J. Stanburg,J. wyngaarden, and D.S. Frederickson (eds.) , Second Edition,pp. 429-485, New York: McGraw-Hill, Inc.

Frederickson, D.S. and R.S. Lees (1972). Familial Hyperlipoproteinemia. In The Metabolic Basis of Inherited Disease, J. Stanburg, J._wyngaarden, and D.S. Frederickson (eds.) , Third Edition, pp. 545614, New York: HcGraw-Hill, Inc.

195

Gearing, F.R., E.G. Clark, G.A. Perera, M.D. Schweitzer (1962). Hypertension Among Relatives of Hypertensives: Progress Report of aFamily Study. American Journal of Public Health 52:2058-2065.

Go, R.C.P., R.C. Elston, E.B. Kaplan (1978). Efficiency and Robustness of Pedigree Segregation Analysis. American Journal of HumanGenetics 30:28-37.

Godal, H.C., J.C. Lund, E. Sivertssen (1956). Lipids and Lipoproteinsin Serum in Cases of Familial Essential Hypercholesterolemia andXanthomatosis. Acta Medica Scandinavica~ supplement 319: 125-133.

Goldstein, J. and ~'I.S. Brown (1974). Expression of the Familial Hypercholesterolemia Gene in Heterozygotes: ~bdel for a Dominant Disorderin Han. Transactions of the Association of American Physicians 87:120-131.

Goldstein, J. and M.S. Brown (1975). Familial Hypercholesterolemia.A Genetic Regulatory Defect in Cholesterol Metabolism. AmericanJournal of Medicine 58:147-150.

Green, P . (1972). Personal Communications.

Guravich, J.L. and J. Venegas (1962). Familial Hypercholesterolemia.Federation Proceedings 21 (Part II): 44- 51.

Hall, E.C. (1966). Analysis and Decomposition of Blood Pressure Levelin a Population By Age, Race, and Sex. Unpublished Ph.D. Dissertation, University of North Carolina, Chapel Hill.

Hamilton, ~1., G.W. Pickering, J.A. Fraser Roberts, G.S.C. Sowry (1954a).The Etiology of Essential Hypertension. 1. The Arterial Pressurein the General Population. Clinical Sciences 13:11-35.

Hamilton, M., G.W. Pickering, J.A. Fraser Roberts, G.S.C. Sowry (1954b'.The Etiology of Essential Hypertension. 2. Scores for Arterial BloodPressures Adjusted for Differences in Age and Sex. Clinical Sciences13:37-50.

Hamilton, ~1., G.W. Pickering, J.A. Fraser Roberts, G.S.C. Sowry (1954c'.The Etiology of Essential Hypertension. 4. The Role of Inheritance.Clinical Sciences 13:273-304.

Harburg, E., J.C. Erfurt, C. Chape, L.S. Hauenstein, W.J. Schull,M.A. Schork (1973). Socio-ecological Stressor Areas and Black-lfuiteBlood Pressure: Detroit. Journal of Chronic Diseases 26:595-611.

Harburg E., J.C. Erfurt, W.J. Schull, M.A. Schork, R. Colman (1977).Heredity, Stress, and Blood Pressure, A Family Set Method - T. StudyAims and Sample Flow. Journal of m,ronic Diseases 30:625-648.

196

Harlan, W., J. Graham, E.H. Estes (1966). Familial Hypercholesterolemia: A Genetic and ~1etabolic Study. Medidne 45: 77 -1l0.

Harlan, W., R.K. Osborne, A. Graybiel (1962). A Longitudinal Studyof Blood Pressure. Circula~ion 26:530-543.

Harris-Jones, J.N., E.G. Jones, P.G. Wells (1957). Xanthomatosis andEssential Hypercholesterolemia. Lancet i:855-857.

Herndon, C.N. (1954). Genetics of the Lipidoses. Research Publications - .4ssociarion for Research in Nervous and Mental Disease33:239-258.

Hirschhorn, K. and C.F. Wilkinson (1959). The Hode of Inheritance inEssential Fami] ial Hypercholesterolemia. American Journal of Medicine 26:60-67.

Jensen, J. and D.H. Blankenhorn (1972). The Inheritance of FamilialHypercholesterolemia. American Journal of Medicine 52: 499- 516.

Jensen, J., D.H. Blankenhorn, V. Kornerup (1967). Coronary Disease inFamilial Hypercholesterolemia. Circulation 36:77-82.

Karme1, W. B. and T. Gordon (eds.) (1970). The Framingham Study. AnEpidemiological Investigation of Cardiovascular Disease. Washington,D.C.: U.S. Government Printing Office, Section 27.

Kaplan, E.B. and R.C. Elston (1972). A Subroutine Package for MaximumLikelihood Estimation (MAXLIK). Institute of Statistics Mimeo SeriesNo. 823, University of North Carolina.

Kaplan, E.B. and R.C. Elston (1975). GENPED - A General PedigreeAnalysis Package. Unpublished Program Write-up.

Kendall, M.G. and A. Stuart (1973). The Advanced Theory of Statistics J

Volume 2J Third Edition. New York: Hafner Publishing Company.

Khachadurian, A. (1964). Inheritance of Essential Familial Hypercholesterolemia. American Journal of Medicr~ne 37: 402-407.

Khachadurian, A. (1969). Lack of Inhibition of Hepatic Synthesis byDietary Cholesterol in Cases of Familial Hypercholesterolemia.Lancet ii:778-780.

Langer, T., W. Strober, R. Levy (1969). Familial Type I I Hyperlipoproteinemia: A Defect of Beta Lipoprotein Apoprotein Catabolism.Jeurnal of Clinical Investigarion 48:49a.

Leonard, J.C. (1956). Hereditary Hypercholesterolaemic Xanthomatosis.Lancet ii:1239-l242.

197

Levy, R. (1971). Classification and Etiology of Hyperlipoproteinemias. Federation Proaeedings 30: 829- 834.

Li, C.C. (1975). Assortative Mating in Man. Memorias de 1a IIRetmi6n Nacional de 1a Sociedad Mexicana de Genetica; Hazat1an,Sin., marzo, 1974. pp. 48-108. Ed. SMG, con el patrocinio deFornento Cultural ~X, Mex., D.F. November 1975.

Lilienfeld, A.(1959). A Methodological Problem in Testing a RecessiveGenetic Hypothesis in Human Disease. Ameriaan Journal of PubliaHealth 49: 199-204.

Lowe, C. R. and T. McKeown (.1g62). Arterial Pressure in an IndustrialPopulation and Its Bearing on the Problem of Essential Hypertension.Lancet i: 1086.

~lcDonough, J.R., G.E. Garrison, C.G. Hames (1964). Blood Pressure andHypertensive Disease Among Negroes and Whites, A Study in EvansCounty, Georgia. Annals of Internal Mediaine 61: 208-228.

McGinley, J., H. Jones, J. Gofman (1952). Lipoproteins and Xanthomatous Diseases. Journal of Investigative Dermatology 19: 7182.

McIlhaney, M.L., J. Shaffer, E.A. Hines, Jr. (1975). The Heritabilityof Blood Pressure: An Investigation of 200 Pairs of Twins Using theCold Pressor Test. John8 Hopkin8 Medical Journal 136: 57-64.

rlcKusick, V. (1960a). Genetics and the Nature of Essential Hypertension. Circulation 22: 857-863.

McKusick, V. (1960b). Medical Genetics - Hypertension. Journal ofCh~onic Di8eases 12: 112-114.

~mcLean, C.J., N.E. Morton, R. Lew (1975). Analysis of FamilyResemblance. IV. Operational Characteristics of SegregationAnalysis. Ameriaan Journal of Human Genetics 27: 365-384.

Meilman, E., C. Ho1t~, P. Samuel (1964). Familial Hypercholesterolemia and Xanthomatosis - Report on a Rare Type of Kindred. AmeriaanJournal of Mediaine 36: 277-280.

Mendlmvitz, M. (1961). Symposium on Hypertension. Introduction.Journal of the Mount Sinai Hospital~ New York. 28: 97-100.

Mendlowitz, ~f., S. Gitlow, N. Naftchi (1959). The Cause of EssentialHypertension. Perspeatives in Biology and Mediaine 2: 354-361.

;"lendlowitz, M., S. Gitlow, R. Wolf, N.E. Naftchi (1964). On the Integration of Factors in Essential Hypertension. Ameriaan HeartJournal 67: 397-409.

Mendlowitz, M., R.L. Wolf, S.E. Gitlow (1970). Catecholamine Metabolismin Essential Hypertension. Ameriaan HeG.I't Journal 79: 401-407.

198

Miall, W.E., P. Heneage, T. Khosla, H.G. Lovell, F. Moore (1967).Factors Influencing the Degree of Resemblance in Arterial Pressureof Close Relatives. CZiniaal Saienaes 33: 271-283.

~tiall, W.E. and H.G. Lovell (1967). The Relationship Between Changeof Blood Pressure and Age. British Mediaal Journal i: 660-664.

Miall, W.E. and P.D. Oldham (1963). The Heredity Factor in ArterialBlood Pressure. British Mediaal Journali: 75-80.

rbrrison, S.L. and J.N. Morris (1959). Epidemiological Observationson High Blood Pressure Witllout Evident Cause. Lanaet ii: 864-870.

~'brrison, S.L. and J.N. Mbrris (1960). Nature of Essential Hypertension. Lanaet ii: 829-832.

Morton, N.E. (1967). The Detection of Major Genes Under AdditiveContinuous Variation. Ameriaan JournaZ of Human Genetias 19: 2324.

Murphy, E. (1964). One Cause? Hany Causes? The Argument from theBinomial Distribution. Journal of Chronia Diseases 17: 301-324.

~urphy, E.A., C.B. Thomas, D.R. Bolling (1966). Genetic Factors inBlood Pressure Control? Evidence From the Distributions.Ciraulation 34 (Supplem. 3): 23.

National Center for Health Statistics (1966). Hypertension and Hypertensive Heart Disease in Adults. United States. 1960-1962. PubZiaHealth Serviae Publiaation No. 1000, Series 11, Number 13.

j~ational Center for nealth Statistics (1969). Blood Pressure, As itRelates to Physique, Blood Glucose, and Serum Cholesterol. Uni~ed

States. 1960-1962. Publia Health Serviae Publiaation No. 1000,Series 11, Number 34.

Neyman, J. and E.S. Pearson (1928). On the Use and Interpretation ofCertain Test Criteria for the Purposes of Statistical Inference.Biometrika 20A: 175 and 263.

Osborne, R.H., F.V. DeGeorge, J.A.L. Mathers (1963). The Variabilityof Blood Pressure: Basal and Casual Measure~ents in Adult Twins.Ameriaan Heart Journal 66: 176-183 .

Ostfeld, A. and O. Paul (1963). The Inheritance of Hypertension.Lanaet i: 575-579.

Ott, J. (1978). The Maxbnum Likelihood Estimates of the VarianceComponents in numan Pedigrees Under Polygenic and Hixed ~1odels ofInheritance. Submitted for Publication.

Ott, J., H.G. Schrott, J.L. Goldstein, W.R. Hazzard, F.H. Allen, Jr.,C.T. Falk, A.G. Motulsky (1974). Linkage Studies in a Large Kindredwith Familial Hypercholesterolemia. Ameriaan Journal of HumanGenetias 26: 598-603.

199

Pearson, E.S. and H.O. Hartley (Editors) (1954). Biometrika Tables fopStati8ticians,Volume I.Cambridge University Press.

Perera, G.A. (1960). Nature of Hypertension. Lancet ii: 153.

Pickering, G. (1961). The Natupe of Essential Hypeptension. Gruneanci Stratton, Inc.

Pickering, G. (1967). The Inheritance of Arterial Pressure. In TheEpidemiology of Hypertension, J. Stamler (ed.), :~ew York and London:Grune and Stratton, Inc., pp. 18-27.

Pickering, G. (1968). High Blood Ppessure. New York: Grune andStratton, Inc.

Piper, J. and L. Orrild (1956). Essential Familial Hypercholesterolemiaand Xanthomatosis. American Journal of Medicine 21: 34-46.

Platt, R. (1959). Nature of Essential Hypertension. Lancet ii: 55-57.

Platt, R. (1961). Essential Hypertension. Incidence, Course, andheredity. Annals of Internal lfedicine. 55: 1-11.

Platt, R. (1963). Heredity in Hypertension. Lancet i: 899-904.

Platt, R. (1967). The Influence of Heredity. In The Epidemiology ofHypertension, J. Stamler (ed.), New York and London: Grune andStratton, Inc., pp. 9-17.

Pollack, H. and D.E. Krueger (ed.) (1960). Epidemiology of Cardiovascular Diseases. Methodology. American Journal of Public Health(supplement) 50: 1-124.

Rao, C.R. (1962). Advanced Stati8tical Method8 in Biometric Re8earch.;~e\"" York: John Wiley and Sons, Inc.

Rao, D.C., C.J. ~1acLean, N.E. Morton, S. Yee (1975). Analysis ofFamily Resemblance. V Height and Weight in Northeastern Brazil.American Journal of Human Genetic8 27: 509-520.

Rayer, P.F.O. (1836). Traite Theorique et FPatique8 des !1aladies dela Peau. Paris.

Report of Inter-Society Commission for Heart Disease Resources (1970).Primary Prevention of Hypertension. Circulation 42: A39-A4l.

Schrott, H.G., J.L. Goldstein, W.R. Hazzard, M.M. ~~Goodwin, A.Motulsky (1972). Familial io/Percholesterolemia in a Large Kindred.Evidences for a Monogenic Mechanism. Annals of Internal Medicine76: 711-720.

Schull, W.J., E. Harburg, M.A. Schork, J. Weener, C. Chape .(1977).Heredity, Stress and Blood Pressure, A Family Set Method - IIIFamily Aggregation of Hypertension. Journal of Chronic Diseases30: 659-669.

BIOMATHEMATICS TRAINING PROGRAM 200

Slack, J. and N.C. Nevin (1968). Hyper1ipidaemic Xanthomatosis.Increased Risk of Death From Ischaemic Heart Disease in FirstRelatives of 53 Patients i~ith Essential Hyper1ipidaemia andXanthomatosis. Journal of MedicaZ Genetics 5: 4-8.

I.Degree

Snedecor, G.W. and W.G. Cochran (1967). Statistical Methods) SixthEdition. Ames, Imva: The Imva State University Press.

Solbye, P. (1948). Heredity of Essential Hypertension and nephrosclerosis: Genetic CZinical Study of 200 Propositi Suffering FromNephrosclerosis. CopenhageIl, A. Busck, Nyt Nordisk.

Sperry, W.~'1. (1945). The Schoenheimer-Sperry Nethod of the Determination of Cholesterol, Revised Edition. Department of Biochemistry,New York Psychiatric Institute.

Stecher, R.M. and A.H. Hersh (1949). Note on the Genetics of Hypercholesterolemia. Science 109: 61-62.

Stone, N. and R. I. Levy (1972). Hyper1ipoproteinemia and CoronaryIieart Disease. Progress in Cardiovascular Diseases XIV: 341-359.

Svendsen, ~l. (1940). Are Supernormal Cholesterol Values in SerumCaused by a Dominantly- Inheri ted Factor? Acta t1edica Scandinavica104: 235-244.

Thomas, C.B. and B.H. Cohen (1955). The Familial Occurrence of Hypertension and Coronary Artery Disease, With Observations ConcerningObesity and Diabetes. Annals of Internal Medicine. 42: 90-127.

Todd, J.C. and A.H. Sanford (1943). Clinical Diagnosis 10th Edition)Philadelphia: W.B. Satmders, Co., p. 380.

\~ee1er, E.O. (1957). The Genetic Aspects of Atherosclerosis. AmericanJournal of Medicine 23: 653-660.

~~ilkinson, C. F., E. Hand, ~1. F1iegelman (1948). Essential FamilialHypercholesterolemia. Annals of Internal Medicine 29: 671-686.

\'Iilks, S.S. (1938). The Large Sample Distribution of the LikelihoodRatio for Testing Con~osite ~iypotheses. Annals of MathematicalStatistics 9: 60-62.

Wolfe, J. (1971). A Monte Carlo Study of the Sampling Distribution ofthe Likelihood Ratio for Mixture of Mu1tinormal Distributions. NavalPersonnel and Training Research Laboratory, Technical Bulletin STB72-2.

Nright, S. (J934). Results of Crosses Between Inbred Strains of GuineaPigs, Differing in ;~lD1lber of Digits. Genetics 19: 537-551.

Zak, B., R.C. Dickenman, E.G. \~ite, H.H. Burnett, P.J. Cherney (1954).Rapid Estimation of Free and Total Cholesterol. American Journalof Clinical Pathology 24: 1307-1315.

-'

BIOMATHlMATICS - NC State Department of · PDF fileBIOMATHlMATICS TRAINING PROGKAM ... method...

Documents

Transcript of BIOMATHlMATICS - NC State Department of · PDF fileBIOMATHlMATICS TRAINING PROGKAM ... method...