April 20
-
Upload
shana-noble -
Category
Documents
-
view
33 -
download
0
description
Transcript of April 20
![Page 1: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/1.jpg)
April 20• Life-table calculation - handout• Proportional hazards (Cox) regression
• Exam on April 27• Review session Friday at 1:00PM
![Page 2: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/2.jpg)
Uses of Regression• Combine lots of information– Look at several variables
simultaneously
• Explore interactions–model interaction directly
• Control (adjust) for confounding
![Page 3: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/3.jpg)
Proportional hazards regression
• Can we relate predictors to survival time?
– We would like something like linear regression
• Can we incorporate censoring too?
• Use the hazard function
...22110 XBXBBt
![Page 4: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/4.jpg)
Hazard function
• Given patient survived to time t, what is the probability they develop outcome very soon? (t + small amount of time)
• Approximates proportion of patients having event around time t
![Page 5: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/5.jpg)
Hazard function
) (Prob
)(TttTt
t
Hazard less intuitive than survival curve
Conditional p the event will occur between t and t+ given it has not previously occurred
Rate per unit of time, as goes to 0 get instant rate
Tells us where the greatest risk is given survival up to that time
![Page 6: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/6.jpg)
Possible Hazard of Death from BirthProbability of dying in next year as function of age
0 6 17 23 80
t)
At which age would the hazard be greatest?
![Page 7: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/7.jpg)
Possible Hazard of Divorce
0 2 10 25 35 50
![Page 8: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/8.jpg)
Proportional hazards regression model
)exp()()( 110 Xtt
0(t) - unspecified baseline hazard; the hazard for subject with X=0
1 = regression coefficient associated with the predictor (X)
1 positive indicates larger X increases the hazard
Can include more than one predictor
![Page 9: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/9.jpg)
Interpretation of Regression Parameters
)exp()()( 110 Xtt
For a binary predictor; X1 = 1 if exposed and 0 if unexposed,
exp(1) is the relative hazard for exposed versus unexposed
(1 is the log of the relative hazard)
exp(1) can be interpreted as relative risk
![Page 10: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/10.jpg)
Example - risk of outcome forwomen vs. men
For males;
For females;
)exp()(
)exp()(malesfor hazard
femalesfor hazardhazard Relative 10
10
tt
)exp()()( 110 Xtt Suppose X1=1 for females, 0 for males
)()0*exp()()( 010 ttt
)exp()()1*exp()()( 1010 ttt
![Page 11: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/11.jpg)
Example - Risk of outcome for1 unit change in blood pressure
For person with SBP = 114
)exp()113114exp(
)*113exp()()*114exp()(
1
11
10
10
tt
)exp()()( 110 Xtt Suppose X1= systolic bloodpressure (mm Hg)
)114*exp()()( 10 tt
)113*exp()()( 10 tt
Relative risk of 1 unitincrease in SBP:
For person with SBP = 113
![Page 12: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/12.jpg)
Example - Risk of outcome for10 unit change in blood pressure
For person with SBP = 110
)10exp()100110exp(
)*100exp()()*110exp()(
1
11
10
10
tt
)exp()()( 110 Xtt Suppose X= systolic bloodpressure (mmHg)
)110*exp()()( 10 tt
)100*exp()()( 10 tt
Relative risk of 10 unitincrease in SBP:
For person with SBP = 100
![Page 13: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/13.jpg)
Proportional hazards regression model
...)exp()()( 3322110 XXXtt
Above is model with multiple predictors
This allows to use the “usual” regression techniques;
• adjust for confounding
• model interactions
• relate non-linear terms to survival
![Page 14: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/14.jpg)
Why “proportional hazards”?
Ratio of hazards measures relative risk
If we assume relative risk is constant over time…
The hazards are proportional (does not depend on t
RR(t) (t) for exposed
(t) for unexposed
(t) for exposed(t) for unexposed
c
(t) for exposed = c * (t) for unexposed
![Page 15: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/15.jpg)
Parameter estimation
• How do we come up with estimates for i?
• Can’t use least squares since outcome is not continuous
• Maximum partial-likelihood– Given our data, what are the values of i that are most
likely?
• See page 392 of Le for details
![Page 16: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/16.jpg)
Inference for proportional hazards regression
• Collect data, choose model, estimate is
• Describe hazard ratios, exp(i), in statistical terms. – How confident are we of our estimate?– Is the hazard ratio is different from one due to
chance?
![Page 17: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/17.jpg)
Confidence Intervals for proportional hazards regression coefficients
• General form of 95% CI: Estimate ± 1.96*SE– Bi estimate, provided by SAS– SE is complicated, provided by SAS
• Related to variability of our data and sample size
• Equivalent to a hypothesis test; reject Ho: i = 0 at alpha = 0.05
![Page 18: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/18.jpg)
95% Confidence Intervals for the relative risk (hazard ratio)
• Based on transforming the 95% CI for the hazard ratio
• Supplied automatically by SAS
“We have a statistically significant association between the predictor and the outcome controlling for all other covariates”
• Equivalent to a hypothesis test; reject Ho: RR = 1 at alpha = 0.05 (Ha: RR1)
),( 96.196.1 SEiSE ee i
![Page 19: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/19.jpg)
Hypothesis test for individual PH regression coefficient
• Null and alternative hypotheses– Ho : Bi = 0, Ha: Bi 0
• Test statistic and p-values supplied by SAS
• If p<0.05, “there is a statistically significant association between the predictor and outcome variable controlling for all other covariates” at alpha = 0.05
• When X is binary, identical results as log-rank test
![Page 20: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/20.jpg)
Hypothesis test for all coefficients• Null and alternative hypotheses– Ho : all Bi = 0, Ha: not all Bi 0
• Several test statistics, each supplied by SAS– Likelihood ratio, score, Wald
• p-values are supplied by SAS
• If p<0.05, “there is a statistically significant association between the predictors and outcome at alpha = 0.05”
![Page 21: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/21.jpg)
Example survival analysis• Veteran’s Administration lung cancer data
• 137 Males with inoperable lung cancer
• Randomized to standard or new chemo therapy
• Primary endpoint; time to death
• 9 observations censored– 9 patients survived for length of study
![Page 22: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/22.jpg)
Example - VA Lung Cancer variablesSurvTime - time to death or study end
death - 1 if died, 0 if censored
treatment - new or standard treatment1 = new, 0 = standard
celltype - type of canceradeno, squamous, small cell ,large cell
kps - general health measure (0-100)
diagtime - time between diagnosis and study entry
age - age at entry
prior - prior treatment, 1 = yes, 0 = no
![Page 23: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/23.jpg)
PROC PHREGPROC PHREG DATA = vet; MODEL SurvTime*death(0) = treatment; RUN;
• Fit proportional hazards model with time to death as outcome
• “death(0)”; observations with death variable = 0 are censored
– death = 1 means an event occurred
• Look at effect of new vs. standard treatment on mortality
Same as LIFETEST
![Page 24: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/24.jpg)
PROC PHREG OutputSummary of the Number of Event and Censored Values Percent Total Event Censored Censored
137 128 9 6.57
Analysis of Maximum Likelihood Estimates Parameter Standard HazardVariable DF Estimate Error Chi-Square Pr > ChiSq Ratio
newtrt 1 b1 0.01633 0.18065 0.0082 0.9280 1.016
Relative risk of death for new vs. standard treatment
P-value for test of regression coefficient
(hazard ratio)
exp(b1)
![Page 25: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/25.jpg)
Logistic Regression VersusCox Regression
• If event rate is small will likely get similar results for betas
• If event rate is high could get quite different results (almost everyone has event)
![Page 26: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/26.jpg)
Interactions and non-linear terms
• Regression allows for us to model effects of predictors in different ways
• Can add quadratic terms and interactions, just like in linear and logistic regression
• Similar issues with testing coefficients– Calculation needed to get appropriate relative
risks from parameter estimates
![Page 27: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/27.jpg)
Interactions and non-linear terms PROC PHREG DATA = heart; MODEL time*status(0) = trt prior trt_prior; RUN;
• Fit interaction between new treatment and prior – trt_prior variable defined in DATA STEP as “trt_prior = trt*prior”
PROC PHREG DATA = vet; MODEL time*status(0) = age age2; RUN;
• Fit a quadratic term for effect of age– age2 variable defined in DATA STEP as “age2 = age*age”
![Page 28: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/28.jpg)
Interactions in proportional hazards regression
Effect modification leads to complications in interpreting parameter estimates (the Bis)
Example;
(t) = 0(t) exp(B1newtrtrt + B2prior + B3trt*prior)
What is relative risk for those on the new treatment vs. the standard treatment?
How does prior treatment effect this RR?
![Page 29: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/29.jpg)
Interactions in proportional hazards regression
(t) = 0(t) exp(B1newtrt + B2prior + B3newtrt*prior)
For those with no prior treatment (prior = 0);
newtrt)exp()( 0)*newtrt 0* newtrt exp()( )(
10
3210
ttt
Relative risk for new vs. standard treatment;
)exp( 0)*exp()( 1)*exp()(
standardfor hazardnewfor hazard
110
10
tt
![Page 30: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/30.jpg)
Interactions in proportional hazards regression
(t) = 0(t) exp(Bo + B1newtrt + B2prior + B3newtrt*prior)
For those with prior treatment (prior = 1);
newtrt)newtrt exp()( 1)*newtrt 1* newtrt exp()( )(
3120
3210
ttt
Relative risk for new vs. standard treatment;
)exp( 0)*0*exp()( 1)*1*exp()(
standardfor hazardnewfor hazard
313120
3120
tt
![Page 31: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/31.jpg)
Tied event times• Observations with outcome on same time are
called “ties”• They complicate the calculations• Different methods for dealing with them
– Breslow, discrete, Efron, exact
• Not many ties; all methods similar– Exact best, but computer intensive– Efron probably the next best– Breslow default in SAS
![Page 32: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/32.jpg)
Methods for ties in PROC PHREG
PROC PHREG DATA = vet; MODEL time*status(0) = newtrt / ties = exact;
RUN;
• Use “exact” method for handling ties
• Other options “efron”, “breslow”, and “discrete”
![Page 33: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/33.jpg)
Ties in PROC PHREGVA lung cancer data, effect of treatment, different methods for
ties
Analysis of Maximum Likelihood Estimates Parameter Standard HazardMethod Estimate Error Chi-Square Pr > ChiSq Ratio
Exact 0.01775 0.18066 0.0097 0.9217 1.018
Efron 0.01775 0.18066 0.0097 0.9217 1.018
Breslow 0.01633 0.18065 0.0082 0.9280 1.016
Discrete 0.01645 0.18129 0.0082 0.9277 1.017
For this data, exact and Efron methods are identical
![Page 34: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/34.jpg)
Complications with PH regressionSimilar issues arise that we saw in linear and logistic
regression; assumptions may not hold
• Independence of observations?– Correlation can cause problems; use other methods
• Linearity of terms?– Can check for quadratic term, transform
• Correlated predictor variables?– Causes interpretation problems for individual parameter
estimates
![Page 35: April 20](https://reader035.fdocuments.net/reader035/viewer/2022070502/56812d43550346895d92489f/html5/thumbnails/35.jpg)
Complications with PH regressionUnique issue; proportional hazards assumption
One example of violation, crossing survival curves
Remedies;
• Stratify time scale so PH assumption holds over intervals, fit model to each interval
• Transformation of time variable (example; log)
• Use other models