An Analytic Road Map for Incomplete Longitudinal Clinical Trial Data Craig Mallinckrodt Graybill...

An Analytic Road Map for Incomplete Longitudinal Clinical Trial Data

Craig Mallinckrodt

Graybill ConferenceJune 12, 2008Fort Collins, CO

PhRMA Expert Team on Missing DataPeter Lane GSKCraig Mallinckrodt LillyJames Mancuso PfizerYahong Peng MerckDan Schnell P&G

Geert Molenberghs

Ray Carroll

Many Lilly colleagues

Acknowledgements

Why do we care What do we know

Theory

Application

What we should do

Outline

Every hour we expect

195 deaths due to cancer 1950 new diagnoses of anxiety disorders 15 new diagnoses of schizophrenia 30 osteoporosis related hip fractures1500 surgeries requiring pain treatment 70 deaths due to cardiovascular disease

Alan Breier – Nov 2006

Medical Needs

Need for More Effective Medicines

Therapeutic Area Efficacy rate(%)

Alzheimer’s 30Analgesic’s (Cox-2) 80Asthma 60Cardiac Arrhythmias 60Depression (SSRI) 62Diabetes 57HCV 47Incontinence 40Migraine (acute) 52Migraine (prophylaxis) 50Oncology 25Osteoporosis 48Rheumatoid arthritis 50Schizophrenia 60

Trends in Molecular Medicine 7(5):201-204, 2001

There is an efficacy gap in terms of customer

expectations andthe drugs we

prescribe

R&D Productivity Decreasing

Source: PhRMA, FDA, Lehman Brothers; [Dr. Robert Ruffolo]

$0

$5

$10

$15

$20

$25

$30

$35

$40

$45

$50

0

20

40

60

80

100

120

140

160

180

200

Annual NMEApprovals

Industry R&D Expense

($ Billions)

R&D InvestmentNME & Biologics Approvals


Theory

Application

What we should do

Outline

No universally best method for analyzing longitudinal data

Analysis must be tailored to the specific situation at hand

Consider the hypothesis to be tested, desired attributes of the analysis, and the characteristics of the data

Starting Point

MCAR - missing completely at random

• Conditional on the independent variables in the model, neither observed or unobserved outcomes of the dependent variable explain dropout

MAR - missing at random

• Conditional on the independent variables in the model, observed outcomes of the dependent variable explain dropout, but unobserved outcomes do not

Missing Data Mechanisms

MNAR - missing not at random

• Conditional on the independent variables in the model and the observed outcomes of the dependent variable, the unobserved outcomes of the dependent variable explain dropout

Missing Data Mechanisms

Missing data mechanism is a characteristic of the data AND the model

Differential dropout by treatment indicates covariate dependence, not mechanism

Mechanism can vary from one outcome to another in the same dataset

Consequences

Missing Data in Clinical Trials

• Efficacy data in clinical trials are seldom MCAR because the observed outcomes typically influence dropout (DC for lack of efficacy)

• Trials are designed to observe all the relevant information, which minimizes MNAR data

• Hence in the highly controlled scenario of clinical trials missing data may be mostly MAR

• MNAR can never be ruled out

Implications

• All analyses rely on missing data assumptions

• Any options in the trial design to minimize dropout should be strongly considered

Assumptions

• ANOVA with BOCF / LOCF assumes

• MCAR & constant profile

• MAR always more plausible than MCAR

• MAR methods will be valid in every case where BOCF/ LOCF is valid

• BOCF / LOCF will not be valid in every scenario where MAR methods are valid

1. Arch. Gen. Psych. 50: 739-750.

2. Arch. Gen. Psych. 61: 310-317.

3. Biol. Psychiatry. 53: 754-760.

4. Biol. Psychiatry. 59: 1001-1005.

5. Biometrics. 52: 1324-1333.

6. Biometrics. 57: 43-50.

7. Biostatistics. 5:445-464.

8. BMC Psychiatry. 4: 26-31.

9. Clinical Trials. 1: 477–489.

10. Computational Statistics and Data Analysis. 37: 93-113.

11. Drug Information J. 35: 1215-1225.

12. J. Biopharm. Stat. 8: 545-563.

13. J. BioPharm. Stat. 11: 9-21.

Research Showing MAR Is Useful And / Or Better Than LOCF

14. J. Biopharm. Stat. 12: 207-212.

15. J. Biopharm. Stat. 13:179-190.

16. J. Biopharm. Stat. 16: 365-384.

17. Neuropsychopharmacol. 6: 39-48.

18. Obesity Reviews. 4:175-184.

19. Pharmaceutical Statistics. 3:161-170.



22. Pharmaceutical Statistics (2007 early view) DOI: 10.1002/pst.267

23. Statist. Med. 11: 2043-2061.

24. Statist. Med. 14: 1913-1925.

25. Statist. Med. 22: 2429-2441.

Research Showing MAR Is Useful And / Or Better Than LOCF

Why Is LOCF Still Popular

• LOCF perceived to be conservative

• Concern over how MAR methods perform under MNAR

• More explicit modeling choices needed in MAR methods

• LOCF thought to measure something more valuable

Conservatism Of LOCF

• Bias in LOCF has been shown analytically and empirically to be influenced by many factors

• Direction and magnitude of bias highly situation dependent and difficult to anticipate

• Summary of recent NDA showed LOCF yielded lower p value than MMRM in 34% of analyses

Biostatistics. 5:445-464.

BMC Psychiatry. 4: 26-31.

Performance Of MAR With MNAR Data

• Studies showing MAR methods provide better control of Type I and Type II error than LOCF

Arch. Gen. Psych. 61: 310-317.Clinical Trials. 1: 477–489.Drug Information J. 35: 1215-1225.J. BioPharm. Stat. 11: 9-21.J. Biopharm. Stat. 12: 207-212.Pharmaceutical Statistics (2007 early view) DOI: 10.1002/pst.267JSM Proceedings. 2006. pp. 668-676. 2006.

More Explicit Modeling Choices Needed

• MMRM 6 lines of code, LOCF 5 lines of code

• Convergence and choice of correlation not difficult in MMRM

Clinical Trials. 1: 477–489.

LOCF Thought To Measure Something More Valuable

• LOCF is “effectiveness”, MAR is “efficacy”

• LOCF is what is actually observed

• MAR is what is estimated to happen if patients stayed on study

• Non longitudinal interpretation of LOCF

• LO, LAV

• Dropout is an outcome

• An LOCF result can be interpreted as an index of rate of change times duration on study drug - a composite of efficacy, safety, tolerability

• An index with unknown weightings

• The same estimate of mean change via LOCF can imply different clinical profiles

• The LOCF penalty is not necessarily proportional to the risk

• Result can be manipulated by design

Non-longitudinal Interpretation Of LOCF

Proportion of completers

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Study

Drug PLA

Completion Rates in Depression Trials

Drug Placebo

Placebo Dropout Rates Influenced by Design In a Recent MDD NDA

% % Trial DC-AE Dropout

1 4.3 34.32 6.7 41.3 3 3.3 31 4 9.0 42 5 3.2 19 6 1.0 9 7 2.5 29.5 8 4.3 35.3

Lillytrials.com

Trials 5 and 6 had titration dosing and extension phases


Theory

Application What we should do

Outline

Modeling Philosophies

• Restrictive modeling

• Simple models with few independent variables

• Often include only the design factors of the experiment

Psychological Methods, 6, 330-351.

Modeling Philosophies

• Inclusive modeling

• Auxiliary variables included to improve performance of the missing data procedure – expand the scope of MAR

• Baseline covariates

• Time varying post-baseline covariates: Must be careful to not dilute treatment effect. Can be dangerous to include time varying postbaseline covariates in analysis model, may be better to use via imputation (or propensity scoring or weighted analyses)

Psychological Methods, 6, 330-351.

Rationale For Inclusive Modeling

• MAR: conditional on the dependent and independent variables in the analysis, unobserved values of the dependent variable are independent of dropout

• Hence adding more variables that explain dropout can make missingness MAR that would otherwise be MNAR

Analytic Road Map

• MAR with restrictive modeling as primary

• Use MAR with inclusive modeling and MNAR methods as sensitivity analyses

• Use local influence to investigate impact ofinfluential patients

Pharmaceutical Statistics. 4: 267–285.J. Biopharm. Stat. 16: 365-384.

Why Not MNAR As Primary

• Can do better than MAR only via assumptions

• Assumptions untestable

• Sensitivity to violations of assumptions and model misspecification more severe in MNAR

• MNAR methods lack some desired attributes of a primary analysis in a confirmatory trial

• No standard software

• Complex

Implementing The Road Map: Example From A Depression Trial

259 patients, randomized 1:1 ratio to drug and placebo

Response: Change of HAMD17 score from baseline

6 post-baseline visits (Weeks 1,2,3,5,7,9)

Primary objective: test the difference of mean change in HAMD17 total score between drug and placebo at the endpoint

Primary analysis: LB-MEM

Patient Disposition

Drug Placebo

Protocol complete 60.9% 64.7%

Adverse event 12.5% 4.3%

Lack of efficacy 5.5% 13.7%

Differential rates, timing, and/or reasons for dropout do not necessarily distinguish between MCAR, MAR, MNAR

proc mixed; class subject treatment time site; model Y = baseline treatment time site

treatment*time ; repeated time / sub = subject type = un; lsmeans treatment*time / cl diff; run;

This is a full multivariate model, with unstructured modeling of time and correlation. More parsimonious approaches may be useful in other scenarios

Treatment contrast 2.17, p = .024

Primary Analysis: LB-MEM

Inclusive Modeling in MI: Including

Auxiliary AE Data

• Imputation Models• *Yih = µ +1 Yi1 +…+ h-1 Yi(h-1) + ih

• Yih = µ + 1 Yi1 +…+ h-1 Yi(h-1) + 1 AEi1 +…+ h-1 AEi(h-1) + ih

• Yih= µ + 1 Yi1 +…+ h-1 Yi(h-1) + 1 AEi1 +…+ h-1 AEi(h-1)

+11 (Yi1 *AEi1 ) + …+i(h-1) (Yi(h-1) * AEi(h-1) ) + ih

• Analysis Model

• MMRM as previously described

Result

• MI results were not sensitive to the different imputation models

Endpoint contrastMMRM 2.2MI Y+AE 2.3MI Y+AE+Y*AE 2.1

• Including AE data might be important in other scenarios. Many ways to define AE

MNAR Modeling

• Implement a selection model– Had to simplify model: modeled time as linear + quadratic, and

used ar(1) correlation

• Compare results from assuming MAR, MNAR

• Also obtain local influence to assess impact of influential patients on treatment contrasts and non-random dropout

Selection Model Results

MAR MNAR

Contrast (p-value)

2.20 (0.0179)

2.18 (0.0177)

Missingness Parameters

Estimate SE

0-2.46 0.27

1 0.11 0.05

2-0.08 0.06

Local Influence: Influential Patients

Patient

Ci

0 50 100 150 200 250

02

46

81

01

2

#6

#30

#50 #154

#179

iC

Individual Profiles with Influential Patients Highlighted

placebo

Weeks

ch

an

ge

in

HA

MD

17

2 4 6 8

-30

-20

-10

01

0

Duloxetine

Weeks

ch

an

ge

in

HA

MD

17

2 4 6 8

-30

-20

-10

01

0

# 30

Investigating The Influential Patients

The most influential patient was #30, a drug-treated patient that had the unusual profile of a big improvement but dropped out at week 1

This patient was in his/her first MDD episode when s/he was enrolled

This patient dropped out based on his/her own decision claiming that the MDD was caused by high carbon monoxide level in his/her house

This patient was of dubious value for assessing the efficacy of the drug

Selection Model: Influential Patients RemovedRemoved Subjects ( 30, 191) (6, 30, 50, 154, 179,

191)

MAR

MNAR MAR MNAR

Diff. at endpoint(p-value)

2.07 (0.0241)

2.07 (0.0237)

2.40 (0.0082)

2.40 (0.0083)

Missingness Parameters

0-2.22 (0.14) -2.44 (0.27) -2.23 (0.15) -2.47 (0.28)

10.05 (0.02) 0.11 (0.05) -0.05 (0.02) 0.11 (0.06)

2-0.07 (0.06) -0.08 (0.06)

Implications

Comforting that no subjects had a huge influence on results. Impact bigger if it were a smaller trial

Similar to other depression trials we have investigated, results not influenced by MNAR data

We can be confident in the primary result

Discussion

MAR with restrictive modeling was a reasonable choice for the primary analysis

MAR with inclusive modeling and MNAR was useful in assessing sensitivity

Sensitivity analyses promote the appropriate level of confidence in the primary result and lead us to an alternative analysis in which we can have the greatest possible confidence

• Inclusive modeling has been under utilized

• More research to understand dropout would be useful

• Did not discuss pros and cons of various ways to implement inclusive modeling. Use the one you know? Be careful to not dilute treatment

• The road map for analyses used in the example data is specific to that scenario

Opinions

• No universally best method for analyzing longitudinal data

• Analysis must be tailored to the specific situation at hand

• Considering the missingness mechanism and the modeling philosophy provides the framework in which to choose an appropriate primary analysis and appropriate sensitivity analyses

Conclusions

Conclusion

• LOCF and BOCF are not acceptable choices for the primary analysis

• MAR is a reasonable choice for the primary analysis in the highly controlled situation of confirmatory clinical trials

• MNAR can never be ruled out

• Sensitivity analyses and efforts to understand

and lower rates of dropout are essential

An Analytic Road Map for Incomplete Longitudinal Clinical Trial Data Craig Mallinckrodt Graybill...

Documents

Transcript of An Analytic Road Map for Incomplete Longitudinal Clinical Trial Data Craig Mallinckrodt Graybill...