1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of...

37
1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin Madison [email protected] 2010 Joint Statistical Meetings; Vancouver

Transcript of 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of...

Page 1: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

1

Who Needs Bayesian Phase I Trials?

Rick ChappellProfessor,Departments of Statistics and of Biostatistics

and Medical InformaticsUniversity of Wisconsin [email protected]

2010 Joint Statistical Meetings; Vancouver

Page 2: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

My thanks to

The International Indian Statistical Association, Biometrics Section and

Professor Debajyoti Sinha, Florida State University

for inviting me to speak;

An unnamed CMO of an unnamed pharmaceutical company;

The audience.2

Page 3: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

3

Outline

I. Goal of Phase I Trials in Cancer & Other Serious Diseases

II. How we need Bayesian Phase I Trials

III. Options for Dose Escalation

IV. Example, with Design and Simulations

V. Conclusions

Page 4: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

4

I. Goal of Phase I Trials in Cancer & Other Serious Diseases

• Goal first delineated by Schneiderman (1967) as estimation of dose yielding a fixed percent (often 33%) of dose-limiting toxicities (DLTs) but not lower. This is called the Maximum Tolerable Dose (MTD).

• Phase I Trials are Usually Outcome-adaptive

Page 5: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

5

Phases of a Clinical Trial

• Preclinical: Biochemical and animal studies

• Phase I, Cancer - estimate toxicity rates using few typically very sick subjects.

• Phase II - determines a therapy’s potential efficacy usually using a few very sick patients.

• Phase III – a large randomized controlled, possibly blinded, possibly multicenter, experiment

• Phase IV - a controlled trial of an approved treatment with long-term followup of safety and efficacy.

Page 6: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

6

Schematic of Phase I Trial

Dose

% T

oxic

ity

0

33

100

d1 d2 . . . mtd

Page 7: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

7

Allowable DLT rate can be lower than 33% but rarely higher

Dose

% T

oxic

ity

0

20

100

d1 d2 .. mtd

Page 8: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

8

How We Need Bayesian Phase I Trials

Rick ChappellProfessor,Departments of Statistics and of Biostatistics

and Medical InformaticsUniversity of Wisconsin Madison

2010 Joint Statistical Meetings; Vancouver

Page 9: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

9

II. How we need Bayesian Phase I Trials

A. In the Design- How will we set the doses? Guess at a series of levels starting at a

low toxicity rate and moving to higher rates.- Isn’t this a prior?- Formal prior elicitation methods can be very helpful.

B. In Dose Escalation

C. In the Analysis- The data from a single phase I trial usually provide only a little information. - Information is usually available from other sources.- Model-based smoothing (e.g., for monotonicity) often useful.- Computation using conjugate priors is simple.- Multiple analyses can (should) be done.- Storer (1989) recommended that analysis be separated from design.

Page 10: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

10

III. Options for Dose Escalation

A. Algorithmic designs (Shih & Lin, 2006, in Chevret’s text).- These could be defined as designs where (de-)escalation depends only on outcomes in the most recent groups of patients.- However, some “permanent” changes, such as adjustments in cohort sizes, have been included in “algorithms”.- Most intuitive definition is “A (de-)escalation system which a human being can keep in his head.”

B. Bayesian and other model-based designs.

Page 11: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

11

A. Simple Algorithmic “Up-and-Down” Designs

– Usually small cohorts of patients (1 – 5);

– Dose levels increased and/or decreased until a predetermined sample size is reached;

– Designed to estimate the MTD as 33%-ile with cohorts of size 3, 20%-ile with cohorts of size 5, etc.;

– Not flexible (can also spend a lot of patients at low-toxicity doses), but can easily be modified to speed up via adaptive cohort sizes;

– Other sources of information can be used.

Page 12: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

12

Design A (“Traditional”): Groups of three patients are treated. Escalation occurs if no toxicity is observed in all three: otherwise, an additional three patients are treated at the same dose level. If only one of six has toxicity, escalation again continues; otherwise, the trial stops.

Design B: Single patients are treated. The next patient is treated at the next lower dose level if a toxic response is observed, otherwise at the next higher dose level until sample size is reached.

Storer’s (1989) phase I up-and-downclassification

Page 13: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

Design D: Groups of three patients are treated. Escalation if no toxicity is seen and de-escalation if more than one patient has toxicity. If one patient has toxicity, next group of three is treated at same dose level; repeat until sample size is reached. Similar to traditional except can go down.

( Design C: Similar to design D, except that the rule applies to the preceding three patients at any point, instead of using discrete batches of three. )

- Larger cohorts can be used to estimate the MTD as the dose with a 1/4 = 25%, 1/5 = 20% or 1/6 = 17% rates.

- Designs can be combined, such as “BD”.

Page 14: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

14

1. Prior guess is made as to the dose-response (toxicity) curve;

2. First patient is assigned to the prior MTD;

3. After the patient is fully followed, his or her outcome is used to update the prior curve;

4. Next patient is assigned to the new “posterior” MTD;

5. Repeated until sample size is reached.

6. Final posterior MTD is estimate.

B. Bayesian Designs - The Continual Reassessment Method (CRM; O’Quigley, et al., 1990)

Page 15: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

15

C. Operating Characteristics are Paramount

1. “Operating Characteristics”: How a design escalates the dose and changes cohort size, and how it stops.

2. Clinicians want to know how treatment decisions are made – they are fearful of the black box.

3. Operating Characteristics of Bayesian / other model-based designs can be described by example and simulation.

4. One cannot rely on clinicians' priors to produce [Bayesian] designs with acceptable operating characteristics. In practice, the statistician must at least specify prior uncertainty.

Page 16: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

16

“4. One cannot rely on clinicians' priors to produce [Bayesian] designs with acceptable operating characteristics. In practice, the statistician must at least specify prior uncertainty.”

Sometimes this requirement leads to absurdities. Ji, Li, and Bekele (2007) proposed a prior for a Bayesian method “that can be easily understood and implemented by nonstatisticians.” It is simple, and leads to a practical and sensible algorithm.

Page 17: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

17

“4. One cannot rely on clinicians' priors to produce [Bayesian] designs with acceptable operating characteristics. In practice, the statistician must at least specify prior uncertainty.”

Sometimes this requirement leads to absurdities. Ji, Li, and Bekele (2007) proposed a prior for a Bayesian method “that can be easily understood and implemented by nonstatisticians.” It is simple, and leads to a practical and sensible algorithm.

But the recommended prior was Beta(.005, .005):

Page 18: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

18

D. How ethical operating characteristics subvert current Bayesian designs

1. The “Continual Reassessment Method” [CRM; O’Quigley et al., 1990] attempts to assign subjects to the MTD and so efficiently estimate it.

2. The CRM does not directly address the following ethical/logistical issues in Phase I trials:

a) The need for a starting dose lower than the prior MTD;b) The need for a maximum increase in dose per cohort;c) A desire to let cohort sizes vary; d) A desire to allow data on sub-dose-limiting toxicities to inform

escalation & cohort size choices.

3. The CRM can be modified;Or one can incorporate elements of Escalation with Overdose Control (EWOC), which constrains predicted overdose rates;Or we can combine the two.

Page 19: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

19

3. The CRM can be modified (cont.);The most common way to modify the CRM is to overlay algorithms (for initial dose, maximum increase, etc.) on top of it. But this also has disadvantages.

Cheung (2005) defined coherence as “An escalation for a newpatient is said to be coherent only when the previous patient does notshow sign of toxicity. Likewise, a de-escalation is coherent only when atoxic outcome has just been seen.”

He shows the CRM, EWOC, & all unmodified likelihood-based methods to be coherent.

But many of the simple common modifications are not.

Page 20: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

20

A Practical, Easy, Understandable, and Scientifically Sound Algorithmic Design for a Phase I Trials

Rick ChappellProfessor,Departments of Statistics and of Biostatistics

and Medical InformaticsUniversity of Wisconsin [email protected]

2010 Joint Statistical Meetings; Vancouver

Page 21: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

21

IV. Example, with Design and SimulationsA. Design Considerations

1. Agent is a radiopharmaceutical available in doses of 6.25;

2. Concern centers on toxicity to bone marrow and also kidney, bladder, and liver;

3. FDA requires that starting dose be 12.5 (units unspecified);

4. Investigators consider this to be very low and want rapid initial escalation;

5. Information is available on “sub-dose-limiting” toxicities – e.g., 999 neutrophils/mm3 is grade 3 neutropenia but 1,001 is grade 2.

Page 22: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

22

4. Investigators consider a 12.5 starting dose to be very low and want rapid initial escalation;

5. Information is available on “sub-dose-limiting” toxicities – e.g., 999 neutrophils/mm3 is grade 3 neutropenia but 1,001 is grade 2.

How to address these requirements?

- Escalate if 0 DLTs in a cohort;- De-escalate if ≥ 2 DLTs in a cohort;- Start with small cohorts (size 2), escalate dose in increments of

12.5 , maintain until higher risk suspected;- Permanently expand cohorts to size 5 when excessive risk seen,

defined as ≥ 1 DLT and/or 2 sub-DLTs;- Permanently halve dose increment to 6.25 when ≥ 2 DLTs and/or

≥ 3 sub-DLTs seen;- Maintain dose if 1 DLT in a cohort;- Stop when ≥ 20 subjects reached (max 24);- “Highest dose with ≤ 20% DLTs recommended for phase II”.

Page 23: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

23

B. Designs under investigation

Design TypeCohort

Size#

DLTs# Sub-DLTs

ActionSample

SizePhase 2 Dose

Design 1

Traditional 3+3

3 or 6

0/3

1/3

1/6

>1/6

>1/3

Any

Any

Any

Any

Any

Escalate to next cohort

Expand cohort to 6 pts at same dose

Escalate to next cohort

Stop escalation – MTD defined

Stop escalation – MTD defined

Random

One below the stopping dose

Page 24: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

24

B. Designs under investigation

Design TypeCohort

Size#

DLTs# Sub-DLTs

ActionSample

SizePhase 2 Dose

Design 2

“Traditional” 5+5

5 or 10

0/5

1/5

1/10

>1/10

>1/5

Any

Any

Any

Any

Any

Escalate to next cohort

Expand cohort to 10 pts at same dose

Escalate to next cohort

Stop escalation – MTD defined

Stop escalation – MTD defined

RandomHighest dosewith 20%

DLTs

Page 25: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

Design TypeCohort

Size#

DLTs# Sub-DLTs

ActionSample

SizePhase 2 Dose

Design 3

Up-and-down, cohort size adaptive to two sub-DLTs; increment size adaptive

2 then 5

0/2

0/2

1/2

2/2

0/5

0/5

1/5

>1/5

0, 1/2

2/2

Any

Any

2

> 2

2

> 2

Any

Escalate to next cohort of 2 subjects

Expand cohort, 5 subjects at same dose; all future cohort sizes = 5

Expand cohort, 5 subjects at same dose; all future cohort sizes = 5

De-escalate; all future cohort sizes = 5 and increments = 6.25

Escalate to next cohort

Escalate to next cohort; all future increments = 6.25 Expand cohort, 10 subjects at same dose

Expand cohort, 10 subjects at same dose; all future increments = 6.25

De-escalate; all future increments = 6.25

~20-24Highest dose with 20%

DLTs

Page 26: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

26

B. (cont.) Other designs considered

1. 3+3, up-and-down until ≥ 20 patients reached:Slow, focuses on too-high doses, ignores sub-DLTs.

2. 5+5, up-and-down until ≥ 20 patients reached:Very slow, ignores sub-DLTs.

3. Like Design 3 but cohort size expands 2 → 5 with a single DLT instead of two:Too slow, stays at low doses, doesn't identify phase 2 dose well.

4. Like Design 3 but cohort size expands with a single DLT instead of 2 and continues to accrue subjects until ≥ 2 toxicities are observed in a single cohort:Can have very large sample size and long duration.

Page 27: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

27

C. Simulations

1. Simulations compared:Design 1: Traditional 3+3 escalation only;Design 2: “Traditional 5+5” escalation only;Design 3: Up-and-down, cohort size adaptive, dose increment

size adaptive.

2. Two scenarios used, High Toxicity and Low Toxicity.

3. Comparisons made in terms of Accuracy, Toxicity, Sample size, and Duration.

4. 10,000 simulations in each case, all figures accurate to # of digits reported.5+5, up-and-down until ≥ 20 patients reached:Very slow, ignores sub-DLTs.

Page 28: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

28

Simulation: High-Toxicity Scenario

Page 29: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

29

Simulation Results: High-Toxicity ScenarioPrecision

Page 30: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

30

Simulation Results: High-Toxicity ScenarioToxicities

“Sub-therapeutic”: at least 12.5 below true phase II dose

Page 31: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

31

Simulation Results: High-Toxicity ScenarioSample Size / Duration

Page 32: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

32

Simulation: Low-Toxicity Scenario

Page 33: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

33

Simulation Results: Low-Toxicity ScenarioPrecision

Page 34: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

34

Simulation Results: Low-Toxicity ScenarioToxicities

“Sub-therapeutic”: at least 12.5 below true phase II dose

Page 35: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

35

Simulation Results: Low-Toxicity ScenarioSample Size / Duration

Page 36: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

36

V. Conclusions:

A. A very simple algorithmic phase I design was presented which had good operating characteristics and produced good estimates of the phase II dose.

B. The protocol of which this design formed a part was recently approved by the FDA with no comments whatsoever.

C. The Chief Medical Officer with authority over the trial was satisfied:

“We'd be delighted for you to present the statistics publically. I hopethat the title is something along the line of " An incredibly brilliantand clinically perfect clinical trial design".....!

Page 37: 1 Who Needs Bayesian Phase I Trials? Rick Chappell Professor, Departments of Statistics and of Biostatistics and Medical Informatics University of Wisconsin.

37

An Incredibly Brilliant and Clinically Perfect Clinical Trial Design

Rick ChappellProfessor,Departments of Statistics and of Biostatistics

and Medical InformaticsUniversity of Wisconsin [email protected]

2010 Joint Statistical Meetings; Vancouver