Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest...

44
Multiplicity Correction in a Group-Sequential Oncology Trial Including Subgroup Analyses and Multiple Primary Endpoints: Case Study European Statistical Meeting on Oncology and Survival Analysis Brussels, 17 November 2017 Agnes Balogh Bristol-Myers Squibb

Transcript of Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest...

Page 1: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Multiplicity Correction in a Group-Sequential Oncology Trial Including Subgroup Analyses and

Multiple Primary Endpoints Case Study

European Statistical Meeting on Oncology and Survival Analysis

Brussels 17 November 2017

Agnes Balogh

Bristol-Myers Squibb

Disclaimer

The case study presented here is an actual BMS study However

bull Study number disease setting drug names will not be provided

bull Numerical parameters of the statistical design are not the actual ones (rather hypothetical)

2

Outline

1 Study Design

2 Testing strategy

3 Concluding remarks

3

1 Study Design

4

Hypothetical Study Design

Control Experimental 1 Experimental 2

Population Metastatic Oncology

Randomize 111

Primary Population BM+ subpopulation

OS PFS OS PFS

Secondary Population All Randomized

OS PFS OS PFS

α = 001 α = 0015 α = 001 α = 0015

About 40 of the population prognostic and predictive

N = 1200

5

Group-Sequential Nature of the Design

First patient randomized

Enrolment

Time 1 Time 2

22months 29months 44months

PFS

OS (1)

PFS in BM+ is mature

OS in BM+ is mature

6

(1) Lan-DeMets α spending function used for the two BM+ OS endpoints OrsquoBrien and Fleming for the two All Randomized OS endpoints Pocock type of boundary

OS PFS Underlying Survival Curves

7

Legend

- - - - BM+ treatment

_______ All treatment

- - - - BM+ control

_______ All control

OS BM+ vs All Randomized PFS BM+ vs All Randomized

PFS

Rat

es

of Events in All 2 Arms vs in Control Arm Analysis will be conducted when

bull Time 1 PFS in BM+ is mature

bull Time 2 OS in BM+ is mature

Question should maturity be defined based on the of events in

1 All 3 arms in total OR

2 The control arm (requires independent group)

Statistical properties were investigated under 4 scenarios

8

Scenario Details

Protocol Assumptions

1 OS PFS Exp2 no treatment effect

OS and PFS no treatment effect in Experimental2 arm

2 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 30 in Experimental2 arm

3 Drop-out in all 3 arms 5 drop-out rate in all 3 arms

of Events in Total vs Control Simulation Results

Protocol assumptions

2 Exp2 better

1 Exp2 no treatment effect

3 5 drop-out in all 3 arms

9

Top line (dotted) is based on of events in control arm Bottom line (solid) is based on of events in all 3 arms

of Events in Total vs Control Statistical power bull For all 8 endpoints across all scenarios power is never lower if timing is later bull Largest difference for Exp1 in scenario1 (OS BM+ 11 OS All 13 PFS BM+ 5)

25 30 35 40 45 50 55 60 65 70

Months from First Patient Randomized

Time of Analysis (90 CI) under Different Scenarios

3 Testing Strategy

A Base Case

B Graphical Approach Introduction

C Chosen testing strategy bull Illustration of Sequentially Rejective Procedure

bull Power Gain

D Alternative testing strategies

10

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 2: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Disclaimer

The case study presented here is an actual BMS study However

bull Study number disease setting drug names will not be provided

bull Numerical parameters of the statistical design are not the actual ones (rather hypothetical)

2

Outline

1 Study Design

2 Testing strategy

3 Concluding remarks

3

1 Study Design

4

Hypothetical Study Design

Control Experimental 1 Experimental 2

Population Metastatic Oncology

Randomize 111

Primary Population BM+ subpopulation

OS PFS OS PFS

Secondary Population All Randomized

OS PFS OS PFS

α = 001 α = 0015 α = 001 α = 0015

About 40 of the population prognostic and predictive

N = 1200

5

Group-Sequential Nature of the Design

First patient randomized

Enrolment

Time 1 Time 2

22months 29months 44months

PFS

OS (1)

PFS in BM+ is mature

OS in BM+ is mature

6

(1) Lan-DeMets α spending function used for the two BM+ OS endpoints OrsquoBrien and Fleming for the two All Randomized OS endpoints Pocock type of boundary

OS PFS Underlying Survival Curves

7

Legend

- - - - BM+ treatment

_______ All treatment

- - - - BM+ control

_______ All control

OS BM+ vs All Randomized PFS BM+ vs All Randomized

PFS

Rat

es

of Events in All 2 Arms vs in Control Arm Analysis will be conducted when

bull Time 1 PFS in BM+ is mature

bull Time 2 OS in BM+ is mature

Question should maturity be defined based on the of events in

1 All 3 arms in total OR

2 The control arm (requires independent group)

Statistical properties were investigated under 4 scenarios

8

Scenario Details

Protocol Assumptions

1 OS PFS Exp2 no treatment effect

OS and PFS no treatment effect in Experimental2 arm

2 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 30 in Experimental2 arm

3 Drop-out in all 3 arms 5 drop-out rate in all 3 arms

of Events in Total vs Control Simulation Results

Protocol assumptions

2 Exp2 better

1 Exp2 no treatment effect

3 5 drop-out in all 3 arms

9

Top line (dotted) is based on of events in control arm Bottom line (solid) is based on of events in all 3 arms

of Events in Total vs Control Statistical power bull For all 8 endpoints across all scenarios power is never lower if timing is later bull Largest difference for Exp1 in scenario1 (OS BM+ 11 OS All 13 PFS BM+ 5)

25 30 35 40 45 50 55 60 65 70

Months from First Patient Randomized

Time of Analysis (90 CI) under Different Scenarios

3 Testing Strategy

A Base Case

B Graphical Approach Introduction

C Chosen testing strategy bull Illustration of Sequentially Rejective Procedure

bull Power Gain

D Alternative testing strategies

10

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 3: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Outline

1 Study Design

2 Testing strategy

3 Concluding remarks

3

1 Study Design

4

Hypothetical Study Design

Control Experimental 1 Experimental 2

Population Metastatic Oncology

Randomize 111

Primary Population BM+ subpopulation

OS PFS OS PFS

Secondary Population All Randomized

OS PFS OS PFS

α = 001 α = 0015 α = 001 α = 0015

About 40 of the population prognostic and predictive

N = 1200

5

Group-Sequential Nature of the Design

First patient randomized

Enrolment

Time 1 Time 2

22months 29months 44months

PFS

OS (1)

PFS in BM+ is mature

OS in BM+ is mature

6

(1) Lan-DeMets α spending function used for the two BM+ OS endpoints OrsquoBrien and Fleming for the two All Randomized OS endpoints Pocock type of boundary

OS PFS Underlying Survival Curves

7

Legend

- - - - BM+ treatment

_______ All treatment

- - - - BM+ control

_______ All control

OS BM+ vs All Randomized PFS BM+ vs All Randomized

PFS

Rat

es

of Events in All 2 Arms vs in Control Arm Analysis will be conducted when

bull Time 1 PFS in BM+ is mature

bull Time 2 OS in BM+ is mature

Question should maturity be defined based on the of events in

1 All 3 arms in total OR

2 The control arm (requires independent group)

Statistical properties were investigated under 4 scenarios

8

Scenario Details

Protocol Assumptions

1 OS PFS Exp2 no treatment effect

OS and PFS no treatment effect in Experimental2 arm

2 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 30 in Experimental2 arm

3 Drop-out in all 3 arms 5 drop-out rate in all 3 arms

of Events in Total vs Control Simulation Results

Protocol assumptions

2 Exp2 better

1 Exp2 no treatment effect

3 5 drop-out in all 3 arms

9

Top line (dotted) is based on of events in control arm Bottom line (solid) is based on of events in all 3 arms

of Events in Total vs Control Statistical power bull For all 8 endpoints across all scenarios power is never lower if timing is later bull Largest difference for Exp1 in scenario1 (OS BM+ 11 OS All 13 PFS BM+ 5)

25 30 35 40 45 50 55 60 65 70

Months from First Patient Randomized

Time of Analysis (90 CI) under Different Scenarios

3 Testing Strategy

A Base Case

B Graphical Approach Introduction

C Chosen testing strategy bull Illustration of Sequentially Rejective Procedure

bull Power Gain

D Alternative testing strategies

10

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 4: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

1 Study Design

4

Hypothetical Study Design

Control Experimental 1 Experimental 2

Population Metastatic Oncology

Randomize 111

Primary Population BM+ subpopulation

OS PFS OS PFS

Secondary Population All Randomized

OS PFS OS PFS

α = 001 α = 0015 α = 001 α = 0015

About 40 of the population prognostic and predictive

N = 1200

5

Group-Sequential Nature of the Design

First patient randomized

Enrolment

Time 1 Time 2

22months 29months 44months

PFS

OS (1)

PFS in BM+ is mature

OS in BM+ is mature

6

(1) Lan-DeMets α spending function used for the two BM+ OS endpoints OrsquoBrien and Fleming for the two All Randomized OS endpoints Pocock type of boundary

OS PFS Underlying Survival Curves

7

Legend

- - - - BM+ treatment

_______ All treatment

- - - - BM+ control

_______ All control

OS BM+ vs All Randomized PFS BM+ vs All Randomized

PFS

Rat

es

of Events in All 2 Arms vs in Control Arm Analysis will be conducted when

bull Time 1 PFS in BM+ is mature

bull Time 2 OS in BM+ is mature

Question should maturity be defined based on the of events in

1 All 3 arms in total OR

2 The control arm (requires independent group)

Statistical properties were investigated under 4 scenarios

8

Scenario Details

Protocol Assumptions

1 OS PFS Exp2 no treatment effect

OS and PFS no treatment effect in Experimental2 arm

2 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 30 in Experimental2 arm

3 Drop-out in all 3 arms 5 drop-out rate in all 3 arms

of Events in Total vs Control Simulation Results

Protocol assumptions

2 Exp2 better

1 Exp2 no treatment effect

3 5 drop-out in all 3 arms

9

Top line (dotted) is based on of events in control arm Bottom line (solid) is based on of events in all 3 arms

of Events in Total vs Control Statistical power bull For all 8 endpoints across all scenarios power is never lower if timing is later bull Largest difference for Exp1 in scenario1 (OS BM+ 11 OS All 13 PFS BM+ 5)

25 30 35 40 45 50 55 60 65 70

Months from First Patient Randomized

Time of Analysis (90 CI) under Different Scenarios

3 Testing Strategy

A Base Case

B Graphical Approach Introduction

C Chosen testing strategy bull Illustration of Sequentially Rejective Procedure

bull Power Gain

D Alternative testing strategies

10

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 5: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Hypothetical Study Design

Control Experimental 1 Experimental 2

Population Metastatic Oncology

Randomize 111

Primary Population BM+ subpopulation

OS PFS OS PFS

Secondary Population All Randomized

OS PFS OS PFS

α = 001 α = 0015 α = 001 α = 0015

About 40 of the population prognostic and predictive

N = 1200

5

Group-Sequential Nature of the Design

First patient randomized

Enrolment

Time 1 Time 2

22months 29months 44months

PFS

OS (1)

PFS in BM+ is mature

OS in BM+ is mature

6

(1) Lan-DeMets α spending function used for the two BM+ OS endpoints OrsquoBrien and Fleming for the two All Randomized OS endpoints Pocock type of boundary

OS PFS Underlying Survival Curves

7

Legend

- - - - BM+ treatment

_______ All treatment

- - - - BM+ control

_______ All control

OS BM+ vs All Randomized PFS BM+ vs All Randomized

PFS

Rat

es

of Events in All 2 Arms vs in Control Arm Analysis will be conducted when

bull Time 1 PFS in BM+ is mature

bull Time 2 OS in BM+ is mature

Question should maturity be defined based on the of events in

1 All 3 arms in total OR

2 The control arm (requires independent group)

Statistical properties were investigated under 4 scenarios

8

Scenario Details

Protocol Assumptions

1 OS PFS Exp2 no treatment effect

OS and PFS no treatment effect in Experimental2 arm

2 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 30 in Experimental2 arm

3 Drop-out in all 3 arms 5 drop-out rate in all 3 arms

of Events in Total vs Control Simulation Results

Protocol assumptions

2 Exp2 better

1 Exp2 no treatment effect

3 5 drop-out in all 3 arms

9

Top line (dotted) is based on of events in control arm Bottom line (solid) is based on of events in all 3 arms

of Events in Total vs Control Statistical power bull For all 8 endpoints across all scenarios power is never lower if timing is later bull Largest difference for Exp1 in scenario1 (OS BM+ 11 OS All 13 PFS BM+ 5)

25 30 35 40 45 50 55 60 65 70

Months from First Patient Randomized

Time of Analysis (90 CI) under Different Scenarios

3 Testing Strategy

A Base Case

B Graphical Approach Introduction

C Chosen testing strategy bull Illustration of Sequentially Rejective Procedure

bull Power Gain

D Alternative testing strategies

10

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 6: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Group-Sequential Nature of the Design

First patient randomized

Enrolment

Time 1 Time 2

22months 29months 44months

PFS

OS (1)

PFS in BM+ is mature

OS in BM+ is mature

6

(1) Lan-DeMets α spending function used for the two BM+ OS endpoints OrsquoBrien and Fleming for the two All Randomized OS endpoints Pocock type of boundary

OS PFS Underlying Survival Curves

7

Legend

- - - - BM+ treatment

_______ All treatment

- - - - BM+ control

_______ All control

OS BM+ vs All Randomized PFS BM+ vs All Randomized

PFS

Rat

es

of Events in All 2 Arms vs in Control Arm Analysis will be conducted when

bull Time 1 PFS in BM+ is mature

bull Time 2 OS in BM+ is mature

Question should maturity be defined based on the of events in

1 All 3 arms in total OR

2 The control arm (requires independent group)

Statistical properties were investigated under 4 scenarios

8

Scenario Details

Protocol Assumptions

1 OS PFS Exp2 no treatment effect

OS and PFS no treatment effect in Experimental2 arm

2 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 30 in Experimental2 arm

3 Drop-out in all 3 arms 5 drop-out rate in all 3 arms

of Events in Total vs Control Simulation Results

Protocol assumptions

2 Exp2 better

1 Exp2 no treatment effect

3 5 drop-out in all 3 arms

9

Top line (dotted) is based on of events in control arm Bottom line (solid) is based on of events in all 3 arms

of Events in Total vs Control Statistical power bull For all 8 endpoints across all scenarios power is never lower if timing is later bull Largest difference for Exp1 in scenario1 (OS BM+ 11 OS All 13 PFS BM+ 5)

25 30 35 40 45 50 55 60 65 70

Months from First Patient Randomized

Time of Analysis (90 CI) under Different Scenarios

3 Testing Strategy

A Base Case

B Graphical Approach Introduction

C Chosen testing strategy bull Illustration of Sequentially Rejective Procedure

bull Power Gain

D Alternative testing strategies

10

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 7: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

OS PFS Underlying Survival Curves

7

Legend

- - - - BM+ treatment

_______ All treatment

- - - - BM+ control

_______ All control

OS BM+ vs All Randomized PFS BM+ vs All Randomized

PFS

Rat

es

of Events in All 2 Arms vs in Control Arm Analysis will be conducted when

bull Time 1 PFS in BM+ is mature

bull Time 2 OS in BM+ is mature

Question should maturity be defined based on the of events in

1 All 3 arms in total OR

2 The control arm (requires independent group)

Statistical properties were investigated under 4 scenarios

8

Scenario Details

Protocol Assumptions

1 OS PFS Exp2 no treatment effect

OS and PFS no treatment effect in Experimental2 arm

2 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 30 in Experimental2 arm

3 Drop-out in all 3 arms 5 drop-out rate in all 3 arms

of Events in Total vs Control Simulation Results

Protocol assumptions

2 Exp2 better

1 Exp2 no treatment effect

3 5 drop-out in all 3 arms

9

Top line (dotted) is based on of events in control arm Bottom line (solid) is based on of events in all 3 arms

of Events in Total vs Control Statistical power bull For all 8 endpoints across all scenarios power is never lower if timing is later bull Largest difference for Exp1 in scenario1 (OS BM+ 11 OS All 13 PFS BM+ 5)

25 30 35 40 45 50 55 60 65 70

Months from First Patient Randomized

Time of Analysis (90 CI) under Different Scenarios

3 Testing Strategy

A Base Case

B Graphical Approach Introduction

C Chosen testing strategy bull Illustration of Sequentially Rejective Procedure

bull Power Gain

D Alternative testing strategies

10

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 8: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

of Events in All 2 Arms vs in Control Arm Analysis will be conducted when

bull Time 1 PFS in BM+ is mature

bull Time 2 OS in BM+ is mature

Question should maturity be defined based on the of events in

1 All 3 arms in total OR

2 The control arm (requires independent group)

Statistical properties were investigated under 4 scenarios

8

Scenario Details

Protocol Assumptions

1 OS PFS Exp2 no treatment effect

OS and PFS no treatment effect in Experimental2 arm

2 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 30 in Experimental2 arm

3 Drop-out in all 3 arms 5 drop-out rate in all 3 arms

of Events in Total vs Control Simulation Results

Protocol assumptions

2 Exp2 better

1 Exp2 no treatment effect

3 5 drop-out in all 3 arms

9

Top line (dotted) is based on of events in control arm Bottom line (solid) is based on of events in all 3 arms

of Events in Total vs Control Statistical power bull For all 8 endpoints across all scenarios power is never lower if timing is later bull Largest difference for Exp1 in scenario1 (OS BM+ 11 OS All 13 PFS BM+ 5)

25 30 35 40 45 50 55 60 65 70

Months from First Patient Randomized

Time of Analysis (90 CI) under Different Scenarios

3 Testing Strategy

A Base Case

B Graphical Approach Introduction

C Chosen testing strategy bull Illustration of Sequentially Rejective Procedure

bull Power Gain

D Alternative testing strategies

10

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 9: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

of Events in Total vs Control Simulation Results

Protocol assumptions

2 Exp2 better

1 Exp2 no treatment effect

3 5 drop-out in all 3 arms

9

Top line (dotted) is based on of events in control arm Bottom line (solid) is based on of events in all 3 arms

of Events in Total vs Control Statistical power bull For all 8 endpoints across all scenarios power is never lower if timing is later bull Largest difference for Exp1 in scenario1 (OS BM+ 11 OS All 13 PFS BM+ 5)

25 30 35 40 45 50 55 60 65 70

Months from First Patient Randomized

Time of Analysis (90 CI) under Different Scenarios

3 Testing Strategy

A Base Case

B Graphical Approach Introduction

C Chosen testing strategy bull Illustration of Sequentially Rejective Procedure

bull Power Gain

D Alternative testing strategies

10

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 10: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

3 Testing Strategy

A Base Case

B Graphical Approach Introduction

C Chosen testing strategy bull Illustration of Sequentially Rejective Procedure

bull Power Gain

D Alternative testing strategies

10

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 11: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

11

Experimental1 vs Control Experimental2 vs Control

Testing Strategy Base Case

Time 1

0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002] 0015

PFS BM+

PFS in All

OS BM+

OS in All

[0002]

Experimental1 vs Control Experimental2 vs Control

OS BM+

OS in All

[0009]

OS BM+

OS in All

[0009]

Time 2

For OS in BM+ information fraction assumed 70 (OrsquoBrien-Fleming)

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 12: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Possibility for Improvement Graphical Approach

bull Historical overview bull Introduced in 2009 by Bretz et al (and Burman et al)

bull 2013 extension to group-sequential trials (Maurer Bretz)

bull 2017 Supported by the draft FDA guidance (Multiple Endpoints in Clinical Trials)

bull Closed test procedure =gt Strong control of family-wise error rate

bull A method to depict develop communicate a strategy consisting of Bonferroni-based sequential methods

Additional consideration PFS is not analysed at Time2 12

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 13: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Experimental1 vs Control Experimental2 vs Control

Graphical Approach Introduction

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

Vertices = nodes

Endpoint-specific alpha asymp Local significance level

Edges = paths

Weight of edge

Blue expressions terminology of FDA guidance 13

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 14: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Experimental1 vs Control Experimental2 vs Control

Improving Testing Strategy Add Edges

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

0015

0

1

PFS BM+

PFS in All

OS BM+

OS in All

1

001

0

1

1

1

Along the green edges alpha can be passed only at Time1

14

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 15: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

15

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1a Time 1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 16: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

16

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 1b Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 17: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

17

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2a Time 1

0015

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 18: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

18

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 2b Time 1

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

1

Along the green edges alpha can be passed only at Time1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 19: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

19

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3a Time 2

OS BM+

OS In All

1

0025 [00060023]

0

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 20: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

20

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3b Time 2

OS BM+

OS In All

1

0025 [00060023]

0

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 21: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

21

Experimental1 vs Control Experimental2 vs Control

Illustration of Sequentially Rejective Procedure Step 3c Time 2

OS In All

0025 [002001]

PFS BM+

PFS In All

OS BM+

OS In All

1

001 [00020009]

0

1

1

For OS information fraction assumed BM+ 70 (OrsquoBrien-Fleming) AC 75 (Pocock)

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 22: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

1 1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

1

1

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +171 00 +179 00

All +176 00 +179 00

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ +66 +57

All +77 +73 22

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 23: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

How Does the Power Gain Change under Different Assumptions

Endpoint Modified

Scenario Details

OS 1 OS larger difference in treatment effect by BM status

HR (for non-cure patients and after delay) increased by 10 in the BM- and decreased by 10 in the BM+ population (while keeping HR for all randomized the same) (a)

2 OS shorter delay 10 shorter delay in all 4 OS endpoints (a)

OS PFS 3 OS PFS Exp2 better OS and PFS treatment effect (delay cure rate HR in non-cure patients) improved by 10 in Experimental2 arm

4 OS PFS Exp2 worse OS and PFS treatment effect (delay cure rate HR in non-cure patients) worsened by 10 in Experimental2 arm

5 OS PFS longer delay OS delay increased with 50 PFS delay with 300

PFS 6 PFS no delay No delay in any of the 4 PFS endpoints

Time of analysis is generated based on of events in all 3 arms 23

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 24: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

0

01

02

03

04

05

06

07

0

01

02

03

04

05

06

07

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time1

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC

05

055

06

065

07

075

08

085

09

095

1

05

055

06

065

07

075

08

085

09

095

1

PROTOCOL 1 OS Larger BM effect 2 OS shorter delay 3 Exp2 better 4 Exp2 worse 5 OS PFS longer delay 6 PFS no delay

Benefit of Additional Edges in Statistical Power of OS Time2

OS1+

OS1All

OS2+

OS2All

OS1

OS1AC

OS2

OS2AC24

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 25: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Experimental1 vs Control Experimental2 vs Control

Alternative1 Secure All Endpoints within Comparison

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

ε

1 - ε 1 - ε

25

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dashed edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 26: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative1 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

ε

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

ε

1-ε 1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 170 + 14 + 178 + 17

All + 176 + 16 + 178 + 20

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 64 + 56

All + 75 + 71 26

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 27: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Experimental1 vs Control Experimental2 vs Control

Alternative2 Giving BM+ Maximum Chance

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

001

0

1

ε2 ε2

1 - ε 1 - ε

1

1 1

ε2 ε2

1

27

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 28: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative2 Change in Power vs Base Case (under Protocol Assumptions)

1 - ε 1 - ε

1

1 1

ε2

0015

0

PFS BM+

PFS In All

OS BM

+

OS In All

0002

0

1

0015

0

PFS BM+

PFS In All

OS BM+

OS In All

0002

0

1

ε2 ε2 ε2

1

Time 1 Time 2

OS BM+

OS In All

0002

0

OS BM+

OS In All

0002

0

05 05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 135 + 08 + 138 + 08

All - 197 - 80 - 19 - 79

+20

+15

+10

+5

0

-5

-10

-15

-20

28

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 37 + 32

All - 67 - 68

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 29: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

4 Concluding Remarks

29

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 30: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Concluding Remarks MTP

bull Limitation of the results on statistical power bull OS and PFS were generated independently

bull Identifying the testing strategy for an actual clinical trial always requires the input of several functions and the power results represent only one aspect of these considerations

bull Using graphical approach may result in overpowered endpoints However this still may be desirable reasons bull Protocol assumptions are complicated =gt just one detail off can result in power drop

(even in the presence of meaningful treatment effect)

bull At an interim analysis probability of crossing the efficacy boundary increases =gt gives larger chance for earlier read-out

30

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 31: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Thank you for your attention

31

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 32: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

References

bull Bretz F Maurer W Brannath W Posch M A graphical approach to sequentially rejective multiple test procedures Statistics in Medicine 2009 28 586-604

bull Burman C-F Sonesson C and Guilbaud O (2009) A recycling framework for the construction of Bonferroni-based multiple tests Statistics in Medicine 28 739ndash761

bull Bretz F Posch M Glimm E Klinglmueller F Maurer W Rohmeyer K Graphical approaches for multiple comparison procedures using weighted Bonferroni Simes or parametric tests Biometrical Journal 2011 53(6) 894-913

bull Ye Y Li A Liu L and Yao B (2013) A group sequential Holm procedure with multiple primary endpoints Statistics in Medicine 32 1112-1124

bull Maurer W and Bretz F (2013a) ldquoGraphical Multiple Test Procedures With Memoryrdquo Statistics in Medicine DOI 101002sim5711

bull Willi MAURER and Frank BRETZ Multiple Testing in Group Sequential Trials Using Graphical Approaches American Statistical Association Statistics in Biopharmaceutical Research November 2013 Vol5 No4

bull Xi D Tamhane AC Allocating recycled significance levels in group sequential procedures for multiple endpoints Biom J 2015 Jan57(1)90-107

bull Tamhane A C Mehta C R and Liu L (2010) Testing a primary and a secondary endpoint in a group sequential design Biometrics 66 1174ndash1184

32

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 33: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Back-up

33

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 34: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Basic Parameters of the Study Design Table 1 Basic Parameters of the Study Design

Overall Survival Progression-Free Survival

BM+ BM- All Rand BM+ BM- All Rand

randomized Total (per comparison)

asymp 480 (asymp 320) (c)

asymp 720 (asymp 480) (c)

1200 (800) (fixed)

As OS As OS As OS

Significance level (a) 001 0015

Delayed Effect (b) 4m 5m 1m 1m

Cure Rate in Experimental (b) 15 10 0 0

HR after Delay Effect (b) (d) 06 075 056 075

Overall HR (b) Time1 Time2 070 061 084 074 077 069 063 084 072

Median controlexperimental 8m 12m 9m 12m 9m 12m 4m 64m 4m 47m 4m 55m

events Total Time1 Time2 asymp 285 385 (fixed)

asymp 693 asymp 963 380 (fixed) asymp 989

events per comparison Time1 Time2

asymp 198 asymp 266 asymp 478 asymp 661 asymp 261 asymp 672

(a) Initially allocated endpoint-specific alpha used for sample size calculation (b) For each comparison with the two experimental arms (c) Based on an expected BM+ proportion of 40 (Total number randomized is fixed) (d) As assumed for non-cure patients

34

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 35: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Literature Overview

Graphical approach Bonferroni-based (Bretz et al 2009)

Non-parametric extensions (Bretz et al 2011)

GSP extension special case (Ye et al 2013)

GSP general allows for selection of recycling stages (Xi Tamhane 2015)

GSP extension general (Maurer Bretz 2013)

Graphical approach Bonferroni-based (Burman 2009)

GSP = group-sequential procedure Dark blue boxes referenced by FDA guidance

Graphs with memory (Maurer Bretz 2013a)

-649

-648 -577 -459

35

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 36: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Multiplicity History of Regulatory Guidances bull E9 Statistical Principles for Clinical Trials (1998 Section 56)

bull Pre-specification of MTP bull EMA Points to Consider on Multiplicity Issues in Clinical Trials (2002)

bull Strong control of FWER bull EMA Concept paper on the need for a guideline on multiplicity issues in

clinical trials (Draft 2012) =gt Nov2012 Workshop bull EMA Guideline on multiplicity issues in clinical trials bull FDA Guidance document on multiplicity issues in clinical trials bull PMDA no multiplicity guidance (ldquoMTPs do not necessarily match with

labellingrdquo)

36

DRAFT Released for public

consultation in 2017

Related document within BMS Reporting Observed Significance Levels (p-values) in Clinical Study Reports Regulatory Documents and Publications 2014 (Stephane Munier)

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 37: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Multiplicity Latest Regulatory Guidancersquos (2017) EMA Guideline on multiplicity issues

in clinical trials FDA Guidance document on

multiplicity issues in clinical trials

Principles bull Family-wise strong control bull Pre-specification of multiplicity adjustment

Secondary Endpoints bull Secondary endpoints expressing supportive evidence needs no MTP

bull Secondary endpoints that may become base for additional claims need MTP

(Section 6)

Distinction between different types of secondary endpoints is not clear all require MTP (Section IIIA)

Scope Similar (not the same)

Subgroups Section 7 (also own guidance) Excluded (another guidance)

Estimation and descriptive statistics Section 10 Not included

Graphical approach Not mentioned Explicitly encouraged

37

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 38: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Experimental1 vs Control Experimental2 vs Control

Alternative4 Importance of Experimental1 PFS

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

0015

0

1

PFS BM+

PFS In All

OS BM+

OS In All

1

001

0

1

ε

1

1 - ε

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain 38

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 39: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Time 1 Time 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative4 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

0015

0

1

PFS BM

+

PFS In All

OS BM

+

OS In All

1

0002

0

1

ε

1

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

1

0009

0

OS BM

+

OS In All

1

0002

0

ε

1

1-ε

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 166 + 13 + 131 0

AC + 162 + 19 + 116 0

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 56 + 30

AC + 69 + 40 39

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 40: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Experimental1 vs Control Experimental2 vs Control

Alternative3 Importance of both OS BM+

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

0015

0

05

PFS BM+

PFS In All

OS BM+

OS In All

05

001

0

05 05

Along the green edges alpha can be passed only at IA Along the blue edge recycled alpha can be used only at interim Along the dotted edges alpha is passed only if for the given vertex

only dotted outgoing edges remain

05

05

40

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 41: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Timepoint 1 Timepoint 2

Experimental1 vs Control Experimental2 vs Control Experimental1 vs Control Experimental2 vs Control

Alternative3 Change in Power vs Base Case (under Protocol Assumptions)

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

05

0002

0

05

0015

0

PFS BM

+

PFS In All

OS BM

+

OS In All

0002

0

For OS information fraction assumed BM+ 70 AC 75

+20

+15

+10

+5

0

-5

-10

-15

-20

OS BM

+

OS In All

0009

0

OS BM

+

OS In All

0002

0

05

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 104 00 + 105 00

All + 87 - 14 + 84 - 16

Exp1 vs Control Exp2 vs Control

OS PFS OS PFS

BM+ + 46 NA + 40 NA

All + 49 NA + 42 NA

05 05 05 05 05

41

05 05 05 05

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 42: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

B) Final OS Look Adjusting Significance Levels bull The time of the final OS analysis will be determined by the number of

BM+ OS events in the 3 arms or in the control arm

bull As a result the number of events for each OS comparison is a random variable by design

bull Consequence bull Time 1 Assume none of the OS endpoints was rejected bull Time 2 If the of events for a given comparison is more (less) than expected

=gt the information fraction and nominal significance level used at Time 1 was larger (smaller) than what we should have used

For the final look critical values for OS will be re-calculated using the number of exactly observed events at the final analysis

The effect of adjustment is limited (always lt 0004 and in general lt 0001)

42

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 43: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Illustration Effect of Final Alpha Adjustment

43 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Legend Observed IF at Time 1 lt Q1 Q1 - Q2 Q2 - Q3 Q3 lt

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005

Page 44: Multiplicity Correction in a Group-Sequential Oncology ... · PDF file• Largest difference: for Exp1 in scenario#1 (OS BM+: 11%, OS All: 13%, PFS BM+: 5%) 25 30 35 40 45 50 55 60

Illustration Effect of Final Alpha Adjustment

44 The first 1000 samples generated under protocol assumptions number of events fixed in control arm assumed a local significance level of 005