The Regression discontinuity design: continuous and binary ...

32
The Regression discontinuity design: continuous and binary outcomes Sara Geneletti London School of Economics Department of Statistics Joint work with Gianluca Baio, Aidan O’Keeffe & Federico Ricciardi (UCL), Sylvia Richardson (MRC-BSU), Linda Sharples (LSHTM) & other collaborators — funded by UK MRC-MRP grant MR/K014838/1 Bayes Pharma 2019 RDD 1 / 28

Transcript of The Regression discontinuity design: continuous and binary ...

Page 1: The Regression discontinuity design: continuous and binary ...

The Regression discontinuity design: continuousand binary outcomes

Sara GenelettiLondon School of EconomicsDepartment of Statistics

Joint work with Gianluca Baio, Aidan O’Keeffe & Federico Ricciardi (UCL), Sylvia Richardson

(MRC-BSU), Linda Sharples (LSHTM) & other collaborators — funded by UK MRC-MRP grant

MR/K014838/1

Bayes Pharma 2019

RDD 1 / 28

Page 2: The Regression discontinuity design: continuous and binary ...

Outline

1. Basic set-up and key concept

2. Sharp and Fuzzy designs

3. Continuous outcomes: LDL cholesterol levelsI DataI EstimatorsI Prior constraintsI Results

4. Binary outcomes: attaining recommended levelsI EstimatorsI Prior constraints

5. The future of RDDs

RDD 2 / 28

Page 3: The Regression discontinuity design: continuous and binary ...

The Regression Discontinuity Design – what is it?

I The Regression Discontinuity Design (RDD) was first introduced inthe econometrics literature during the 1960s[5].

I The original idea was to exploit policy thresholds to estimate thecausal effect of an educational intervention.

I The RDD has proven to be very useful when treatment is assignedbased on a pre-specified rule linked to a continuous variable . Forexample:

- Antiretroviral HIV drugs might be prescribed when a patients CD4count is less than 200 cells/mm3 [1];

- Statins might be prescribed when a patient’s 10-year risk of acardiovascular event (10-year CVD risk score) exceeds a certainthreshold (e.g. in the UK previously 20% and now 10% ) [2]

RDD 3 / 28

Page 4: The Regression discontinuity design: continuous and binary ...

RDD – key idea

I The key idea is that the threshold acts like a randomizing device

I For those who are familiar with this: it is an instrumental variable .

I This is possible if we consider the units close to the threshold ascoming from the same population in which the assignment variablehas its own natural (random) variability ⇒ (conditional)exchangeability

RDD 4 / 28

Page 5: The Regression discontinuity design: continuous and binary ...

RDD

Education example

I We want to quantify the effect of going to college on future incomeI Comparing the income of individuals who attended college and

those who did not will not provide us with the effect of collegeattendance aloneI Confounders such as social class, ability, motivation etc will make

this difficult

I That is a classic problem of observational studies

RDD 5 / 28

Page 6: The Regression discontinuity design: continuous and binary ...

Education Example continued

I Often college scholarships are given on the basis of grades obtainedin final school examinations, eg if the average exam grade is above75%, the student gets a scholarship

I Suppose one student has an average of 74% and another an averageof 76%:I Can we really consider them as coming from different populations

especially if in other respects (eg family income, post code etc) theyare similar?

I Given that there is natural variability in exam performance even forthe same individual?

RDD 6 / 28

Page 7: The Regression discontinuity design: continuous and binary ...

RDD

Public health example

I Many medicines are prescribed according to a particular guidelineI Antiretroviral HIV drugs prescribed when patient’s CD4 count is less

than 200 cells/mm3[1];I Blood pressure medication is prescribed when patient’s BP is

140/90mmHg or above;I Statins are prescribed when eg 10 year Framingham risk score is over

20%

I Consider a population of HIV patients and suppose patient A has aCD4 count of 195 and patient B has a count of 205 cells/mm3

I Theoretically , patient A gets the drugs while patient B does notI Can we really consider them as coming from different populations?

I If the two are the same in every other relevant respect (eg individualcircumstances etc)

I Given that there is a natural variability in CD4 counts and in theinstruments used to measure them?

RDD 7 / 28

Page 8: The Regression discontinuity design: continuous and binary ...

Application: prescription of statins in primary care UK

Statins

I A class of drugs used to lower cholesterol and prescribed to preventheart diseaseI Trials show an average reduction of LDL cholesterol of ≈ 2 mmol/lI UK NHS guidelines are to prescribe statins to individuals without

previous CVD if their 10 year CVD score exceeds 20%(10%)

I Data: Clinical practice database containing routine GP prescriptions as well asinformation on the variables that determine them (THIN)

I 587 general practices in the UK, covering 5.2% of the (2013) UK population —over 10 million individuals living in the UK and fairly representative of thegeneral population

I Individual characteristics (sex, date of birth, date of registration, proxies ofsocioeconomic status)

I Medical history (GP visits, prescriptions, exams)

I Relevant clinical outcomes (LDL level, CHD events, deaths)

RDD 8 / 28

Page 9: The Regression discontinuity design: continuous and binary ...

Notation (in our application)

I X = continuous forcing/assignment variable (risk score);

I Z = threshold indicator (is risk score above/below 20%);

I T = treatment administered (statin prescription);

I C ≡ (O,U) = observed and unobserved covariates (social class,co-morbidities);

I Y = continuous outcome (LDL cholesterol level).

X and Z

I X is the continuous variable and X = x0 at the threshold

I Z = 1 if X ≥ x0 and Z = 0 if X < x0

RDD 9 / 28

Page 10: The Regression discontinuity design: continuous and binary ...

Sharp vs Fuzzy RDDs

0.0 0.1 0.2 0.3 0.4 0.5 0.6

01

23

45

67

Sharp design:Risk Score vs. LDL

10−year CVD Risk Score

LDL

chol

este

rol (

mm

ol/l)

TreatedUnreated

0.0 0.1 0.2 0.3 0.4 0.5 0.6

0.0

0.2

0.4

0.6

0.8

1.0

Sharp design:Risk Score vs. p(T=1)

10−year CVD Risk Score

p(T

=1)

TreatedUnreated

0.0 0.1 0.2 0.3 0.4 0.5 0.6

01

23

45

67

Risk Score vs. LDL Chol Level (mmol/l)

10−year CVD Risk Score

LDL

Cho

l Lev

el (

mm

ol/l)

TreatedUnreated

0.0 0.1 0.2 0.3 0.4 0.5 0.6

0.0

0.2

0.4

0.6

0.8

1.0

Real data:Risk Score vs. p(T=1)

10−year CVD Risk Score

p(T

=1)

TreatedUnreated

RDD 10 / 28

Page 11: The Regression discontinuity design: continuous and binary ...

RDD – Assumptions

1 Unconfoundedness: Y⊥⊥Z | (T,C, X) guarantees that the unitsjust above and below the threshold are “similar”.

2 Independence of Guidelines: Z⊥⊥C | X the threshold is set by anexternal body, e.g. a government agency.

3 Monotonicity : No decision-maker systematically defies theguidelines - i.e. GPs don’t only prescibe to those below thethreshold(!)

4 Continuity : E(Y |Z,X = x, T,C) is continuous at in x (at x0) forT = 0, 1If the outcome is discontinuous then the effect of thresholdindicator will be confounded with the effect of whatever isresponsible for the discontinuity

1-3 are instrumental variable assumptions

RDD 11 / 28

Page 12: The Regression discontinuity design: continuous and binary ...

The Causal Effect

- Denote xc = x− x0 to be the forcing/assignment variable centredat x0

- Consider the linear model

E(Y ) = µil = β0l + β1lxcil l = above, below

- NB: “close” to the threshold, the covariates C are balanced, so noneed to control for them (kind of...) — but: how close is close?

- The issue of bandwidth selection (how close) is still unresolved

RDD 12 / 28

Page 13: The Regression discontinuity design: continuous and binary ...

Estimation

Sharp RDD

- The formula for the sharp causal estimator is

ATE = E(Y |Z = 1)− E(Y |Z = 0)− E(T |Z = 0) = ∆β = β0a − β0b

Fuzzy RDD

- The formula for the fuzzy causal effect estimator is

LATE =E(Y |Z = 1)− E(Y |Z = 0)

E(T |Z = 1)− E(T |Z = 0)=

∆β

∆π=β0a − β0bπa − πb

I πl is an estimate of Pr(T = 1|Z = z), e.g. the chance of beingtreated when above or below the threshold.

RDD 13 / 28

Page 14: The Regression discontinuity design: continuous and binary ...

RDD – examples

I Increasingly popular in Public Health/Epidemiology

I HIV: The CD4 count is often used to deterimine drug assignment[1].

I HPV: The date of birth of a woman (pre/post vaccineavailability)[4].

I Prostate cancer: PSA is a chemical produced by the body and usedto determine treatment[3].

I Cholesterol: 10-year CVD risk score to determine statintreatment[2].

RDD 14 / 28

Page 15: The Regression discontinuity design: continuous and binary ...

Problems with the LATE and Bayesian solutions

I The denominator of LATE can be very small (i.e. πa ≈ πb)I Informative priors on the relevant parameters can encode knowledge

and assumptions about these two probabilities so that the resultingestimator does not explode to ∞

Other advantages of Bayes

I Estimation of variances and intervals does not rely on asymptotics— just a byproduct of MCMC procedures + can naturally includemore appropriate models (vs 2SLS)

I Can encode more complex models to account for different levels ofcompliance in a straightforward manner

RDD 15 / 28

Page 16: The Regression discontinuity design: continuous and binary ...

Bayesian modelling: µil = β0l + β1lxcil

1. Informative prior on the slopes, based on clinical expert opinions

Estimated prior predictive distribution of LDL cholesterol for a patient whose risk score = 0

LDL level (mmol/l)

Fre

quen

cy D

ensi

ty

0 1 2 3 4 5 6

0.0

0.2

0.4

0.6

0.8

1.0

Estimated prior predictive distribution of LDL cholesterol for a patient whose risk score = 0.199

LDL level (mmol/l)

Fre

quen

cy D

ensi

ty

0 1 2 3 4 5 6

0.0

0.2

0.4

0.6

0.8

1.0

β1l ∼ Normal(m1l, s21l), for suitable values of m1l and s21l

RDD 16 / 28

Page 17: The Regression discontinuity design: continuous and binary ...

Bayesian modelling: µil = β0l + β1lxcil

2. Informative priors on the intercepts:β0b ∼ Normal(m0, s

20) and β0a = β0b + φ

I Weakly informative prior: φ ∼ Normal(0, 2)I “Skeptical” prior on the effect of treatment, which is assumed to be

null

I Strongly informative prior: φ ∼ Normal(−2, 1)I “Enthusiastic” prior, strongly based on the available information

coming from the RCTs (reduction of 2 mmol/l)1

I Relatively small variance to represent strong belief in the trials

3. Informative prior on the probability of treatment:

logit(πa) ∼ Normal(2, 1), logit(πb) ∼ Normal(−2, 1)

I NB: implies that ∆π = πa − πb is centered far from 0 but can varyI Helps stabilise the denominator and thus the LATE

1Ward et al (2007) [6]RDD 16 / 28

Page 18: The Regression discontinuity design: continuous and binary ...

Bayesian modelling: logit(πl)

2. Informative priors on the intercepts:β0b ∼ Normal(m0, s

20) and β0a = β0b + φ

I Weakly informative prior: φ ∼ Normal(0, 2)I “Skeptical” prior on the effect of treatment, which is assumed to be

null

I Strongly informative prior: φ ∼ Normal(−2, 1)I “Enthusiastic” prior, strongly based on the available information

coming from the RCTs (reduction of 2 mmol/l)1

I Relatively small variance to represent strong belief in the trials

3. Informative prior on the probability of treatment:

logit(πa) ∼ Normal(2, 1), logit(πb) ∼ Normal(−2, 1)

I NB: implies that ∆π = πa − πb is centered far from 0 but can varyI Helps stabilise the denominator and thus the LATE

1Ward et al (2007) [6]RDD 16 / 28

Page 19: The Regression discontinuity design: continuous and binary ...

Bayesian modelling: logit(πl)

Prior density estimates for probability of treatment above and below the threshold

Probability of treatment

Den

sity

0.0 0.2 0.4 0.6 0.8 1.0

01

23

45

6

Below ThresholdAbove Threshold

RDD 17 / 28

Page 20: The Regression discontinuity design: continuous and binary ...

Simulation study results

Bandwidth = 0.25 (fairly large!), Treatment effect size ∼ Normal(−2, 0.52)

LATE estimation (Strong IV)

Un

ob

se

rve

d c

on

fou

nd

ing

leve

l

−3.0 −2.5 −2.0 −1.5 −1.0

1 (low)

2

3

4 (high)

LATE estimation (Weak IV)−80 −60 −40 −20 0

LATEcnst =∆

sipβ

∆fixπ

LATEflex =∆

wipβ

∆fdpπ

LATEunct =∆

wipβ

∆uncπ

RDD 18 / 28

Page 21: The Regression discontinuity design: continuous and binary ...

Binary outcomes

I Most of the RDD literature focusses on continuous outcomes, butoften in biostatistics, practitioners are interested in binary outcomes

Example

I Did LDL cholesterol levels drop to recommended levels after statinprescription?

I Guidelines in the UK state that LDL cholesterol levels should bebelow 2 mmol/l for patients who are at high risk (e.g. with multipleco-morbidities)

I And below 3 mmol/l for low risk patients

I Using the same data we dichotomised the LDL cholesterol outcomesuch that Y = 1 is LDL cholesterol levels are below 2mmol/l (or 3mmol/l) and Y = 0 otherwise

RDD 19 / 28

Page 22: The Regression discontinuity design: continuous and binary ...

LDL cholesterol levels

0.1 0.2 0.3 0.4

0.0

0.2

0.4

0.6

0.8

1.0

Risk Score vs p(LDL<3)

10−year CVD Risk Score

p(LD

L<3)

Treated

Unreated

Plot A

0.1 0.2 0.3 0.4

0.0

0.2

0.4

0.6

0.8

1.0

Risk Score vs p(T=1)

10−year CVD Risk Score

p(T

=1)

Treated

Unreated

Plot B

0.1 0.2 0.3 0.4

0.0

0.2

0.4

0.6

0.8

1.0

Risk Score vs p(LDL<2)

10−year CVD Risk Score

p(LD

L<2)

Treated

Unreated

Plot C

0.1 0.2 0.3 0.4

0.0

0.2

0.4

0.6

0.8

1.0

Risk Score vs p(T=1)

10−year CVD Risk Score

p(T

=1)

Treated

Unreated

Plot D

RDD 20 / 28

Page 23: The Regression discontinuity design: continuous and binary ...

Binary outcomes

I We can draw on the IV-based Multiplicative Structural MeanModels (MSMMs), which consider the causal Risk Ratio for theTreated (RRT)

RRT =E[Ea(Y | Z) | T = 1]

E[Eb(Y | Z) | T = 1]

= 1− E(Y | Z = 1)− E(Y | Z = 0)

E(Y T̄ | Z = 1)− E(Y T̄ | Z = 0)

when a set of assumptions holds (log-linear in t + no T -Z multiplicative

interaction)[2]I Known issues of standard estimators (e.g. generalised method of moments):

I May give absurd results (lower 95% interval estimate < 0)I The data for the product term (Y T̄ ) are usually sparse ⇒

implausibly wide interval estimates

I Can “fix” it by using suitable constraints — similar to those used tostabilise the denominator of the LATE in the continuous case

RDD 21 / 28

Page 24: The Regression discontinuity design: continuous and binary ...

Constraining the models (1)

I The RRT is expressed as a function of a set of parameters (in thesame spirit as the LATE)

RRT = f (exp(αa)− exp(αb))

where:I αa and αb are the intercepts in the log-linear models for

E(Y | Z = 1) and E(Y | Z = 0)I For convenience, model yil ∼ Poisson(µil) — consistent with MSMM

assumptions

I Typically, we would put priors on αa and αb, which would induce aprior on RRT

I But: can also put a prior on RRT to ensure that it is > 0 andanother on αa. This would then induce a prior on αb, e.g.

RRT ∼ Gamma(3, 1) αa ∼ p(αa) and αb = g (RRT, αa)

RDD 22 / 28

Page 25: The Regression discontinuity design: continuous and binary ...

Constraining the models (1)

I The RRT is expressed as a function of a set of parameters (in thesame spirit as the LATE)

RRT = f (exp(αa)− exp(αb))

where:I αa and αb are the intercepts in the log-linear models for

E(Y | Z = 1) and E(Y | Z = 0)I For convenience, model yil ∼ Poisson(µil) — consistent with MSMM

assumptions

I Typically, we would put priors on αa and αb, which would induce aprior on RRT

I But: can also put a prior on RRT to ensure that it is > 0 andanother on αa. This would then induce a prior on αb, e.g.

RRT ∼ Gamma(3, 1) αa ∼ p(αa) and αb = g (RRT, αa)

RDD 22 / 28

Page 26: The Regression discontinuity design: continuous and binary ...

Results

●●●

●●●

●●●

●●●

02

46

810

bandwidth

RR

T

0.025 0.05 0.075 0.1

A) Low confounding, Strong IV, RR= 4.48

●● ●●●● ●●

●●

●●●

02

46

810

bandwidthR

RT

0.025 0.05 0.075 0.1

pois.flexpois.prod.flexMSMMLSMM

B) High confounding, Weak IV, RR= 2.11

●● ●●●

●●●●

●●●●

02

46

810

bandwidth

RR

T

0.025 0.05 0.075 0.1

C) Low confounding, Weak IV, RR= 2.11

●●

●●

●●

●●

●●

02

46

810

bandwidth

RR

T

0.025 0.05 0.075 0.1

D) High confounding, Weak IV, RR= 1

“pois.flex” ⇒ based on E(Y T̄ | Z)

“pois.prod.flex” ⇒ based onE(Y | T̄ , Z)E(T̄ | Z)

“MSMM” ⇒ Generalised method of moments []

“LSMM” ⇒ Logistic Structural Mean Model []

RDD 23 / 28

Page 27: The Regression discontinuity design: continuous and binary ...

Sensitivity/Range analysis

I For both continuous and binary outcomes we have developed anumber of estimators which are based on slightly differentassumptions about e.g. how the denominator works or whether dataare sparse

I All of them (and indeed the standard RDD/IV estimators) shouldbe used in any real context

I In a best case scenario all of the estimates have substantial overlap(as was the case in our application)

I When they do not then certainly prefer the ones we develop(!) anduse the simulations to understand why there are discrepancies

I I am always reluctant to give a single point estimate with a credibleinterval – plausible ranges are better

RDD 24 / 28

Page 28: The Regression discontinuity design: continuous and binary ...

Systematic RDDs

There is a lot of potential for making RDDs less opportunistic and moresystematic

Picture this...

I Imagine there is a new drug on the market - it’s passed trials etc

I The NHS wants to know: where do we set the threshold to optimisebenefits (minimise cost?)

I Some rough idea comes from trials but we know they have lowexternal validity

I Run 3/4 RDDs with different thresholds in different sub-populations

I Is this any less ethical than changing the guidelines as evidence fromprimary care emerges?

RDD 25 / 28

Page 29: The Regression discontinuity design: continuous and binary ...

Conclusions

I “Real World Evidence” (i.e. Electronic Health Record data) isincreasingly popular in researchI Causal estimates are still tricky because of issues with self-selection,

confounding, etc

I Useful to (critically!) explore specific designs to balancecharacteristicsI RDDI Interrupted time seriesI ...

I Bayesian modelling particularly helpfulI Because data are available in registries, administrative databases,

there are likely to be RCTs (may be on small samples/time frames)to base priors on

I Design alone may not be sufficient to obtain balance — may need toimpose constraints ⇒ explicit and typically relatively easy in a fullBayesian framework

RDD 26 / 28

Page 30: The Regression discontinuity design: continuous and binary ...

Conclusions

I “Real World Evidence” (i.e. Electronic Health Record data) isincreasingly popular in researchI Causal estimates are still tricky because of issues with self-selection,

confounding, etc

I Useful to (critically!) explore specific designs to balancecharacteristicsI RDDI Interrupted time seriesI ...

I Bayesian modelling particularly helpfulI Because data are available in registries, administrative databases,

there are likely to be RCTs (may be on small samples/time frames)to base priors on

I Design alone may not be sufficient to obtain balance — may need toimpose constraints ⇒ explicit and typically relatively easy in a fullBayesian framework

RDD 26 / 28

Page 31: The Regression discontinuity design: continuous and binary ...

Thank you!

RDD 27 / 28

Page 32: The Regression discontinuity design: continuous and binary ...

References

Jacob Bor, Shahira Ahmed, Matthew P. Fox, Sydney Rosen, Gesine Meyer-Rath, Ingrid T. Katz, Frank Tanser, Deenan Pillay, and Till Baernighausen.

Effect of eliminating CD4-count thresholds on HIV treatment initiation in South Africa: An empirical modeling study.PLOS ONE, 12(6), JUN 15 2017.

S. Geneletti, F. Ricciardi, A. G. OKeeffe, and G. Baio.

Bayesian modelling for binary outcomes in the regression discontinuity design.Submitted to JASA, 2016.

Jonathan Shoag, Joshua Halpern, Brian Eisner, Richard Lee, Sameer Mittal, Christopher E. Barbieri, and Daniel Shoag.

Efficacy of Prostate-Specific Antigen Screening: Use of Regression Discontinuity in the PLCO Cancer Screening Trial.JAMA ONCOLOGY, 1(7):983–986, OCT 2015.

Leah M. Smith, Linda E. Levesque, Jay S. Kaufman, and Erin C. Strumpf.

Strategies for evaluating the assumptions of the regression discontinuity design: a case study using a human papillomavirus vaccination programme.INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 46(3):939–949, JUN 2017.

D. L. Thistlethwaite and D. T. Campbell.

Regression-discontinuity analysis: an alternative to the ex-post facto experiment.Journal of Educational Psychology, 51:309–317, 1960.

S. Ward, L. Jones, A. Pandor, M. Holmes, R. Ara, A. Ryan, W. Yeo, and N. Payne.

A systematic review and economic evaluation of statins for the prevention of coronary events.Health Technology Assessment, 11(14), 2007.

RDD 28 / 28