8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and...

38
8. Instrumental variables regression Recall: In Section 5 we analyzed five sources of estimation bias aris- ing because the regressor is correlated with the error term -→ Violation of the first OLS assumption These threats to internal validity are Omitted variable bias Misspecification of the functional form Measurement error Sample selection bias Simultaneous causality 213

Transcript of 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and...

Page 1: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

8. Instrumental variables regression

Recall:

• In Section 5 we analyzed five sources of estimation bias aris-ing because the regressor is correlated with the error term

−→ Violation of the first OLS assumption

• These threats to internal validity are

Omitted variable bias

Misspecification of the functional form

Measurement error

Sample selection bias

Simultaneous causality

213

Page 2: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Now:

• General technique that helps to obtain a consistent estimatorof the unknown coefficients when the regressor X is corre-lated with the error term u

−→ Instrumental variables (IV) regression

Basic idea:

• Think of the variation in X as having two parts:

one part that is correlated with u(the problematic part)

a second part that is uncorrelated with u(the unproblematic part which can be used for estimation)

214

Page 3: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Issues of this section:

• How can we isolate the problematic from the unproblematicparts in the variations of X?

−→ By the use of instrumental variables(instruments)

• What are good instruments and how can we find them?

215

Page 4: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

8.1. The IV estimator with a single regressor anda single instrument

IV model and assumptions:

• We consider the single-regressor model

Yi = β0 + β1 ·Xi + ui, i = 1, . . . , n, (8.1)

• Xi and ui are assumed to be correlated, that is

Corr(Xi, ui) 6= 0

• We use the additional instrumental variable Z to isolate thatpart of Xi that is uncorrelated with ui

216

Page 5: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Terminology:

• We call variables correlated with the error term endogenous

• We call variables uncorrelated with the error term exogenous

Two conditions for a valid instrument Z:

1. Instrument relevance condition:

Corr(Zi, Xi) 6= 0

(variation in the instrument Zi is related to variation in Xi)

2. Instrument exogeneity condition:

Corr(Zi, ui) = 0

(that part of the variation in Xi captured by Zi is exogenous)

217

Page 6: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Implication of these conditions:• The relevant and exogenous instrument Z can capture move-

ments in X that are exogenous

• This exogenous part of X can be used to consistently esti-mate β1

Formalization of this concept:• Two stage least squares estimation

(TSLS)

• First stage:

Decomposition of X into the problematic and the problem-free components

• Second stage:

Use the problem-free component to estimate β1

218

Page 7: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Two stage least squares estimator:

1. Consider the regression equation

Xi = π0 + π1 · Zi︸ ︷︷ ︸

Part #1

+ vi︸︷︷︸

Part #2

(8.2)

Part #1 is that part of Xi that can be predicted by Zi

Since Zi is exogenous it follows that

Corr(π0 + π1 · Zi, ui) =π1

|π1|·Corr(Zi, ui) = 0

(Part #1 is the problem-free part)

Part #2 is vi for which we have Corr(vi, ui) 6= 0(Part #2 is the problematic part)

We apply OLS to Eq. (8.2) to obtain π0 and π1

219

Page 8: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Two stage least squares estimator: [continued]

2. We use the predicted values Xi = π0 + π1 · Zi and considerthe regression equation

Yi = β0 + β1 · Xi + ui (8.3)

We apply OLS to Eq. (8.3) and obtain the TSLS estima-tors βTSLS

0 of β0 and βTSLS1 of β1

220

Page 9: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Example:

• Estimation of the demand curve for butter based on data onthe quantity of butter consumed (Qbutter

i ) and butter prices(P butter

i ) sampled over n years (i = 1, . . . , n)

• We aim at estimating the butter demand curve

Yi = β0 + β1 ·Xi + ui,

where

Yi = ln(Qbutteri )

Xi = ln(P butteri )

β1 = price elasticity of butter demand

221

Page 10: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Example: [continued]

• We have a simultaneous causality bias here since there arecausal links from ln(P butter

i ) to ln(Qbutteri ), but also from

ln(Qbutteri ) to ln(P butter

i ) via the interaction between the de-mand for and the supply of butter

• It follows from Section 5.1.5. (Slides 143–145) that the re-gressor ln(P butter

i ) is likely to be correlated with the errorterm

−→ OLS estimator of β1 will be inconsistent

222

Page 11: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Equilibrium price and quantity data

223

Page 12: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Equilibrium price and quantity data [continued]

224

Page 13: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Equilibrium price and quantity data [continued]

225

Page 14: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Example: [continued]

• To circumvent this problem we need an instrumental variableZi which shifts the supply curve but leaves the demand curveunaffected

• Such an instrument Zi could be the the variable RAINFALL inthe butter-producing region

Relevance condition:Below average rainfall reduces cattle-grazing and thus re-duces butter production at a given price:

Corr(RAINFALLi, ln(P butteri )) 6= 0

Exogeneity condition:Demand for butter does not depend on the rainfall:

Corr(RAINFALLi, ui) = 0

226

Page 15: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Example: [continued]

• TSLS estimation:

Stage 1:Regress ln(P butter

i ) on RAINFALLi and compute ln(P butteri )

(Isolation of price changes due to shifts in the supplycurve)

Stage 2:Regress ln(Qbutter

i ) on ln(P butteri )

227

Page 16: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Statistical inference for TSLS:

• It can be shown that the TSLS estimator βTSLS1 is consistent

and, in large samples, approximately normally distributed:

βTSLS1 ∼ N(β1, σ2

βTSLS1

),

where

σ2βTSLS1

=1nVar {[Zi − E(Z)] · ui}

[Cov(Zi, Xi)]2 (8.4)

• The standard error of βTSLS1 can be estimated by estimating

the variance and covariance terms appearing on the right-hand side of Eq. (8.4) and taking the square root of theestimate of σ2

βTSLS1

228

Page 17: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Statistical inference for TSLS: [continued]

• These standard errors are routinely computed by economo-metric software packages like EViews

• Because βTSLS1 is normally distributed in large samples, hy-

pothesis tests and confidence intervals about β1 can be con-ducted in the usual way

Attention:

• The ususal OLS standard errors of Stage 2 are not identicalto the TSLS standard errors described above and thus areinvalid(since these ignore the prediction errors of the Xi)

• One should use the special TSLS routines implemented inthe software packages

229

Page 18: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

8.2. The general IV regression model

Now:

• Generalization of the IV regression model to multiple regres-sors and instruments

Four types of variables:

• The dependent variable Y

• Problematic endogenous regressors

• Included exogenous regressors

• Instrumental variables

230

Page 19: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Definition 8.1: (General IV regression model)

The general IV regression model is

Yi = β0+β1X1i+. . .+βkXki+βk+1W1i+. . .+βk+rWri+ui, (8.5)

i = 1, . . . , n, where

• Yi is the dependent variable,

• β0, β1, . . . , βk+r are unknown regression coefficients,

• X1i, . . . , Xki are k endogenous regressors potentially corre-lated with ui,

• W1i, . . . , Wri are r included exogenous regressors which areuncorrelated with ui or are control variables,

• ui is the error term,

• Z1i, . . . , Zmi are m instrumental variables.

231

Page 20: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Definition 8.1: (General IV regression model) [continued]

The coefficients are overidentified if there are more instrumentsthan endogenous regressors (m > k), they are underidentified ifm < k, and they are exactly identified if m = k. Estimation ofthe IV regression model requires exact identification or overiden-tification.

Now:

• Adaption of the TSLS principle to the general IV model de-scribed in Definition 8.1

232

Page 21: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

TSLS in the general IV model:

Consider the general IV regression model (8.5) from Slide 231

1. First-stage regression(s):

Regress X1i on the instrumental variables (Z1i, . . . , Zmi)and the included exogenous variables (W1i, . . . , Wri) usingOLS, that is estimate the following equation via OLS:

X1i = π0 + π1Z1i + . . . + πmZmi

+πm+1W1i + . . . + πm+rWri + vi (8.6)

Compute the predicted values X1i from this regression

Repeat this for all endogenous regressors X2i, . . . , Xki,thereby computing the predicted values X2i, . . . , Xki

233

Page 22: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

TSLS in the general IV model: [continued]2. Second-stage regression

Regress Yi on the predicted values of the endogenous vari-ables X1i, . . . , Xki and the included exogenous variables(W1i, . . . , Wri), that is estimate the following equation viaOLS:

Y1i = β0 + β1X1i + . . . + βkXki+βk+1W1i + . . . + βk+rWri + ui (8.7)

The TSLS estimators βTSLS0 , . . . , βTSLS

k+r are the OLS esti-mators from the second-stage regression (8.7)

Remark:• The two stages are done automatically within TSLS estima-

tion commands in EViews

234

Page 23: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Now:

• Adaption of the conditions for a valid instrument Z from Slide217 (relevance and exogeneity) to the general IV regressionmodel

Intuitively:

• When there are multiple included endogenous variables, thecondition for instrument relevance

must be formulated in a way that it rules out multi-collinearity in the second-stage regression

should reflect that the instruments provide enough infor-mation about the exogenous movements in the endoge-nous variables to sort out their seperate effects on Y

235

Page 24: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Definition 8.2: (Conditions for valid instruments)

A set of m instruments Z1i, . . . , Zmi must satisfy the followingtwo conditions to be valid:

1. Instrument relevance:

In general, let X∗1i be the predicted value of X1i from

the regression of X1i on the instruments Z1i, . . . , Zmi andthe included exogenous regressors W1i, . . . , Wri and letX∗

2i, . . . , X∗ki be analogously defined. Furthermore, let 1

denote the n-dimensional vector 1 ≡ (1, . . . ,1)′. Then(X∗

1, . . . , X∗k, W1, . . . , Wr,1) are not perfectly multicollinear.

236

Page 25: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Definition 8.2: (Conditions for valid instruments) [continued]

1. Instrument relevance: [continued]

If there is only one endogenous regressor Xi, then forthe previous condition to hold, at least one instrumentZji, (j = 1, . . . , m), must have a non-zero coefficient inthe regression equation

Xi = π0 + π1Z1i + . . . + πmZmi

+πm+1W1i + . . . + πm+rWri + vi.

2. Instrument exogeneity:

All instruments are uncorrelated with the error term:

Corr(Z1i, ui) = 0, . . . ,Corr(Zmi, ui) = 0.

237

Page 26: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Next:

• Under which conditions are the TSLS estimators consistentand do have a sampling distribution that is normal in largesamples?

• If we can specify conditions under which this is the case,then the principles of statistical inference for TSLS in thesingle-regressor case as described on Sildes 228–229 carryover to the general case of multiple instruments and multipleendogenous variables(t-statistics, F -statistics, confidence intervals)

238

Page 27: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

The IV regression assumptions:

The variables and errors in the IV regression model in Eq. (8.5)should satisfy the following conditions:

1. E(ui|W1i, . . . , Wri) = 0

2. (X1i, . . . , Xki, W1i, . . . , Wri, Z1i, . . . , Zmi, Yi) are i.i.d. draws fromtheir joint distribution

3. Large outliers are unlikely: X’s, W ’s, Z’s, and Y variableshave nonzero finite fourth moments

4. The two conditions for valid instruments stated in Definition8.2 hold

239

Page 28: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Remarks:

• The calculation of TSLS standard errors is done automati-cally by software packages like EViews

• One should use heteroskedasticity-robust standard errors forthe same reasoning as in the conventional multiple linearregression model

240

Page 29: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

8.3. Checking instrument validity

Important question:

• Is a given set of instruments valid in a particular application?

Meaning of ’instrument relevance’:

• Instrumental relevance plays a role akin to the sample size

• A more relevant instrument produces a more accurate esti-mator, just as a large sample size produces a more accurateestimator

• The more relevant is the instrument, the better is the nor-mal approximation to the sampling distribution of the TSLSestimator and its t- and F -statistics

241

Page 30: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Problems with ’weak’ instruments:• If the instruments are ’weak’, then the TSLS estimator can

be badly biased and the normal distribution is a poor approx-imation to the sampling distribution of the TSLS estimator

−→ No justification for performing statistical inference as de-scribed even when the sampling size is large

−→ TSLS is no longer reliable

Checking for ’weak’ instruments:• How relevant must instruments be for the normal distribution

to provide a good approximation in practice?

• Complicated answer in the general IV model

• Simple rule of thumb in the practically most relevant case ofa single endogenous regressor

242

Page 31: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Rule of thumb 8.3: (Checking for weak instruments)

Consider the first-stage F -statistic testing the hypothesis thatthe coefficients on the instruments Z1i, . . . , Zmi in the first-stageregression (8.6) on Slide 233 are all simultaneously equal to zero:

H0 : π1 = π2 = . . . = πm = 0 vs.

H1 : At least one πj 6= 0 (j = 1, . . . , m).

When there is a single endogenous regressor, a first-stage F -statistic less than 10 indicates that the instruments are weak. Inthis case the TSLS estimator is biased (even in large samples)and the TSLS t-statistics and confidence intervals are unreliable.

243

Page 32: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Meaning of ’instrument exogeneity’:

• If the instruments are not exogenous, then the TSLS is in-consistent

−→ TSLS estimation and inference based on it are unreliable

Statistical tests for exogenous instruments:

• No statistical tests are available when the coefficients areexactly identified(that is when m = k in the IV model (8.5) on Slide 231)

• If the coefficients are overidentified, that is when m > k inEq. (8.5), it is possible to test the hypothesis that the ’extra’instruments are exogenous under the maintained assumptionthat there are enough valid instruments to identify the coef-ficients of interest

244

Page 33: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Theorem 8.4: (The overidentifying restrictions test)

Let uTSLSi be the residuals from TSLS estimation of Eq. (8.5)

from Slide 231. Use OLS to estimate the regression coefficientsin

uTSLSi = δ0 + δ1Z1i + . . . + δmZmi

+ δm+1W1i + . . . + δm+rWri + ei, (8.8)

where ei is the regression error term. Let F denote the homoske-dasticity-only F -statistic testing the null hypothesis

H0 : δ1 = . . . = δm = 0.

The overidentifying restrictions test statistic is

J = m · F.

(The J-test.)

245

Page 34: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Theorem 8.4: (The J-test) [continued]

Under the null hypothesis that all instruments are exogenous(suggesting that the instruments should approximately be uncor-related with uTSLS

i ), and if ei is homoskedastic, in large samplesJ is distributed χ2

m−k, where m − k is the ’degree of overidenti-fication’, that is, the number of instruments minus the numberof endogenous regressors.

Remark:

• An application of the J-test is provided in the case study’The demand for cigarettes’

−→ See class for details

246

Page 35: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

8.4. Where do valid instruments come from?

Important question:

• How can we find instrumental variables for a given applicationthat are both relevant and exogenous?

Two main approaches:

1. Use economic theory to suggest instruments

2. Find an exogenous source of variation in X arising from arandom phenomenon that induces shifts in the endogenousregressor

247

Page 36: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Example of Approach #1:

• Consider the butter demand example from Section 8.1.

• Understanding of the economics of agricultural markets leadsus to look for an instrument that shifts the supply curve butnot the demand curve

• This leads us to consider weather conditions in agriculturalregions

−→ Instrument variable: RAINFALL in agricultural regions

248

Page 37: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Example of Approach #2:

• Consider the effect on test scores of class size

• The regressor CLASS SIZE may be correlated with the errorterm because of omitted variable bias

• In some districts, however, earthquake damages may increasethe average class size

• This variation in class size may be unrelated to potentiallyomitted variables that affect student achievement

−→ Instrument variable: that portion of CLASS SIZE that acr-rues to earthquake damage

249

Page 38: 8. Instrumental variables regression · Implication of these conditions: ‘ The relevant and exogenous instrument Z can capture move-ments in X that are exogenous ‘ This exogenous

Case studies:

• Three examples of how researchers use their expert knowl-edge of their empirical problem to find adequate instrumentalvariables:

Does putting criminals in jail reduce crime?

Does cutting class sizes increase test scores?

Does aggressive treatment of heart attacks prolong lives?

(see class for a thorough discussion)

250