
Regression #2

Econ 671

Purdue University

Justin L. Tobias

Estimation

In this lecture, we address estimation of the linear regression model.

There are many objective functions that can be employed to obtain an estimator; here we discuss the most common one, which delivers the familiar OLS estimator.

We then discuss issues of parameter interpretation and prediction, and review details associated with R-squared.


Estimation

$$y_i = x_i\beta + \varepsilon_i.$$

The most widely employed approach seeks to minimize the contribution of the error term εi by minimizing the sum of squared residuals:

$$\min_{\beta} \sum_i (y_i - x_i\beta)^2 = \min_{\beta_1,\beta_2,\ldots,\beta_k} \sum_i (y_i - \beta_1 - \beta_2 x_{i2} - \cdots - \beta_k x_{ik})^2.$$

Unlike the simple regression case, where we considered k = 2 specifically and derived an estimator for that particular case, here we seek an estimator when k is an arbitrary number.
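As a quick numerical illustration (not part of the original notes), the following minimal Python/NumPy sketch builds a small synthetic data set, minimizes the sum-of-squared-residuals objective directly with a generic optimizer, and compares the answer to the closed-form OLS solution derived below; the simulated data and all variable names are assumptions of the example.

```python
import numpy as np
from scipy.optimize import minimize

# Synthetic data (illustrative only): an intercept plus two regressors.
rng = np.random.default_rng(0)
n = 200
X = np.column_stack([np.ones(n), rng.standard_normal((n, 2))])
y = X @ np.array([1.0, 2.0, -0.5]) + rng.standard_normal(n)

def ssr(beta):
    """Sum of squared residuals: sum_i (y_i - x_i beta)^2."""
    resid = y - X @ beta
    return resid @ resid

# Minimize the objective numerically...
beta_numeric = minimize(ssr, x0=np.zeros(X.shape[1])).x

# ...and compare with the closed-form OLS solution beta = (X'X)^{-1} X'y.
beta_closed = np.linalg.solve(X.T @ X, X.T @ y)

print(np.allclose(beta_numeric, beta_closed, atol=1e-4))  # True, up to solver tolerance
```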


Estimation

A “representative” first-order condition from this objective function (differentiating with respect to βj) yields an equation of the form:

$$\sum_i x_{ij}\left(y_i - \beta_1 - \beta_2 x_{i2} - \cdots - \beta_k x_{ik}\right) = 0, \qquad j = 1, 2, \ldots, k.$$

This implies that, for the intercept parameter (where xi1 = 1 for all i):

$$\beta_1 = \bar{y} - \beta_2\bar{x}_2 - \cdots - \beta_k\bar{x}_k.$$

The complete vector β is obtained as the solution of this set of k equations in k unknowns.
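These first-order conditions are easy to verify numerically. The sketch below (synthetic data and names are assumptions of the example, not from the notes) checks that every regressor is orthogonal to the OLS residuals and that the intercept satisfies the sample-mean relationship above.

```python
import numpy as np

# Synthetic data (illustrative only): an intercept plus two regressors.
rng = np.random.default_rng(0)
n = 200
X = np.column_stack([np.ones(n), rng.standard_normal((n, 2))])
y = X @ np.array([1.0, 2.0, -0.5]) + rng.standard_normal(n)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)  # OLS coefficients
resid = y - X @ beta_hat

# Representative first-order conditions: sum_i x_ij * resid_i = 0 for every j.
print(np.allclose(X.T @ resid, 0.0))  # True

# Intercept condition: beta_1 = ybar - beta_2 * xbar_2 - ... - beta_k * xbar_k.
xbar = X[:, 1:].mean(axis=0)
print(np.isclose(beta_hat[0], y.mean() - xbar @ beta_hat[1:]))  # True
```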


Estimation

We can assemble these k equations together in vector / matrix form as:

$$
\begin{bmatrix} x_{11} & x_{21} & \cdots & x_{n1} \\ x_{12} & x_{22} & \cdots & x_{n2} \\ \vdots & \vdots & \ddots & \vdots \\ x_{1k} & x_{2k} & \cdots & x_{nk} \end{bmatrix}
\left(
\begin{bmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{bmatrix}
-
\begin{bmatrix} x_{11} & x_{12} & \cdots & x_{1k} \\ x_{21} & x_{22} & \cdots & x_{2k} \\ \vdots & \vdots & \ddots & \vdots \\ x_{n1} & x_{n2} & \cdots & x_{nk} \end{bmatrix}
\begin{bmatrix} \beta_1 \\ \beta_2 \\ \vdots \\ \beta_k \end{bmatrix}
\right)
=
\begin{bmatrix} 0 \\ 0 \\ \vdots \\ 0 \end{bmatrix},
$$

or, compactly in terms of our regression notation,

$$X'(y - X\beta) = 0.$$

Under the assumptions of our regression model (in particular, that X'X is invertible), then,

$$\beta = (X'X)^{-1}X'y.$$


Estimation

Of course, arriving at β = (X'X)^{-1}X'y is easier and more direct if we simply apply the rules governing vector differentiation (see Appendix A). That is, we seek to minimize:

$$\min_{\beta} (y - X\beta)'(y - X\beta) \qquad \text{or} \qquad \min_{\beta} \left( y'y - \beta'X'y - y'X\beta + \beta'X'X\beta \right).$$

Differentiating with respect to the vector β and setting the result to zero gives:

$$-2X'y + 2X'X\beta = 0 \qquad \text{or} \qquad \beta = (X'X)^{-1}X'y.$$


Estimation

Thus we have a simple-to-calculate, closed-form solution for the estimated coefficient vector.

Given this estimated coefficient vector, fitted (predicted) values are easily obtained:

$$\hat{y} = X\beta,$$

as are the residuals:

$$\hat{\varepsilon} = y - X\beta.$$
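A minimal NumPy sketch of these three quantities follows (synthetic data; not part of the original slides). As a practical aside, np.linalg.solve or np.linalg.lstsq is preferred to forming (X'X)^{-1} explicitly.

```python
import numpy as np

# Synthetic data (illustrative only).
rng = np.random.default_rng(0)
n = 200
X = np.column_stack([np.ones(n), rng.standard_normal((n, 2))])
y = X @ np.array([1.0, 2.0, -0.5]) + rng.standard_normal(n)

# beta = (X'X)^{-1} X'y, computed without forming the inverse explicitly.
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)

y_hat = X @ beta_hat        # fitted (predicted) values
resid = y - X @ beta_hat    # residuals

# np.linalg.lstsq returns the same coefficients and is numerically more stable.
beta_lstsq, *_ = np.linalg.lstsq(X, y, rcond=None)
print(np.allclose(beta_hat, beta_lstsq))  # True
```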


Interpretation

As you all know, multiple regression is advantageous in that it allows the researcher to “control” for other factors when determining the effect of a particular xj on y.

Indeed, the language “After controlling for the influence of other factors, the marginal effect of xj on y is βj” is commonly used.

In the following subsection, we justify this interpretation in a more formal way.


Interpretation

Consider the regression model:

$$y = X_1\beta_1 + x_2\beta_2 + \varepsilon.$$

Here, X1 represents a set of covariates that are important to account for, but are not necessarily the objects of interest.

x2 is regarded as a vector (for simplicity and without loss of generality), so that β1 is a (k − 1) × 1 vector while β2 is a scalar.

Some questions: How can we get β2 directly? Does this provide any insight into the interpretation of multiple regression coefficients?


Interpretation

$$y = X_1\beta_1 + x_2\beta_2 + \varepsilon.$$

We can write this as

$$y = \begin{bmatrix} X_1 & x_2 \end{bmatrix}\begin{bmatrix} \beta_1 \\ \beta_2 \end{bmatrix} + \varepsilon.$$

To calculate

$$\beta = (X'X)^{-1}X'y,$$

we then note that

$$X'X = \begin{bmatrix} X_1' \\ x_2' \end{bmatrix}\begin{bmatrix} X_1 & x_2 \end{bmatrix} = \begin{bmatrix} X_1'X_1 & X_1'x_2 \\ x_2'X_1 & x_2'x_2 \end{bmatrix}.$$


Interpretation

$$y = X_1\beta_1 + x_2\beta_2 + \varepsilon.$$

Likewise,

$$X'y = \begin{bmatrix} X_1' \\ x_2' \end{bmatrix} y = \begin{bmatrix} X_1'y \\ x_2'y \end{bmatrix}.$$

Putting these two equations together, we then obtain:

$$\begin{bmatrix} X_1'X_1 & X_1'x_2 \\ x_2'X_1 & x_2'x_2 \end{bmatrix}\begin{bmatrix} \beta_1 \\ \beta_2 \end{bmatrix} = \begin{bmatrix} X_1'y \\ x_2'y \end{bmatrix}.$$

This produces two “equations”: the first, a vector-valued equation for β1, and the second, a scalar equation for β2.
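The partitioned products can be checked directly in code. The sketch below (synthetic data; X1 and x2 are hypothetical example arrays) assembles X'X and X'y block by block and confirms they match the unpartitioned products.

```python
import numpy as np

# Synthetic data (illustrative only): X1 = [intercept, control], x2 = regressor of interest.
rng = np.random.default_rng(0)
n = 200
X1 = np.column_stack([np.ones(n), rng.standard_normal(n)])
x2 = rng.standard_normal((n, 1))
X = np.column_stack([X1, x2])
y = X @ np.array([1.0, 2.0, -0.5]) + rng.standard_normal(n)

# X'X and X'y assembled from the partitioned blocks.
XtX_blocks = np.block([[X1.T @ X1, X1.T @ x2],
                       [x2.T @ X1, x2.T @ x2]])
Xty_blocks = np.concatenate([X1.T @ y, x2.T @ y])

print(np.allclose(XtX_blocks, X.T @ X))  # True
print(np.allclose(Xty_blocks, X.T @ y))  # True
```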


Interpretation

The first of these equations gives:

$$X_1'X_1\beta_1 + X_1'x_2\beta_2 = X_1'y.$$

We can rearrange this to get:

$$\beta_1 = (X_1'X_1)^{-1}X_1'y - (X_1'X_1)^{-1}X_1'x_2\beta_2.$$

The second of these equations also gives:

$$x_2'X_1\beta_1 + x_2'x_2\beta_2 = x_2'y.$$


Interpretation

The last slide gave two equations in two unknowns. We can substitute the first of these into the second to get an expression that only involves β2:

$$x_2'X_1\left[(X_1'X_1)^{-1}X_1'y - (X_1'X_1)^{-1}X_1'x_2\beta_2\right] + x_2'x_2\beta_2 = x_2'y.$$

Regrouping terms gives:

$$x_2'\left[I - X_1(X_1'X_1)^{-1}X_1'\right]x_2\,\beta_2 = x_2'\left[I - X_1(X_1'X_1)^{-1}X_1'\right]y.$$

Let us define

$$M_1 \equiv I - X_1(X_1'X_1)^{-1}X_1',$$

so that

$$\beta_2 = (x_2'M_1x_2)^{-1}x_2'M_1y.$$

This provides a “direct” way to estimate the short regression coefficient β2.
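A quick numerical check of this formula (synthetic data; the arrays and names are assumptions of the example): β2 computed through M1 matches the corresponding element of the full OLS coefficient vector.

```python
import numpy as np

# Synthetic data (illustrative only): X1 = [intercept, control], x2 = regressor of interest.
rng = np.random.default_rng(0)
n = 200
X1 = np.column_stack([np.ones(n), rng.standard_normal(n)])
x2 = rng.standard_normal(n)
X = np.column_stack([X1, x2])
y = X @ np.array([1.0, 2.0, -0.5]) + rng.standard_normal(n)

# M1 = I - X1 (X1'X1)^{-1} X1'.
M1 = np.eye(n) - X1 @ np.linalg.solve(X1.T @ X1, X1.T)

# "Direct" formula: beta_2 = (x2' M1 x2)^{-1} x2' M1 y.
beta2_direct = (x2 @ M1 @ y) / (x2 @ M1 @ x2)

# Full OLS on [X1 x2]; the last coefficient is beta_2.
beta_full = np.linalg.solve(X.T @ X, X.T @ y)

print(np.isclose(beta2_direct, beta_full[-1]))  # True
```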


Interpretation

The matrix M1 has some nice properties:

M1 is symmetric:

$$M_1' = \left[I - X_1(X_1'X_1)^{-1}X_1'\right]' = I - X_1(X_1'X_1)^{-1}X_1' = M_1.$$

M1 is idempotent:

$$M_1M_1 = \left[I - X_1(X_1'X_1)^{-1}X_1'\right]\left[I - X_1(X_1'X_1)^{-1}X_1'\right] = I - X_1(X_1'X_1)^{-1}X_1' = M_1.$$
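Both properties are easy to verify numerically, as in the short sketch below (X1 is a hypothetical design matrix used only for illustration).

```python
import numpy as np

# Hypothetical X1: an intercept plus one control (illustrative only).
rng = np.random.default_rng(0)
n = 100
X1 = np.column_stack([np.ones(n), rng.standard_normal(n)])

M1 = np.eye(n) - X1 @ np.linalg.solve(X1.T @ X1, X1.T)

print(np.allclose(M1, M1.T))     # symmetric:  M1' = M1
print(np.allclose(M1 @ M1, M1))  # idempotent: M1 M1 = M1
```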


Interpretation

With this in mind, we provide an alternate way to obtain β2 which helps to clarify its interpretation:

Theorem

β2 can be obtained from the following two-step procedure:

1 Regress x2 on X1 and save the residuals, û = M1x2.

2 Regress y on û; the resulting coefficient equals β2.


Interpretation

Proof.

The proof of this result is straightforward. To see this, consider the regression in step 1:

$$x_2 = X_1\theta + u.$$

The estimated residuals are obtained as:

$$\hat{u} = x_2 - X_1\hat{\theta} = x_2 - X_1(X_1'X_1)^{-1}X_1'x_2 = M_1x_2.$$


Interpretation

Proof.

In the second step, we fit the regression:

$$y = \gamma\hat{u} + \eta.$$

Therefore,

$$\hat{\gamma} = (\hat{u}'\hat{u})^{-1}\hat{u}'y = (x_2'M_1'M_1x_2)^{-1}x_2'M_1'y = (x_2'M_1x_2)^{-1}x_2'M_1y = \beta_2,$$

where the last equality uses the symmetry and idempotency of M1 together with the expression for β2 derived earlier.


Interpretation

So, β2 can be obtained from this two-step procedure.

The procedure is rather intuitive: the way that we estimate β2 is to first find the part of x2 that cannot be (linearly) explained by X1 (these are the residuals, û). We then regress y on this “leftover” part of x2 to get β2.

This procedure is often referred to as residual regression. It also clearly justifies our common interpretation of the coefficient as “after controlling for the effect of X1.”
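The residual-regression procedure is easy to replicate numerically. The sketch below (synthetic data; not from the notes) regresses x2 on X1, keeps the residuals, regresses y on those residuals, and recovers the same β2 as the full multiple regression.

```python
import numpy as np

# Synthetic data (illustrative only): X1 = [intercept, control], x2 = regressor of interest.
rng = np.random.default_rng(0)
n = 200
X1 = np.column_stack([np.ones(n), rng.standard_normal(n)])
x2 = rng.standard_normal(n)
X = np.column_stack([X1, x2])
y = X @ np.array([1.0, 2.0, -0.5]) + rng.standard_normal(n)

# Step 1: regress x2 on X1 and keep the residuals (the part of x2 not explained by X1).
theta_hat = np.linalg.solve(X1.T @ X1, X1.T @ x2)
u_hat = x2 - X1 @ theta_hat

# Step 2: regress y on the residuals; the slope is beta_2.
beta2_twostep = (u_hat @ y) / (u_hat @ u_hat)

# Full multiple regression for comparison.
beta_full = np.linalg.solve(X.T @ X, X.T @ y)

print(np.isclose(beta2_twostep, beta_full[-1]))  # True
```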


Coefficient of Determination: R2

Undoubtedly, most (all?) of you are familiar with the coefficient of determination, or R2.

R2 is often a useful diagnostic, measuring the proportion of variation in the data that is explainable by variation among the explanatory variables in the model.

In what follows, we briefly review its derivation and discuss some of its properties.


Coefficient of Determination: R2

To begin, define the total sum of squares (TSS) as follows:

$$\text{TSS} = \sum_i (y_i - \bar{y})^2 = (y - \iota\bar{y})'(y - \iota\bar{y}),$$

where ι denotes an n × 1 vector of ones. Likewise, we can define the explained sum of squares (ESS) as follows:

$$\text{ESS} = \sum_i (\hat{y}_i - \bar{y})^2,$$

and, again, in vector / matrix form, this can be written as:

$$\text{ESS} = (\hat{y} - \iota\bar{y})'(\hat{y} - \iota\bar{y}),$$

and the residual sum of squares:

$$\text{RSS} = \sum_i \hat{\varepsilon}_i^2 = \hat{\varepsilon}'\hat{\varepsilon}.$$


Coefficient of Determination: R2

We now introduce two results, both rather obvious, which will be needed when constructing R2. The first states:

$$\iota'\hat{\varepsilon} = 0.$$

Proof.

Since the model includes an intercept, ι is the first column of X, so the OLS first-order conditions X'ε̂ = 0 immediately imply ι'ε̂ = 0.

Our second result states:

$$(\hat{y} - \iota\bar{y})'\hat{\varepsilon} = 0.$$


Coefficient of Determination: R2

Proof.

$$\begin{aligned} (\hat{y} - \iota\bar{y})'\hat{\varepsilon} &= \hat{y}'\hat{\varepsilon} - \bar{y}\,\iota'\hat{\varepsilon} \\ &= \hat{y}'\hat{\varepsilon} \\ &= \beta'X'\hat{\varepsilon} = 0. \end{aligned}$$

The second line follows from the result we have just proved. The last line again follows from the fact that X'ε̂ = 0, by construction of the OLS estimator.
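Both orthogonality results can be confirmed numerically; the sketch below (synthetic data; an intercept column is included, as the first result requires) checks them directly.

```python
import numpy as np

# Synthetic data (illustrative only); the model includes an intercept column.
rng = np.random.default_rng(0)
n = 200
X = np.column_stack([np.ones(n), rng.standard_normal((n, 2))])
y = X @ np.array([1.0, 2.0, -0.5]) + rng.standard_normal(n)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
y_hat = X @ beta_hat
resid = y - y_hat

iota = np.ones(n)
print(np.isclose(iota @ resid, 0.0))                # iota' resid = 0
print(np.isclose((y_hat - y.mean()) @ resid, 0.0))  # (y_hat - iota*ybar)' resid = 0
```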


Coefficient of Determination: R2

With these results in hand, we can now derive R2. To this end, note:

$$y = \hat{y} + \hat{\varepsilon},$$

which implies

$$y - \iota\bar{y} = (\hat{y} - \iota\bar{y}) + \hat{\varepsilon}.$$

Since this holds for each element of the above vectors, the sum of squared elements must also be equal:

$$(y - \iota\bar{y})'(y - \iota\bar{y}) = (\hat{y} - \iota\bar{y} + \hat{\varepsilon})'(\hat{y} - \iota\bar{y} + \hat{\varepsilon}).$$

Using our previous result, the right hand side reduces to:

$$(\hat{y} - \iota\bar{y})'(\hat{y} - \iota\bar{y}) + \hat{\varepsilon}'\hat{\varepsilon},$$

or

$$\text{TSS} = \text{ESS} + \text{RSS}.$$


Coefficient of Determination: R2

The coefficient of determination, R2 is then defined as:

$$R^2 = \frac{\text{ESS}}{\text{TSS}} \qquad \text{or} \qquad R^2 = 1 - \frac{\text{RSS}}{\text{TSS}}.$$

Some properties:

Since TSS = ESS + RSS with both terms nonnegative, 0 ≤ R2 ≤ 1 (provided the model includes an intercept, so that the decomposition holds).

R2 can never decrease when an additional explanatory variable is added to the model, since the residual sum of squares cannot increase.
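A short numerical sketch (synthetic data; not part of the original slides) computing TSS, ESS, and RSS and confirming that the two expressions for R2 agree when the model contains an intercept.

```python
import numpy as np

# Synthetic data (illustrative only); an intercept is included so TSS = ESS + RSS holds.
rng = np.random.default_rng(0)
n = 200
X = np.column_stack([np.ones(n), rng.standard_normal((n, 2))])
y = X @ np.array([1.0, 2.0, -0.5]) + rng.standard_normal(n)

beta_hat = np.linalg.solve(X.T @ X, X.T @ y)
y_hat = X @ beta_hat
resid = y - y_hat

TSS = np.sum((y - y.mean()) ** 2)      # total sum of squares
ESS = np.sum((y_hat - y.mean()) ** 2)  # explained sum of squares
RSS = np.sum(resid ** 2)               # residual sum of squares

print(np.isclose(TSS, ESS + RSS))            # decomposition holds
print(np.isclose(ESS / TSS, 1 - RSS / TSS))  # the two R^2 formulas agree
```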
