Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions,...

32
Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 1 / 32

Transcript of Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions,...

Page 1: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Misconceptions, Problems, and Fallaciesin Correlational Analysis

James H. Steiger

Department of Psychology and Human DevelopmentVanderbilt University

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 1 / 32

Page 2: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Introduction to Linear Regression1 Introduction

2 Interpreting a Correlation

Some Typical Bivariate Normal Scatterplots

Anscombe’s Quartet

3 No Correlation vs. No Relation

4 Perfect Correlation vs. Equivalence

Height and Weight on the Planet Zorg

5 Combining Populations, and Ignoring Explanatory Variables

6 Restriction of Range

7 Prediction vs. Explanation: The Shrinkage Problem

8 The Third Variable Fallacy

9 Correlation and Causality

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 2 / 32

Page 3: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Introduction

Introduction

In this module, we discuss some common problems and fallacies regardingcorrelation coefficients and their interpretation:

1 Interpreting a Correlation

2 No Correlation vs. No Relation

3 Perfect Correlation vs. Equivalence

4 Combining Populations, and Ignoring Explanatory Variables

5 Restriction of Range

6 Prediction vs. Explanation: The Shrinkage Problem

7 The Third Variable Fallacy

8 Correlation and Causality

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 3 / 32

Page 4: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation

Interpreting a Correlation

If correlations have a familiar and well-behaved distribution, such as thebivariate normal form, then there is a well-defined relationship between theshape of the scatterplot and the correlation coefficient.

Many undergraduate courses begin with a demonstration that connects aseries of scatterplots with the corresponding correlations.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 4 / 32

Page 5: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation Some Typical Bivariate Normal Scatterplots

Interpreting a CorrelationSome Typical Bivariate Normal Scatterplots

Example (Some Typical Scatterplots)

Let’s examine some bivariate normal scatterplots in which the data comefrom populations with means of 0 and variances of 1. These will give youa feel for how correlations are reflected in a bivariate normal scatterplot.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 5 / 32

Page 6: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation Some Typical Bivariate Normal Scatterplots

Interpreting a CorrelationSome Typical Bivariate Normal Scatterplots

Example (Some Typical Scatterplots)

−3 −2 −1 0 1 2 3 4

−3

−2

−1

01

23

rho = 0, n = 500

X

Y

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 6 / 32

Page 7: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation Some Typical Bivariate Normal Scatterplots

Interpreting a CorrelationSome Typical Bivariate Normal Scatterplots

Example (Some Typical Scatterplots)

−2 −1 0 1 2 3

−3

−2

−1

01

23

rho = 0.2, n = 500

X

Y

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 7 / 32

Page 8: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation Some Typical Bivariate Normal Scatterplots

Interpreting a CorrelationSome Typical Bivariate Normal Scatterplots

Example (Some Typical Scatterplots)

−3 −2 −1 0 1 2 3

−3

−2

−1

01

2

rho = 0.5, n = 500

X

Y

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 8 / 32

Page 9: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation Some Typical Bivariate Normal Scatterplots

Interpreting a CorrelationSome Typical Bivariate Normal Scatterplots

Example (Some Typical Scatterplots)

−2 −1 0 1 2 3

−3

−2

−1

01

23

rho = 0.75, n = 500

X

Y

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 9 / 32

Page 10: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation Some Typical Bivariate Normal Scatterplots

Interpreting a CorrelationSome Typical Bivariate Normal Scatterplots

Example (Some Typical Scatterplots)

−3 −2 −1 0 1 2 3

−3

−2

−1

01

2

rho = 0.9, n = 500

X

Y

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 10 / 32

Page 11: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation Some Typical Bivariate Normal Scatterplots

Interpreting a CorrelationSome Typical Bivariate Normal Scatterplots

Example (Some Typical Scatterplots)

−4 −2 0 2 4

−2

02

4

rho = 0.95, n = 500

X

Y

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 11 / 32

Page 12: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation Some Typical Bivariate Normal Scatterplots

Interpreting a CorrelationSome Typical Bivariate Normal Scatterplots

The problem with such examples is that people somethimes generalizefrom them incorrectly.

While a particular shape of scatterplot is connected with a certain level ofcorrelation (circular scatterplots tend to reflect near-zero correlation, forexample), the converse need not be true.

For example, a particular level of correlation may be connected with aninfinity of different-shaped scatterplots.

Consequently, to understand the relationship between two variables, youmust always examine the scatterplot.

This point is dramatized in the famous example known as Anscombe’sQuartet.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 12 / 32

Page 13: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation Anscombe’s Quartet

Interpreting a CorrelationAnscombe’s Quartet

Anscombe presented 4 data sets that all have exactly the same means,variances, correlations, covariances, and regression lines.

Yet only one of the examples looks like the classic “bivariate normal witherror” form.

Let’s examine the scatterplots.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 13 / 32

Page 14: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Interpreting a Correlation Anscombe’s Quartet

Interpreting a CorrelationAnscombe’s Quartet

5 10 15 20

46

810

12

x1

y1

5 10 15 20

46

810

12

x2

y2

5 10 15 20

46

810

12

x3

y3

5 10 15 20

46

810

12

x4

y4

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 14 / 32

Page 15: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

No Correlation vs. No Relation

No Correlation vs. No Relation

In the previous subsection, we saw how many different scatterplots cansupport the idential correlation (and also support the same linearregression line).

The point is, the linear regression line is typically plotted with theassumption that an underlying linear model is correct.

If that assumption is correct, we might say that “all bets are off”regarding the relationship underlying a correlation.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 15 / 32

Page 16: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

No Correlation vs. No Relation

No Correlation vs. No Relation

A classic example of this is embodied in a “trick” exam question that hasannoyed a generation of students.

In this question, students are manipulated into agreeing with thestatement that “a zero correlation between X and Y implies norelationship between X and Y .”

This statement is incorrect. It is true that if there is no underlyingrelationship between X and Y (i.e., they are independent), then there willbe no correlation between them.

However, a relationship can be perfect (but nonlinear) and the correlationcoefficient can be zero.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 16 / 32

Page 17: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

No Correlation vs. No Relation

No Correlation vs. No Relation

Suppose, for example, the variables are X and Y = sin(X ), and we havedata for values of x ranging from 0 to π. The relationship between X andY is perfect, but nonlinear

On the next slide, we plot the relationship and compute the correlation,which is within rounding error of zero.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 17 / 32

Page 18: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

No Correlation vs. No Relation

No Correlation vs. No Relation

> x <- seq(from = 0, to = pi, length.out = 100)

> y <- sin(x)

> plot(x, y)

0.0 0.5 1.0 1.5 2.0 2.5 3.0

0.0

0.2

0.4

0.6

0.8

1.0

x

y

> cor(x, y)

[1] -1.423e-17

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 18 / 32

Page 19: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Perfect Correlation vs. Equivalence

Perfect Correlation vs. Equivalence

It is not uncommon in the social sciences and education literature to see ahigh correlation between two variables to be used as justification for their“equivalence.”

It is important to realize that measures of two completely differentquantities can be correlated perfectly without the quantities beingequivalent.

Here is an example.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 19 / 32

Page 20: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Perfect Correlation vs. Equivalence Height and Weight on the Planet Zorg

Perfect Correlation vs. EquivalenceHeight and Weight on the Planet Zorg

On the planet Zorg, Zorgians are perfect cylinders of the same width anddensity.

So on Zorg, height and weight are perfectly correlated.

Does this mean that height and weight are the same thing?

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 20 / 32

Page 21: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Perfect Correlation vs. Equivalence Height and Weight on the Planet Zorg

Perfect Correlation vs. EquivalenceHeight and Weight on the Planet Zorg

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 21 / 32

Page 22: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Perfect Correlation vs. Equivalence Height and Weight on the Planet Zorg

Perfect Correlation vs. EquivalenceHeight and Weight on the Planet Zorg

No, height and weight are not the same thing. Consider the following.

An initial evaluation of weight among Zorgians found that Zorgians abovea certain “cutoff weight” had a high probability of dying before their 22ndbirthday.

Although the data from the probe were sketchy, experts suggested thatZorgian physiology might be similar to earthlings, and that cardiac issuesmight be involved.

It turned out that this conclusion was completely wrong. It wasn’t theZorgian weights that were causing their early demise, it was their heights.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 22 / 32

Page 23: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Perfect Correlation vs. Equivalence Height and Weight on the Planet Zorg

Perfect Correlation vs. EquivalenceHeight and Weight on the Planet Zorg

Zorgians had a ceremony called “running the temple.”

In this ceremony, young Zorgians around the age of 21–22, as a rite ofpassage to adulthood, ran through the doors of the temple while passersbyshowered them with flowers while simultaneously trying to hit them withswitches made from branges of a local tree.

Unfortunately, the temple doors had sharp overhangs, and the tallerZorgians frequently suffered injuries by slicing off their tops (Zorgians haveno head) and bleeding to death.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 23 / 32

Page 24: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Perfect Correlation vs. Equivalence Height and Weight on the Planet Zorg

Perfect Correlation vs. EquivalenceHeight and Weight on the Planet Zorg

This example highlights the deep conceptual confusion among those whoseek to establish the existence and validity of constructs simply throughcorrelation.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 24 / 32

Page 25: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Combining Populations, and Ignoring Explanatory Variables

Combining Populations, and Ignoring Explanatory Variables

As we saw in a previous lecture, combining intact subgroups into one dataset can result in correlations that are misleading, because they do notapply within any subgroup.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 25 / 32

Page 26: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Restriction of Range

Restriction of Range

Regression can be used to describe the relationships in existing data.

It can also be used to produce a formula for predicting future performancefrom current data.

For example, we might analyze the relationship between SAT math scoresand success in first year engineering at Vanderbilt. After gathering a fairamount of data, we might then use the regression formula to predict nextyear’s performance at Vanderbilt from SAT data on this year’s applicants.

Suppose we did that. There is a catch. We will select next year’s studentsby using the formula and a “cutoff” score.

As a result, we will not have data next year for many of the applicants,because they did not gain admission to Vanderbilt.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 26 / 32

Page 27: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Restriction of Range

Restriction of Range

When data are restricted on the dependent variable, the correlationbetween the predictor and the dependent variable tends to be anunderestimate of the strength of the relationship that would have beenevident had the data not been restricted

This is known as the restriction of range problem.

We can demonstrate it with an artificial data set.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 27 / 32

Page 28: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Restriction of Range

Restriction of Range

Suppose 2000 students apply to Vanderbilt Engineering, and the truerelationship between SAT math and Engineering GPA shows a correlationof 0.51.

The plot below shows what would have been observed had all theapplicants been admitted to Vanderbilt.

However, suppose that only those with SAT scores above 700 wereadmitted.

Then the data set would be reduced to those students to the right of theblue line.

550 600 650 700 750 800

2.5

3.0

3.5

4.0

SAT

GPA

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 28 / 32

Page 29: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Restriction of Range

Restriction of Range

Here is the scatterplot for only those applicants to the right of the blueline. The correlation is reduced from 0.51 to 0.24.

> SAT.accepted <- SAT[SAT > 700]

> GPA.accepted <- GPA[SAT > 700]

> plot(SAT.accepted, GPA.accepted)

700 720 740 760 780 800

2.8

3.0

3.2

3.4

3.6

3.8

4.0

SAT.accepted

GPA

.acc

epte

d

> cor(SAT.accepted, GPA.accepted)

[1] 0.2408

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 29 / 32

Page 30: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Prediction vs. Explanation: The Shrinkage Problem

Prediction vs. Explanation: The Shrinkage Problem

A common way that people can be deceived by a regression analysis is toassume that a multiple regression equation (especially one derived from asmall sample using a large number of predictors) that works well in acurrent sample will work equally as well in a future sample.

Generally, the multiple correlation coefficient will “shrink” substantially ifthe current equation is applied to future values of X with the expectationof predicting future values of Y .

Most regression programs print an “adjusted R2” value to attempt toadjust for shrinkage.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 30 / 32

Page 31: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

The Third Variable Fallacy

The Third Variable Fallacy

Often people assume, sometimes almost subconsciously, that when twovariables correlate highly with a third variable, they correlate highly witheach other.

Actually, if rXW and rYW are both .7071, rXY can vary anywhere from 0 to1.

Only when rXW and/or rYW become very high does the correlationbetween X and Y become highly restricted.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 31 / 32

Page 32: Misconceptions, Problems, and Fallacies in … Notes/ProblemsAndFallacies.pdf · Misconceptions, Problems, and Fallacies in Correlational Analysis James H. Steiger Department of Psychology

Correlation and Causality

Correlation and Causality

Correlation is not causality. This is a standard adage in textbooks onstatistics and experimental design, but it is still forgotten on occasion.

In an earlier lecture, we examined an example: the correlation betweennumber of fire trucks sent to a fire and the dollar damage done by the fire.

This correlation can be elevated, because size of the fire directly affectsdamage done by the fire, and the number of trucks sent to the fire.

James H. Steiger (Vanderbilt University) Misconceptions, Problems, and Fallacies 32 / 32