1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 : 1 = 2...

23
1 Chapter 15 Introduction to the Analysis of Variance I The Omnibus Null Hypothesis H 0 : 1 = 2 = . . . = p H 1 : j = j′ for some j and

Transcript of 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 : 1 = 2...

Page 1: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

1

Chapter 15

Introduction to the Analysis of Variance

I The Omnibus Null Hypothesis

H0: 1 = 2 = . . . = p

H1: j = j′ for some j and j´

Page 2: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

2

A. Answering General Versus Specific ResearchQuestions

1. Population contrast, i, and sample contrast,

1 1 2

2. Pairwise and nonpairwise contrasts211ˆ YY

i

211ˆ

312ˆ

413ˆ

324ˆ

425ˆ

436ˆ

2ˆ 32

17

3ˆ 432

18

22ˆ 4321

9

Page 3: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

3

B. Analysis of Variance Versus Multiple t Tests

1. Number of pairwise contrasts among p means

is given by p(p – 1)/2

p = 3 3(3 – 1)/2 = 3

p = 4 4(4 – 1)/2 = 6

p = 5 5(5 – 1)/2 = 10

2. If C = 3 contrasts among p = 3 means are tested

using a t statistic at = .05, the probability of

one or more type I errors is less than

1 (1 )C 1 (1 .05)3 .14

Page 4: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

4

3. As C increases, the probability of making one or

more Type I errors using a t statistic increases

dramatically.

Prob. of one or more Type I errors

C 4 [1 (1 .05)4] .19

C 5 [1 (1 .05)5] .23

C 6 [1 (1 .05)6] .26

C 7 [1 (1 .05)7 ] .37

Page 5: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

5

4. Analysis of variance tests the omnibus null

hypothesis, H0: 1 = 2 = . . . = p , and controls

probability of making a Type I error at, say,

= .05 for any number of means.

5. Rejection of the null hypothesis makes the

alternative hypothesis, H1: j ≠ j’, tenable.

Page 6: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

6

II Basic Concepts In ANOVA

A. Notation

1. Two subscripts are used to denote a score, Xij.

The i subscript denotes one of the i = 1, . . . , n

participants in a treatment level. The j subscript

denotes one of the j = 1, . . . , p treatment levels.

2. The jth level of treatment A is denoted by aj.

Page 7: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

7

a1 a2 a3 a4

X11 X12 X13 X14

X21 X22 X23 X24

Xn1 Xn2 Xn3 Xn4

Treatment Levels

X.1

X.2 X.3

X.4 X. .

n

XXXX n12111

1

n

XXXX n42414

4

p

XXXX 421

Page 8: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

8

B. Composite Nature of a Score

1. A score reflects the effects of four variables:

independent variable

characteristics of the participants in the

experiment

chance fluctuations in the participant’s

performance

environmental and other uncontrolled

variables

Page 9: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

9

2. Sample model equation for a score

X ij X. . ( X. j X. .) ( X ij X. j )

Score Grand Treatment Error

Mean Effect Effect

3. The statistics estimate parameters of the model

equation as follows

X ij ( j ) ( X ij j )

Score Grand Treatment Error

Mean Effect Effect

Page 10: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

10

4. Illustration of the sample model equation using the

weight-loss data in Table 1.

Table 1. One-Month Weight Losses for Three Diets

a1 a2 a3

7 10 12

9 13 11

8 9 15

6 7 14

Treatment Levels (Diets)

X.1 8

X.2 9 X.3 12

X. . 9.67

Page 11: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

11

5. Let X11 = 7 denote Joan’s weight loss. She used

diet a1. Her score is a composite that tells a story.

X ij X. . ( X. j X. .) ( X ij X. j )

Score Grand Treatment Error

Mean

6. Joan used a less effective diet than other girls

(8 – 9.67 = –1.67), and she lost less weight than

other girls on the same diet (7 – 8 = –1).

67.1Effect 1Effect

)87()67.98(67.97

Page 12: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

12

C. Partition of the Total Sum of Squares (SSTO)

1. The total variability among scores in the diet

experiment

also is a composite that can be decomposed into

between-groups sum of squares (SSBG)

within-groups sum of squares (SSWG)

SSTO ( X ij X. .

i1

n

j1

p )2

SSBG n ( X. j X. .)

2

j1

p

SSWG ( X ij X. j

i1

n

j1

p )2

Page 13: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

13

D. Degrees of Freedom for SSTO, SSBG, and SSWG

1. dfTO = np – 1

2. dfBG = p – 1

3. dfWG = p(n – 1)

E. Mean Squares, MS, and F Statistic 1. SSTO / (np 1) MSTO

2. SSBG / ( p 1) MSBG

3. SSWG / p(n 1) MSWG

4. F MSBG / MSWG

Page 14: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

14

F. Nature of MSBG and MSWG

1. Expected value of MSBG and MSWG when the

null hypothesis is true.

E( MSBG) E( MSWG)

2

2. Expected value of MSBG and MSWG when the

null hypothesis is false.

E( MSWG)

2

E( MSBG)

2 n ( j )2 / ( p 1)

Page 15: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

15

3. MSBG represents variation among participants

who have been treated differently—received

different treatment levels.

4. MSWG represents variation among participants

who have been treated the same—received

the same treatment level.

5. F = MSBG/MSWG values close to 1 suggest that

the treatment levels did not affect the dependent

variable; large values suggest that the treatment

levels had an effect.

Page 16: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

16

III Completely Randomized Design (CR-p Design)

A. Characteristics of a CR-p Design

1. Design has one treatment, treatment A, with p

levels.

2. N = n1 + n2 + . . . + np participants are randomly

assigned to the p treatment levels.

3. It is desirable, but not necessary, to have the same

number of participants in each treatment level.

Page 17: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

17

B. Comparison of layouts for a t-test design for independent samples and a CR-3 design

Participant1 a1 Participant1 a1

Participant2 a1 Participant2 a1

Participant10 a1 Participant10 a1

Participant11 a2 Participant11 a2

Participant12 a2 Participant12 a2

Participant20 a2 Participant20 a2

Participant21 a3

Participant22 a3

Participant30 a3

Treat. Treat.

level level

X.1

X.2

X.1

X.2

X.3

Page 18: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

18

C. Descriptive Statistics for Weight-Loss Data In Table 1

Table 2. Means and Standard Deviations for Weight-Loss Data

Diet

a1 a2 a3

8.00 9.00 12.00

2.21 2.21 2.31

X. j

j

Page 19: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

19

4 6 8 10 12 14 16

a2

a1

One-Month Weight Loss

a3

Figure 1. Stacked box plots for the weight-loss data. The distributions are relatively symmetrical and have similar dispersions.

Page 20: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

20

Table 3. Computational Procedures for CR-3 Design

a1 a2 a3

7 10 12

9 13 11

8 9 15

6 7 14

X ij 80 90 120

000.29014897 ijX

000.302614897 22222 ASX ij

333.3803310290 22 XnpX ij

Page 21: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

21

X iji1

n

2

nj1

p [A]

(80)2

10

(90)2

10

(120)2

102890.000

SSTO [AS] [X ] 3026.000 2803.333 222.667

SSBG [A] [X ] 2890.000 2803.333 86.667

SSWG [AS] [A] 3026.000 2890.000 136.000

D. Sum of Squares Formulas for CR-3 Design

Page 22: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

22

Table 4. ANOVA Table for Weight-Loss Data

Source SS df MS F

1. Between 86.667 p – 1 = 2 43.334 8.60*groups (BG)Three diets

2. Within 136.000 p(n – 1) = 27 5.037groups (WG)

3. Total 222.667 np – 1 = 29

*p < .002

1

2

Page 23: 1 Chapter 15 Introduction to the Analysis of Variance IThe Omnibus Null Hypothesis H 0 :  1 =  2 =... =  p H 1 :  j =  j′ for some j and j´

23

E. Assumptions for CR-p Design

1. The model equation,

reflects all of the sources of variation that affect

Xij.

2. Random sampling or random assignment

3. The j = 1, . . . , p populations are normally

distributed.

4. Variances of the j = 1, . . . , p populations are

equal.

X ij ( j ) ( X ij j ),