© 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.McGraw-Hill/Irwin 18-1 Chapter 18...

18-1

© 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.

McGraw-Hill/Irwin

Chapter 18Chapter 18

Hypothesis Hypothesis TestingTesting

18-2

Learning Objectives

Understand . . .

• the nature and logic of hypothesis testing

• a statistically significant difference

• six-step hypothesis testing procedure

18-3

Learning Objectives

Understand . . .

• differences between parametric and nonparametric tests and when to use each

• factors that influence the selection of an appropriate test of statistical significance

• how to interpret the various test statistics

18-4

Hypothesis Testing

DeductiveReasoning

Inductive Reasoning

18-5

Statistical Procedures

Descriptive Statistics

Inferential Statistics

18-6

Exhibit 18-1Hypothesis Testing and the

Research Process

18-7

Approaches to Hypothesis Testing

Classical statistics• Objective view of

probability• Established

hypothesis is rejected or fails to be rejected

• Analysis based on sample data

Bayesian statistics• Extension of classical

approach• Analysis based on

sample data• Also considers

established subjective probability estimates

18-8

Statistical Significance

18-9

Types of Hypotheses

• Null– H0: = 50 mpg

– H0: < 50 mpg

– H0: > 50 mpg

• Alternate– HA: = 50 mpg

– HA: > 50 mpg

– HA: < 50 mpg

18-10

Exhibit 18-2 Two-Tailed Test of Significance

18-11

Exhibit 18-2 One-Tailed Test of Significance

18-12

Decision Rule

Take no corrective action if the analysis shows that one cannot reject the null hypothesis.

18-13

Exhibit 18-3 Statistical Decisions

18-14

Exhibit 18-4 Probability of Making a Type I Error

18-15

Critical Values

18-16

Exhibit 18-4 Probability of Making A Type I Error

18-17

Factors Affecting Probability of Committing a Error

True value of parameterTrue value of parameter

Alpha level selectedAlpha level selected

One or two-tailed test usedOne or two-tailed test used

Sample standard deviationSample standard deviation

Sample sizeSample size

18-18

Exhibit 18-5 Probability of Making A Type II Error

18-19

Statistical Testing Procedures

Obtain critical test value

Obtain critical test value

Interpret the test

Interpret the test

StagesStages

Choose statistical test

Choose statistical test

State null hypothesis

State null hypothesis

Select level of significance

Select level of significance

Compute difference

value

Compute difference

value

18-20

Tests of Significance

NonparametricParametric

18-21

Assumptions for Using Parametric Tests

Independent observationsIndependent observations

Normal distributionNormal distribution

Equal variancesEqual variances

Interval or ratio scalesInterval or ratio scales

18-22

Exhibit 18-6

18-23

Exhibit 18-6

18-24

Exhibit 18-6

18-25

Advantages of Nonparametric Tests

Easy to understand and useEasy to understand and use

Usable with nominal dataUsable with nominal data

Appropriate for ordinal dataAppropriate for ordinal data

Appropriate for non-normal population distributions

Appropriate for non-normal population distributions

18-26

How To Select A Test

How many samples are involved?

If two or more samples are involved, are the individual cases independent or related?

Is the measurement scale nominal, ordinal, interval, or ratio?

18-27

Exhibit 18-7 Recommended Statistical Techniques

Two-Sample Tests____________________________________________

k-Sample Tests ____________________________________________

Measurement Scale One-Sample Case Related Samples

Independent Samples Related Samples

Independent Samples

Nominal • Binomial• x2 one-sample test

• McNemar • Fisher exact test• x2 two-samples test

• Cochran Q • x2 for k samples

Ordinal • Kolmogorov-Smirnov one-sample test• Runs test

• Sign test

•Wilcoxon matched-pairs test

• Median test

•Mann-Whitney U•Kolmogorov-Smirnov•Wald-Wolfowitz

• Friedman two-way ANOVA

• Median extension•Kruskal-Wallis one-way ANOVA

Interval and Ratio

• t-test

• Z test

• t-test for paired samples

• t-test

• Z test

• Repeated-measures ANOVA

• One-way ANOVA• n-way ANOVA

18-28

Questions Answered by One-Sample Tests

• Is there a difference between observed frequencies and the frequencies we would expect?

• Is there a difference between observed and expected proportions?

• Is there a significant difference between some measures of central tendency and the population parameter?

18-29

Parametric Tests

t-testZ-test

18-30

One-Sample t-Test Example

Null Ho: = 50 mpg

Statistical test t-test

Significance level .05, n=100

Calculated value 1.786

Critical test value 1.66

(from Appendix C, Exhibit C-2)

18-31

One Sample Chi-Square Test Example

Living ArrangementIntend to

JoinNumber

InterviewedPercent

(no. interviewed/200)

ExpectedFrequencies

(percent x 60)

Dorm/fraternity 16 90 45 27

Apartment/rooming house, nearby

13 40 20 12

Apartment/rooming house, distant

16 40 20 12

Live at home 15_____

30_____

15_____

9_____

Total 60 200 100 60

18-32

One-Sample Chi-Square Example

Null Ho: 0 = E

Statistical test One-sample chi-square

Significance level .05

Calculated value 9.89



18-33

Two-Sample Parametric Tests

18-34

Two-Sample t-Test Example

A Group B Group

Average hourly sales X1 = $1,500 X2 = $1,300

Standard deviation s1 = 225 s2 = 251

18-35

Two-Sample t-Test Example

Null Ho: A sales = B sales

Statistical test t-test

Significance level .05 (one-tailed)

Calculated value 1.97, d.f. = 20



18-36

Two-Sample Nonparametric Tests: Chi-Square

On-the-Job-Accident

Cell DesignationCountExpected Values Yes No Row Total

Smoker

Heavy Smoker

1,1

12,

8.24

1,2

4

7.75

16

Moderate

2,1

9

7.73

2,2

6

7.27

15

Nonsmoker

3,1

13

18.03

3,2

22

16.97

35

Column Total 34 32 66

18-37

Two-Sample Chi-Square Example

Null There is no difference in distribution channel for age categories.

Statistical test Chi-square





18-38

Exhibit 18-8 SPSS Cross-Tab Procedure

18-39

Two-Related-Samples Tests

NonparametricParametric

18-40

Exhibit 18-9 Sales Data for Paired-Samples t-Test

Company Sales Year 2

SalesYear 1 Difference D D2

GM

GE

Exxon

IBM

Ford

AT&T

Mobil

DuPont

Sears

Amoco

Total

126932

54574

86656

62710

96146

36112

50220

35099

53794

23966

123505

49662

78944

59512

92300

35173

48111

32427

49975

20779

3427

4912

7712

3192

3846

939

2109

2632

3819

3187ΣD = 35781 .

11744329

24127744

59474944

10227204

14971716

881721

4447881

6927424

14584761

10156969ΣD = 157364693 .

18-41

Paired-Samples t-Test Example

Null Year 1 sales = Year 2 sales

Statistical test Paired sample t-test





18-42

Exhibit 18-10 SPSS Output for Paired-Samples t-Test

18-43

Related-Samples Nonparametric Tests: McNemar Test

BeforeAfter

Do Not Favor

After

Favor

Favor A B

Do Not Favor C D

18-44

An Example of the McNemar Test

BeforeAfter

Do Not Favor

After

Favor

Favor A=10 B=90

Do Not Favor C=60 D=40

18-45

k-Independent-Samples Tests: ANOVA

• Tests the null hypothesis that the means of three or more populations are equal

• One-way: Uses a single-factor, fixed-effects model to compare the effects of a treatment or factor on a continuous dependent variable

18-46

Exhibit 18-12 ANOVA Example

__________________________________________Model Summary_________________________________________

Source d.f. Sum of Squares Mean Square F Value p Value

Model (airline) 2 11644.033 5822.017 28.304 0.0001

Residual (error) 57 11724.550 205.694

Total 59 23368.583

_______________________Means Table________________________

Count Mean Std. Dev. Std. Error

Delta 20 38.950 14.006 3.132

Lufthansa 20 58.900 15.089 3.374

KLM 20 72.900 13.902 3.108

All data are hypothetical

18-47

ANOVA Example Continued

Null A1 = A2 = A3

Statistical test ANOVA and F ratio


Calculated value 28.304, d.f. = 2, 57



18-48

Post Hoc: Scheffe’s S Multiple Comparison Procedure

Verses Diff

Crit. Diff. p Value

Delta Lufthansa 19,950 11.400 .0002

KLM 33.950 11.400 .0001

Lufthansa KLM 14.000 11.400 .0122

18-49

Exhibit 18-13 Multiple Comparison Procedures

TestComplex

ComparisonsPairwise

Comparisons

Equaln’s

OnlyUnequal

n’s

EqualVariancesAssumed

UnequalVariances

NotAssumed

Fisher LSD X X X

Bonferroni X X X

Tukey HSD X X X

Tukey-Kramer X X X

Games-Howell X X X

Tamhane T2 X X X

Scheffé S X X X X

Brown-Forsythe X X X X

Newman-Keuls X X

Duncan X X

Dunnet’s T3 X

Dunnet’s C X

18-50

Exhibit 18-14 ANOVA Plots

18-51

Exhibit 18-15 Two-Way ANOVA Example

__________________________________________Model Summary_________________________________________


Airline 2 11644.033 5822.017 39.178 0.0001

Seat selection 1 3182.817 3182.817 21.418 0.0001

Airline by seat selection 2 517.033 258.517 1.740 0.1853

Residual 54 8024.700 148.606

All data are hypothetical

__________Means Table Effect: Airline by Seat Selection___________


Delta economy 10 35.600 12.140 3.839

Delta business 10 42.300 15.550 4.917

Lufthansa economy 10 48.500 12.501 3.953

Lufthansa business 10 69.300 9.166 2.898

KLM economy 10 64.800 13.037 4.123

KLM business 10 81.000 9.603 3.037

18-52

k-Related-Samples Tests

More than two levels in grouping factor

Observations are matched

Data are interval or ratio

18-53

Exhibit 18-17 Repeated-Measures ANOVA Example

___________________________________Means Table by Airline _________________________________________________________________________


Rating 1, Delta 20 38.950 14.006 3.132

Rating 1, Lufthansa 20 58.900 15.089 3.374

Rating 1, KLM 20 72.900 13.902 3.108

Rating 2, Delta 20 32.400 8.268 1.849

Rating 2, Lufthansa 20 72.250 10.572 2.364

Rating 2, KLM 20 79.800 11.265 2.519

__________________________________________________________Model Summary_________________________________________________________


Airline 2 3552735.50 17763.775 67.199 0.0001

Subject (group) 57 15067.650 264.345

Ratings 1 625.633 625.633 14.318 0.0004

Ratings by air....... 2 2061.717 1030.858 23.592 0.0001

Ratings by subj..... 57 2490.650 43.696

All data are hypothetical.

______________________________________Means Table Effect: Ratings_________________________________________________________________


Rating 1 60 56.917 19.902 2.569

Rating 2 60 61.483 23.208 2.996

18-54

Key Terms

• a priori contrasts• Alternative hypothesis• Analysis of variance

(ANOVA• Bayesian statistics• Chi-square test• Classical statistics• Critical value• F ratio• Inferential statistics

• K-independent-samples tests

• K-related-samples tests• Level of significance• Mean square• Multiple comparison tests

(range tests)• Nonparametric tests• Normal probability plot

18-55

Key Terms

• Null hypothesis• Observed significance

level• One-sample tests• One-tailed test• p value• Parametric tests• Power of the test• Practical significance

• Region of acceptance• Region of rejection• Statistical significance• t distribution• Trials• t-test• Two-independent-samples

tests

18-56

Key Terms

• Two-related-samples tests

• Two-tailed test• Type I error

• Type II error• Z distribution• Z test

© 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.McGraw-Hill/Irwin 18-1 Chapter 18...

Documents

Transcript of © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.McGraw-Hill/Irwin 18-1 Chapter 18...