18-1 Copyright © 2011 by the McGraw-Hill Companies, Inc. All rights reserved.McGraw-Hill/Irwin.
© 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.McGraw-Hill/Irwin 18-1 Chapter 18...
-
Upload
gunner-tush -
Category
Documents
-
view
218 -
download
2
Transcript of © 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.McGraw-Hill/Irwin 18-1 Chapter 18...
18-1
© 2006 The McGraw-Hill Companies, Inc., All Rights Reserved.
McGraw-Hill/Irwin
Chapter 18Chapter 18
Hypothesis Hypothesis TestingTesting
18-2
Learning Objectives
Understand . . .
• the nature and logic of hypothesis testing
• a statistically significant difference
• six-step hypothesis testing procedure
18-3
Learning Objectives
Understand . . .
• differences between parametric and nonparametric tests and when to use each
• factors that influence the selection of an appropriate test of statistical significance
• how to interpret the various test statistics
18-4
Hypothesis Testing
DeductiveReasoning
Inductive Reasoning
18-5
Statistical Procedures
Descriptive Statistics
Inferential Statistics
18-6
Exhibit 18-1Hypothesis Testing and the
Research Process
18-7
Approaches to Hypothesis Testing
Classical statistics• Objective view of
probability• Established
hypothesis is rejected or fails to be rejected
• Analysis based on sample data
Bayesian statistics• Extension of classical
approach• Analysis based on
sample data• Also considers
established subjective probability estimates
18-8
Statistical Significance
18-9
Types of Hypotheses
• Null– H0: = 50 mpg
– H0: < 50 mpg
– H0: > 50 mpg
• Alternate– HA: = 50 mpg
– HA: > 50 mpg
– HA: < 50 mpg
18-10
Exhibit 18-2 Two-Tailed Test of Significance
18-11
Exhibit 18-2 One-Tailed Test of Significance
18-12
Decision Rule
Take no corrective action if the analysis shows that one cannot reject the null hypothesis.
18-13
Exhibit 18-3 Statistical Decisions
18-14
Exhibit 18-4 Probability of Making a Type I Error
18-15
Critical Values
18-16
Exhibit 18-4 Probability of Making A Type I Error
18-17
Factors Affecting Probability of Committing a Error
True value of parameterTrue value of parameter
Alpha level selectedAlpha level selected
One or two-tailed test usedOne or two-tailed test used
Sample standard deviationSample standard deviation
Sample sizeSample size
18-18
Exhibit 18-5 Probability of Making A Type II Error
18-19
Statistical Testing Procedures
Obtain critical test value
Obtain critical test value
Interpret the test
Interpret the test
StagesStages
Choose statistical test
Choose statistical test
State null hypothesis
State null hypothesis
Select level of significance
Select level of significance
Compute difference
value
Compute difference
value
18-20
Tests of Significance
NonparametricParametric
18-21
Assumptions for Using Parametric Tests
Independent observationsIndependent observations
Normal distributionNormal distribution
Equal variancesEqual variances
Interval or ratio scalesInterval or ratio scales
18-22
Exhibit 18-6
18-23
Exhibit 18-6
18-24
Exhibit 18-6
18-25
Advantages of Nonparametric Tests
Easy to understand and useEasy to understand and use
Usable with nominal dataUsable with nominal data
Appropriate for ordinal dataAppropriate for ordinal data
Appropriate for non-normal population distributions
Appropriate for non-normal population distributions
18-26
How To Select A Test
How many samples are involved?
If two or more samples are involved, are the individual cases independent or related?
Is the measurement scale nominal, ordinal, interval, or ratio?
18-27
Exhibit 18-7 Recommended Statistical Techniques
Two-Sample Tests____________________________________________
k-Sample Tests ____________________________________________
Measurement Scale One-Sample Case Related Samples
Independent Samples Related Samples
Independent Samples
Nominal • Binomial• x2 one-sample test
• McNemar • Fisher exact test• x2 two-samples test
• Cochran Q • x2 for k samples
Ordinal • Kolmogorov-Smirnov one-sample test• Runs test
• Sign test
•Wilcoxon matched-pairs test
• Median test
•Mann-Whitney U•Kolmogorov-Smirnov•Wald-Wolfowitz
• Friedman two-way ANOVA
• Median extension•Kruskal-Wallis one-way ANOVA
Interval and Ratio
• t-test
• Z test
• t-test for paired samples
• t-test
• Z test
• Repeated-measures ANOVA
• One-way ANOVA• n-way ANOVA
18-28
Questions Answered by One-Sample Tests
• Is there a difference between observed frequencies and the frequencies we would expect?
• Is there a difference between observed and expected proportions?
• Is there a significant difference between some measures of central tendency and the population parameter?
18-29
Parametric Tests
t-testZ-test
18-30
One-Sample t-Test Example
Null Ho: = 50 mpg
Statistical test t-test
Significance level .05, n=100
Calculated value 1.786
Critical test value 1.66
(from Appendix C, Exhibit C-2)
18-31
One Sample Chi-Square Test Example
Living ArrangementIntend to
JoinNumber
InterviewedPercent
(no. interviewed/200)
ExpectedFrequencies
(percent x 60)
Dorm/fraternity 16 90 45 27
Apartment/rooming house, nearby
13 40 20 12
Apartment/rooming house, distant
16 40 20 12
Live at home 15_____
30_____
15_____
9_____
Total 60 200 100 60
18-32
One-Sample Chi-Square Example
Null Ho: 0 = E
Statistical test One-sample chi-square
Significance level .05
Calculated value 9.89
Critical test value 7.82
(from Appendix C, Exhibit C-3)
18-33
Two-Sample Parametric Tests
18-34
Two-Sample t-Test Example
A Group B Group
Average hourly sales X1 = $1,500 X2 = $1,300
Standard deviation s1 = 225 s2 = 251
18-35
Two-Sample t-Test Example
Null Ho: A sales = B sales
Statistical test t-test
Significance level .05 (one-tailed)
Calculated value 1.97, d.f. = 20
Critical test value 1.725
(from Appendix C, Exhibit C-2)
18-36
Two-Sample Nonparametric Tests: Chi-Square
On-the-Job-Accident
Cell DesignationCountExpected Values Yes No Row Total
Smoker
Heavy Smoker
1,1
12,
8.24
1,2
4
7.75
16
Moderate
2,1
9
7.73
2,2
6
7.27
15
Nonsmoker
3,1
13
18.03
3,2
22
16.97
35
Column Total 34 32 66
18-37
Two-Sample Chi-Square Example
Null There is no difference in distribution channel for age categories.
Statistical test Chi-square
Significance level .05
Calculated value 6.86, d.f. = 2
Critical test value 5.99
(from Appendix C, Exhibit C-3)
18-38
Exhibit 18-8 SPSS Cross-Tab Procedure
18-39
Two-Related-Samples Tests
NonparametricParametric
18-40
Exhibit 18-9 Sales Data for Paired-Samples t-Test
Company Sales Year 2
SalesYear 1 Difference D D2
GM
GE
Exxon
IBM
Ford
AT&T
Mobil
DuPont
Sears
Amoco
Total
126932
54574
86656
62710
96146
36112
50220
35099
53794
23966
123505
49662
78944
59512
92300
35173
48111
32427
49975
20779
3427
4912
7712
3192
3846
939
2109
2632
3819
3187ΣD = 35781 .
11744329
24127744
59474944
10227204
14971716
881721
4447881
6927424
14584761
10156969ΣD = 157364693 .
18-41
Paired-Samples t-Test Example
Null Year 1 sales = Year 2 sales
Statistical test Paired sample t-test
Significance level .01
Calculated value 6.28, d.f. = 9
Critical test value 3.25
(from Appendix C, Exhibit C-2)
18-42
Exhibit 18-10 SPSS Output for Paired-Samples t-Test
18-43
Related-Samples Nonparametric Tests: McNemar Test
BeforeAfter
Do Not Favor
After
Favor
Favor A B
Do Not Favor C D
18-44
An Example of the McNemar Test
BeforeAfter
Do Not Favor
After
Favor
Favor A=10 B=90
Do Not Favor C=60 D=40
18-45
k-Independent-Samples Tests: ANOVA
• Tests the null hypothesis that the means of three or more populations are equal
• One-way: Uses a single-factor, fixed-effects model to compare the effects of a treatment or factor on a continuous dependent variable
18-46
Exhibit 18-12 ANOVA Example
__________________________________________Model Summary_________________________________________
Source d.f. Sum of Squares Mean Square F Value p Value
Model (airline) 2 11644.033 5822.017 28.304 0.0001
Residual (error) 57 11724.550 205.694
Total 59 23368.583
_______________________Means Table________________________
Count Mean Std. Dev. Std. Error
Delta 20 38.950 14.006 3.132
Lufthansa 20 58.900 15.089 3.374
KLM 20 72.900 13.902 3.108
All data are hypothetical
18-47
ANOVA Example Continued
Null A1 = A2 = A3
Statistical test ANOVA and F ratio
Significance level .05
Calculated value 28.304, d.f. = 2, 57
Critical test value 3.16
(from Appendix C, Exhibit C-9)
18-48
Post Hoc: Scheffe’s S Multiple Comparison Procedure
Verses Diff
Crit. Diff. p Value
Delta Lufthansa 19,950 11.400 .0002
KLM 33.950 11.400 .0001
Lufthansa KLM 14.000 11.400 .0122
18-49
Exhibit 18-13 Multiple Comparison Procedures
TestComplex
ComparisonsPairwise
Comparisons
Equaln’s
OnlyUnequal
n’s
EqualVariancesAssumed
UnequalVariances
NotAssumed
Fisher LSD X X X
Bonferroni X X X
Tukey HSD X X X
Tukey-Kramer X X X
Games-Howell X X X
Tamhane T2 X X X
Scheffé S X X X X
Brown-Forsythe X X X X
Newman-Keuls X X
Duncan X X
Dunnet’s T3 X
Dunnet’s C X
18-50
Exhibit 18-14 ANOVA Plots
18-51
Exhibit 18-15 Two-Way ANOVA Example
__________________________________________Model Summary_________________________________________
Source d.f. Sum of Squares Mean Square F Value p Value
Airline 2 11644.033 5822.017 39.178 0.0001
Seat selection 1 3182.817 3182.817 21.418 0.0001
Airline by seat selection 2 517.033 258.517 1.740 0.1853
Residual 54 8024.700 148.606
All data are hypothetical
__________Means Table Effect: Airline by Seat Selection___________
Count Mean Std. Dev. Std. Error
Delta economy 10 35.600 12.140 3.839
Delta business 10 42.300 15.550 4.917
Lufthansa economy 10 48.500 12.501 3.953
Lufthansa business 10 69.300 9.166 2.898
KLM economy 10 64.800 13.037 4.123
KLM business 10 81.000 9.603 3.037
18-52
k-Related-Samples Tests
More than two levels in grouping factor
Observations are matched
Data are interval or ratio
18-53
Exhibit 18-17 Repeated-Measures ANOVA Example
___________________________________Means Table by Airline _________________________________________________________________________
Count Mean Std. Dev. Std. Error
Rating 1, Delta 20 38.950 14.006 3.132
Rating 1, Lufthansa 20 58.900 15.089 3.374
Rating 1, KLM 20 72.900 13.902 3.108
Rating 2, Delta 20 32.400 8.268 1.849
Rating 2, Lufthansa 20 72.250 10.572 2.364
Rating 2, KLM 20 79.800 11.265 2.519
__________________________________________________________Model Summary_________________________________________________________
Source d.f. Sum of Squares Mean Square F Value p Value
Airline 2 3552735.50 17763.775 67.199 0.0001
Subject (group) 57 15067.650 264.345
Ratings 1 625.633 625.633 14.318 0.0004
Ratings by air....... 2 2061.717 1030.858 23.592 0.0001
Ratings by subj..... 57 2490.650 43.696
All data are hypothetical.
______________________________________Means Table Effect: Ratings_________________________________________________________________
Count Mean Std. Dev. Std. Error
Rating 1 60 56.917 19.902 2.569
Rating 2 60 61.483 23.208 2.996
18-54
Key Terms
• a priori contrasts• Alternative hypothesis• Analysis of variance
(ANOVA• Bayesian statistics• Chi-square test• Classical statistics• Critical value• F ratio• Inferential statistics
• K-independent-samples tests
• K-related-samples tests• Level of significance• Mean square• Multiple comparison tests
(range tests)• Nonparametric tests• Normal probability plot
18-55
Key Terms
• Null hypothesis• Observed significance
level• One-sample tests• One-tailed test• p value• Parametric tests• Power of the test• Practical significance
• Region of acceptance• Region of rejection• Statistical significance• t distribution• Trials• t-test• Two-independent-samples
tests
18-56
Key Terms
• Two-related-samples tests
• Two-tailed test• Type I error
• Type II error• Z distribution• Z test