Chapter 13: Comparing Two Population Parameters

51
apter 13: Comparing Two Population Paramete

description

Chapter 13: Comparing Two Population Parameters. 13.1 – Comparing Two Means. - PowerPoint PPT Presentation

Transcript of Chapter 13: Comparing Two Population Parameters

Page 1: Chapter 13:  Comparing Two Population Parameters

Chapter 13: Comparing Two Population Parameters

Page 2: Chapter 13:  Comparing Two Population Parameters

13.1 – Comparing Two Means

Page 3: Chapter 13:  Comparing Two Population Parameters

Comparative studies are more convincing than single-sample investigations, so one-sample inference is not as common as comparative (two-sample) inference. In a comparative study, we may want to compare two treatments, or we may want to compare two populations. In either case, the samples must be chosen randomly and independently in order to perform statistical inference.

How is this different than a matched pairs design?A matched pairs design is when you compare two similar things given the same treatment. This is when you are comparing two sets of samples given different treatments!

Page 4: Chapter 13:  Comparing Two Population Parameters

Two-Sample inference:

Compare two treatments or two populations. The null hypothesis is that there is no difference between the two parameters.

1 2: or oH 1 2: 0oH

Page 5: Chapter 13:  Comparing Two Population Parameters

Review:

How do you subtract two means?

1 2–

How do you subtract two standard deviations?

1 2+2 2

Add their variances and take their square roots!

Page 6: Chapter 13:  Comparing Two Population Parameters

Two Sample Z:

is known

Two Sample T:

is not known

SRS SRS

Normality Normality•Population approx normal• n1 + n2 30 by CLT

•Population approx normal• n1 + n2 30 by CLT• n1 + n2 < 30 and data doesn’t have strong skewness

Independence IndependenceN 10nThe two samples are independent

N 10nThe two samples are independent

Page 7: Chapter 13:  Comparing Two Population Parameters

Note!The t* statistic does not have an exact t-distribution.

The degrees of freedom are calculated differently.22 2

1 2

1 22 22 2

1 2

1 1 2 2

1 11 1

s sn n

dfs s

n n n n

Your calculator will do this for you!

Page 8: Chapter 13:  Comparing Two Population Parameters

Two Sample Z:

Two Sample T:

Confidence Interval:

2 21 2

1 21 2

*x x Zn n

2 21 2

1 21 2

*dfs sx x tn n

estimate test statistic sd

Page 9: Chapter 13:  Comparing Two Population Parameters

Two Sample Z:

Two Sample T:

Hypothesis Test:

1 2 1 22 21 2

1 2

x xZ

n n

test statistic = estimate – hypothesized valuestandard deviation of statistic

1 2 1 22 21 2

1 2

x xt

s sn n

Page 10: Chapter 13:  Comparing Two Population Parameters

Two Sample Z: Two Sample T:STAT-TESTS- 2-SampZtest STAT-TESTS- 2-SampTtest

STAT-TESTS- 2-SampZInt STAT-TESTS- 2-SampTInt

Note: The only time you pool is when the standard deviations are the same. This almost never happens, so just don’t do it!

Calculator Tip!

Page 11: Chapter 13:  Comparing Two Population Parameters

Example #1Patients with heart-attack symptoms arrive at an emergency room either by ambulance or self-transportation provided by themselves, family, or friends. When a patient arrives at the emergency room, the time of arrival is recorded. The time when the patient’s diagnostic treatment begins is also recorded. An administrator of a large hospital wanted to determine whether the mean wait time (time between arrival and diagnostic treatment) for patients with heart-attack symptoms differ according to the mode of transportation. A random sample of 150 patients with heart-attack symptoms who had reported to the emergency room was selected. For each patient, the mode of transportation and wait time were recorded. Summary statistics for each mode of transportation are shown in the table below.

Page 12: Chapter 13:  Comparing Two Population Parameters

Mode of Transportation

Sample Size

Mean Wait Time (in minutes)

Standard Deviation of Wait Time

(in minutes)

Self 73 8.30 5.16

Ambulance 77 6.04 4.30

a. Use a 99% confidence interval to estimate the difference between the mean wait times for ambulance transported patients and self-transported patients at this emergency room.

Page 13: Chapter 13:  Comparing Two Population Parameters

P: μS = mean wait time for diagnostic treatment if traveled by self-transportation

μA = mean wait time for diagnostic treatment if traveled by ambulance

μD = μA - μS = Difference in wait times

Page 14: Chapter 13:  Comparing Two Population Parameters

A:

SRS (says so)

Normality73 + 77 ≥ 30

150 ≥ 30

By the CLT, ok to assume normality

Independence(More than 1500 people with heart-attack symptoms)

Self-transported patients shouldn’t influence the wait time in ambulance transported patients

N: Two-Sample t-interval

nA + nS 30

Page 15: Chapter 13:  Comparing Two Population Parameters

I: 22

* SAA S df

A S

ssx x tn n

222

22 2211 1

1 1

SA

A S

S

A A S S

ssn n

dfss

n n n n

22 2

2 22 2

5.16 4.3073 77

1 5.16 1 4.373 1 73 77 1 77

0.365860.00185 0.0007587

140.3717611

Page 16: Chapter 13:  Comparing Two Population Parameters
Page 17: Chapter 13:  Comparing Two Population Parameters

22

100 * SAA S

A S

ssx x tn n

4.30 5.166.04 8.30 2.62677 73

2.26 2.626 0.3557

4.302, 0.218

2.26 0.93409

Page 18: Chapter 13:  Comparing Two Population Parameters

Note: Using the calculator!

22

* SAA S df

A S

ssx x tn n

*140.37

4.30 5.166.04 8.3077 73

t

4.2910, 0.2291

Page 19: Chapter 13:  Comparing Two Population Parameters

I am 99% confident the true mean difference of wait time of ambulance and self-transported patients is between –4.2910 and –0.2291 minutes

C:

b. Based only on this confidence interval, do you think the difference in the mean wait times is statistically significant? Justify your answer.

Since 0 is not in the confidence interval, we can say that the ambulance wait times are statistically significantly shorter than the wait times for self-transported patients at the 99% confidence level.

Page 20: Chapter 13:  Comparing Two Population Parameters

Example #2: The following is a list of salary rates (per hour in dollars) for men and women with a high school diploma.

Women Men8 10.6 7.5 11.98.25 10.8 8.5 11.959 11 8.5 129.25 11.5 9.85 129.35 11.9 10.5 129.8 12.25 10.5 12.59.95 12.5 10.5 1310 12.5 10.9 13.710 12.95 10.95 13.7510 13.9 11 14.510.25 13.95 11 14.7510.5 14.45 11.65 1510.5 14.8 11.9 15.5

Page 21: Chapter 13:  Comparing Two Population Parameters

If the two samples are independent and are taken randomly, is there significant evidence that the men make more money than the women? Assume that in past experience = 1.99 dollars for men and = 2.01 for women.

P: μM = mean dollars per hour for men with high school diploma

μW = mean dollars per hour for women with high school diploma

μD = μM - μW = Difference in dollars per hour

Page 22: Chapter 13:  Comparing Two Population Parameters

H:

: or : 0o M W o M WH H

: or : 0A M W A M WH H

Page 23: Chapter 13:  Comparing Two Population Parameters

A:

SRS (says so)

Normality26 + 26 ≥ 30

52 ≥ 30

By the CLT, ok to assume normality

Independence(More than 520 people with engineer degree)

Men’s salaries shouldn’t influence the salaries of women with high school diploma. Also, says independent

N: Two-Sample Z-Test

nM + nW 30

Page 24: Chapter 13:  Comparing Two Population Parameters

T:

22

( ) ( )M W M W

WM

M W

x xZ

n n

2 2

(11.76153 11.075) (0)

1.99 2.0126 26

0.686530.5547

1.2376

Page 25: Chapter 13:  Comparing Two Population Parameters

1.24

P(Z > 1.24) = 1 – P(Z < 1.24) =

O:

Page 26: Chapter 13:  Comparing Two Population Parameters
Page 27: Chapter 13:  Comparing Two Population Parameters

1.24

P(Z > 1.24) = 1 – P(Z < 1.24) = 1 – 0.8925 = 0.1075

Page 28: Chapter 13:  Comparing Two Population Parameters

M:

____ p 0.1075 0.05

>

Accept the Null

Page 29: Chapter 13:  Comparing Two Population Parameters

There is not enough evidence to say that men with a high school diploma make more money per hour than women.

S:

Page 30: Chapter 13:  Comparing Two Population Parameters

13.2 – Comparing Two Proportions

Page 31: Chapter 13:  Comparing Two Population Parameters

If we want to compare two populations or compare the responses to two treatments from independent samples, we look at a two-sample proportion:

21: ppHo 0: 21 ppHoor

Page 32: Chapter 13:  Comparing Two Population Parameters

Conditions for Proportion Interval:

SRS

Normality

IndependenceN 10(n1 + n2)The two samples are independent

1 1ˆ 5n p

1 1ˆ1 5n p 2 2ˆ 5n p

2 2ˆ1 5n p

Page 33: Chapter 13:  Comparing Two Population Parameters

2

22

1

1121

)ˆ1(ˆ)ˆ1(ˆ*ˆˆ

npp

nppzpp

Confidence Interval:

estimate test statistic sd

Page 34: Chapter 13:  Comparing Two Population Parameters

Conditions for Proportion Test:

SRS

Normality

IndependenceN 10(n1 + n2)The two samples are independent

1 ˆ 5Cn p

1 ˆ1 5Cn p 2 ˆ 5Cn p

2 ˆ1 5Cn p

count of success in both samplesˆcount of individuals from both samplesCp 1 2

1 2

x xn n

Page 35: Chapter 13:  Comparing Two Population Parameters

Hypothesis Test:

test statistic = estimate – hypothesized valuestandard deviation of statistic

1 2

1 2

ˆ ˆ

1 1ˆ ˆ(1 )c c

p pz

p pn n

Page 36: Chapter 13:  Comparing Two Population Parameters

Confidence Interval: Hypothesis Test

STAT-TESTS- 2-PropZInt STAT-TESTS- 2-PropZTest

Note: The only time you pool is when the standard deviations are the same. This almost never happens, so just don’t do it!

Calculator Tip!

Page 37: Chapter 13:  Comparing Two Population Parameters

Example #1An election is bitterly contested between two rivals. In a poll of 750 potential voters taken 4 weeks before the election, 420 indicated a preference for candidate Grumpy over candidate Dopey. Two weeks later, a new poll of 900 randomly selected potential voters found 465 who plan to vote for Grumpy. Dopey immediately began advertising that support for Grumpy was slipping drastically and that he was going to win the election. Statistically speaking (at the 0.05 level), how happy should Dopey be?

P: p1 = true proportion of people who want Grumpy to win in 1st poll

p2 = true proportion of people who want Grumpy to win in 1st poll

pD = p1 - p2 = Difference in proportion of people in 1st poll and second

Page 38: Chapter 13:  Comparing Two Population Parameters

21: ppHo 0: 21 ppHoor

1 2:AH p p 1 2: 0AH p p or

H:

Page 39: Chapter 13:  Comparing Two Population Parameters

SRS

Normality

1 ˆ 5Cn p

1 ˆ1 5Cn p

2 ˆ 5Cn p

2 ˆ1 5Cn p

count of success in both samplesˆcount of individuals from both samplesCp 1 2

1 2

x xn n

(Says in second one only. Must assume the first)

885 0.5361650

402.27 5(750)(0.536) 5

(750) 1 0.536 5 347.73 5

(900)(0.536) 5482.73 5

(900) 1 0.536 5

417.27 5

Page 40: Chapter 13:  Comparing Two Population Parameters

Independence

Safe to assume there were more than 10(750+900), or 16,500 voters

The first poll might have influenced the second poll, proceed with caution!

N: 2-PropZTest

Page 41: Chapter 13:  Comparing Two Population Parameters

T:

1 2

1 2

ˆ ˆ

1 1ˆ ˆ(1 )c c

p pz

p pn n

0.56 0.5167

1 10.536(1 0.536)750 900

0.04330.02466

1.7576

Page 42: Chapter 13:  Comparing Two Population Parameters

1.75

P(Z > 1.75) = 1 – P(Z < 1.75) =

O:

Page 43: Chapter 13:  Comparing Two Population Parameters
Page 44: Chapter 13:  Comparing Two Population Parameters

1.24

P(Z > 1.24) = 1 – P(Z < 1.24) = 1 – 0.9599 = 0.0401

Or, by calculator:

P(Z > 1.24) = 0.03941

Page 45: Chapter 13:  Comparing Two Population Parameters

M:

____ p 0.03941 0.05

<

Reject the Null

Page 46: Chapter 13:  Comparing Two Population Parameters

There is enough evidence to say that the proportion of voters that support Grumpy has dropped from the 1st poll to the second.

S:

Dopey should be very happy!

Page 47: Chapter 13:  Comparing Two Population Parameters

Example #2Two groups of 40 randomly selected students were selected to be part of a study on drop-out rates. One group was enrolled in a counseling program designed to give them skills needed to succeed in school and the other group received no special counseling. Fifteen of the students who received counseling dropped out of school, and 23 of the students who did not receive counseling dropped out. Construct a 90% confidence interval for the true difference between the drop-out rates of the two groups. P: pC = true proportion of students who drop out with

counseling

pN = true proportion of students who drop out without any counseling

pD = pC - pD = Difference in proportion of students who drop out with counseling vs. without

Page 48: Chapter 13:  Comparing Two Population Parameters

SRS

Normality

1 1ˆ 5n p

1 1ˆ1 5n p

2 2ˆ 5n p

2 2ˆ1 5n p

A: (says in both groups)

(40)(0.375) 5

15 5

40 1 0.375 5

25 5

40 0.575 5

40 1 0.575 5

23 5

17 5

Page 49: Chapter 13:  Comparing Two Population Parameters

Independence

Safe to assume there were more than 10(40+40), or 800 students

The drop out rate of the group with counseling might influence the group without counseling. Proceed with caution!

N: 2-PropZInt

Page 50: Chapter 13:  Comparing Two Population Parameters

I:

2

22

1

1121

)ˆ1(ˆ)ˆ1(ˆ*ˆˆ

npp

nppzpp

0.375(1 0.375) 0.575(1 0.575)0.375 0.575 1.64540 40

0.2 1.645 0.1094

0.3799, 0.0201

Page 51: Chapter 13:  Comparing Two Population Parameters

I am 90% confident the true difference in the proportion of dropouts with counseling vs. without counseling is between –0.3799 and –0.0201.

C:

It appears that drop out rates are lower with the group that got counseling than without it.