Sta220 - Statistics Mr. Smith Room 310 Class #16.

33
Sta220 - Statistics Mr. Smith Room 310 Class #16

Transcript of Sta220 - Statistics Mr. Smith Room 310 Class #16.

Page 1: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Sta220 - Statistics Mr. SmithRoom 310Class #16

Page 2: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Section 5-1 and 5-2 Notes

Page 3: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Our goal in this chapter is to estimate the value of an unknown population parameter, such as the population mean.

Example• The mean gas mileage for a new car model• The average expected life of a flat-screen

computer monitor.

Page 4: Sta220 - Statistics Mr. Smith Room 310 Class #16.

The unknown population parameter (e.g., mean or proportion) that we are interested in estimating is called the target parameter.

Page 5: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Copyright © 2013 Pearson Education, Inc.. All rights reserved.

Procedure

Page 6: Sta220 - Statistics Mr. Smith Room 310 Class #16.

A point estimator of a population parameter is a rule or formula that tells us how to use the sample data to calculate a single number that can be used as an estimate of the target parameter.

An interval estimator (or confidence interval) is a formula that tells us how to use the sample data to calculate an interval that estimates the target parameter.

Page 7: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Example:We’ll use the sample mean to estimate the population mean . Consequently, , is a point estimator. We then attach a measure of reliability, similar to a standard deviation, to our estimate by obtaining an interval estimator, also called confidence interval.

Page 8: Sta220 - Statistics Mr. Smith Room 310 Class #16.

5-2: Confidence Interval for a Population Mean: Normal (z) Statistic

Page 9: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Suppose a large hospital wants to estimate the average length of time patients remain in the hospital.

• The hospital’s target parameter is the population mean .

• Hospital administrators plan to randomly sample 100 of all previous patients’ records.

• The sample mean of the lengths of stay to estimate , the mean of all patients’ visits. So the sample mean represents a point estimator.

Page 10: Sta220 - Statistics Mr. Smith Room 310 Class #16.

According to the Central Limit Theorem, the sampling distribution of the sample mean is approximately normal for a large samples. (n > 30)

Now we can calculate the interval estimator:

This means we form an interval from 1.96 standard deviations (probability = .95) below the sample mean to 1.96 standard deviations above the mean.

Page 11: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Copyright © 2013 Pearson Education, Inc.. All rights reserved.

Sampling distribution of x

Page 12: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Example 5.1:

Consider the large hospital that wants to estimate the average length of stay of its patients, . The hospital randomly samples n = 100 of its patients and finds that the sample mean length of stay is days. Also, suppose it is known that the standard deviation of the length of stay for all hospital patients is days. Use the interval estimator to calculate a confidence interval for the target parameter, .

Page 13: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Solution

Substitution and into the interval estimator formula, we obtain:

Page 14: Sta220 - Statistics Mr. Smith Room 310 Class #16.

or (3.72, 5.28).

Page 15: Sta220 - Statistics Mr. Smith Room 310 Class #16.

The confidence coefficient is the probability that an interval estimator encloses the population parameter – that is, the relative frequency with which the interval estimator encloses the population parameter when the estimator is used repeatedly a very large number of times. The confidence level is the confidence coefficient expressed as a percentage.

Page 16: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Copyright © 2013 Pearson Education, Inc.. All rights reserved.

Confidence intervals for m: 10 samples

Page 17: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Copyright © 2013 Pearson Education, Inc.. All rights reserved.

Locating za/2 on the standard normal curve

Page 18: Sta220 - Statistics Mr. Smith Room 310 Class #16.

The value is defined as the value of the standard normal random variable z such that the area will lie to its right. In other words, .

Page 19: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Example 5-2:

Find for = .80.

Page 20: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Solution

To illustrate, for confidence coefficient of .80, we have , and .

is z value that locates area .10 in the upper tail of the sampling distribution. Since the total area to the right of the mean is .10, we find the z value corresponding to an area of .5- .1 = .4 to the right of the mean.

This z value is = 1.28

Page 21: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Copyright © 2013 Pearson Education, Inc.. All rights reserved.

Table 7.2

Page 22: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Copyright © 2013 Pearson Education, Inc.. All rights reserved.

Procedure

Page 23: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Copyright © 2013 Pearson Education, Inc.. All rights reserved.

Definition

Page 24: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Example 5-3

Unoccupied seats on flights cause airlines to lose revenue. Suppose a large airline wants to estimate its average number of unoccupied seats per flight over the past year. To accomplish this, the records of 225 flights are randomly selected, and the number of unoccupied seats is noted for each of the sampled flights. Descriptive statistics for the data are displayed in the MINITAB printout below.

Estimate μ, the mean number of unoccupied seats per flight during the past year, using 90% confidence interval.

Page 25: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Solution

The form of a large-sample 90% confidence interval for a population mean is:

Page 26: Sta220 - Statistics Mr. Smith Room 310 Class #16.

or the interval 11.15 to 12.05. That is, at the 90% confident level, we estimate the mean number of unoccupied seats per flight to be between 11.15 and 12.5 during the sampled year.

Page 27: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Example 5-4

Many middle schools have initiated a program that provides every student with a free laptop (notebook) computer. Student usage of laptops at a middle school that participates in the initiative was investigated in American Secondary Education (fall 2009). In a sample of 106 students, the researchers reported the following statistics on how many minutes per day each student used his or her laptop for taking notes: = 13.2 and s = 19.5. Now the researchers want to estimate the average amount of time per day laptops are used for taking notes for all middle school students across the country.

Page 28: Sta220 - Statistics Mr. Smith Room 310 Class #16.

a. Calculate a 90% confidence interval for the target parameter. Interpret the results.

b. Explain what the phrase “90% confidence” implies in part a.

Page 29: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Solution

a. For confidence coefficient .90, = .10 and = .05. From the table, = 1.645.

The confidence interval is:

Page 30: Sta220 - Statistics Mr. Smith Room 310 Class #16.

(10.084, 16. 316)

We are 90% confident that the true average amount of time per day laptops are used for taking notes for all middle school students across the country is between 10.084 and 16.316 minutes.

Page 31: Sta220 - Statistics Mr. Smith Room 310 Class #16.

b. “90% confidence” means that in a repeated sampling, 90% of all confidence intervals constructed in this manner will contain the true mean.

Page 32: Sta220 - Statistics Mr. Smith Room 310 Class #16.

Sometime, the we produce a confidence interval that is too wide. In this case, we want to reduce the width of the interval to obtain a more precise estimate of .

Go back to the hospital example. A 90% confidence interval for is (3.92, 5.14). The interval is narrower than the previously calculated 95% confidence interval (3.81, 5.25).

However, we also have “less confidence” in the 90% confidence interval.

Page 33: Sta220 - Statistics Mr. Smith Room 310 Class #16.

5-2 Homework Due Friday (also 5-3 and 5-5 will be due Friday, so I encourage you to work tomorrow).