Statistical Analysis Notes. Vocabulary Statistic: a number that describes a sample Parameter: a...

20
Statistical Analysis Notes

Transcript of Statistical Analysis Notes. Vocabulary Statistic: a number that describes a sample Parameter: a...

Page 1: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Statistical AnalysisNotes

Page 2: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Vocabulary

Statistic: a number that describes a sample

Parameter: a number that describes a population

We will study how to use statistics to estimate parameters. In other words, we can use information about a small group to infer things about a large group, provided that the small group is fairly representative of the large group.

Page 3: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Let’s look at an example.

Page 4: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

In a survey of 50 students at a high school, 32 students said they plan to go to the prom. The school has 720

students.

The statistic is that 32 out of 50 students in the sample say they are going to the prom.

The parameter would be the number of students in the school that plan to go to the prom.

We can use the statistic to estimate the parameter.

Page 5: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Example, continued

We can write a proportion to estimate the parameter:

Now, let’s substitute in this information from the problem:

Cross-multiply to solve for x: x = 460.8

This means that about 461 students plan to attend the dance.

Page 6: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

More About Statistics and Parameters

A lottery ticket says the 1 in 3 tickets will be a winner. Michael bought 5 tickets, but hasn’t won a prize yet. He thinks the advertised odds must be incorrect.

However, Michael is basing this on too small of a sample.

Remember that samples must be random and representative of an entire population if we are going to use them to draw VALID conclusions.

Page 7: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Types of Sampling Distributions

Simple Random Sample: members are chosen so that everyone has an equal chance of being selected Example: 100 students’ names are randomly chosen from a

list of all students in the school

Systematic Sample: members are chosen using a pattern Example: every other student is selected from a list of all

students

Stratified Sample: members are randomly chosen from subgroups created from the population Example: 3 students are chosen from each class in school

Page 8: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Types of Sampling Distributions, continued

Cluster Sample: an entire subgroup is surveyed after the subgroup was randomly chosen from the population Example: an entire English class is surveyed

Convenience Sample: members are chosen because they are easily accessible Example: 100 students with first lunch are surveyed

because the surveyor also has first lunch

Self-Selected Sample: members voluntarily participate Example: 100 students show up to take a survey after

school

Page 9: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

More About Sampling

Usually cost and accuracy are the most important considerations when choosing how to sample a population

A census is the most accurate survey, but is time-consuming and costly

More accurate sampling methods are usually more difficult to use or are more expensive than others

Simple random samples, stratified samples, and cluster samples are very accurate

Convenience samples and self-selected samples are not very accurate due to the nature of the individuals involved

Page 10: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Let’s practice! Classify the sampling method used in each scenario below. Then, go to the next slide to check your answers.

1. Town officials are deciding whether to build a public park downtown. They decide to ask residents to come to a town hall meeting to vote on the issue.

2. The owner of a movie theater wants to know how customers feel about the new 3D glasses the theater is using. She asks every 25th person who exits the theater each Friday night for a month.

3. The manager of the produce department at a grocery store wants to determine the quality of the pears they receive. She examines several randomly selected oranges from each of the 10 crates.

Page 11: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

How well did you do?

1. Self-Selected Sample – the individuals volunteer to come to the meeting to vote on whether or not there should be a park built

2. Systematic Sample – individuals are chosen using a pattern

3. Stratified Sample - pears are chosen randomly from each crate

Page 12: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

It’s time to apply what we’ve learned!

We are going to conduct a simulation of random sampling.

To do this, we are going to use a random digits table, that you should have downloaded and printed earlier. If you haven’t done so already, please click on the link to it on the main page and print it now.

A random digit table contains lines of numbers in which all of the digits from 0 to 9 are represented equally.

We will use these digits to randomly choose individuals, and use their information to find a sample mean.

Page 13: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Below are the last names of 20 managers of a set of retail stores. The owner of the store wants to know the opinions of the managers about the stores’ new

pricing system.We will use the random digits table to select 6

managers to be interviewed by the owner.

Baker Helms Myers Smith

Boss Jones Nixon Stamm

Crown King O’Neal Thorn

Cruz Loren Rogers

West

Duram

Moss Sand Wills

Page 14: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

First, let’s assign each manager a number.

01 Baker 06 Helms 11 Myers 16 Smith

02 Boss 07 Jones 12 Nixon 17 Stamm

03 Crown 08 King 13 O’Neal 18 Thorn

04 Cruz 09 Loren 14 Rogers 19 West

05 Duram 10 Moss 15 Sand 20 Wills

Page 15: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Now, let’s use line 139 in our table to choose our 6.

Line 139 in the table is 55588 99404 70708 41098 43563 56934 48394 51719

We go through the row to find double-digit numbers that could represent one of our managers. We would see 55, 58, 89, and 94 that are not aligned with our numbers…but then we get 04 which matches Manager Cruz. Cruz will be our first choice to be interviewed.

Continue using the table to identify 5 others. The highlighted numbers represent our other candidates: 70, 70, 84, 10, 98, 43, 56, 35, 69, 34, 48, 39, 45, 17, 19.

We have reached the end of the row and have chosen 4 managers: Cruz (04), Moss (10), Stamm (17), and West (19). We still need 2 more, so we can just continue on to line 140 in the table.

12, 97, 51, 32, 58, 13

The other two managers we will choose are Nixon (12) and O’Neal (13).

Page 16: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

More About Using Random Numbers

Unless told a specific row to use, you can choose any row and use it to select individuals.

You can also choose how to identify your individuals. We used two-digit numbers, but you can also choose three-digit numbers, four-digit numbers, etc.

Using random numbers to select our individuals ensures that we get a simple random sample.

Page 17: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Here’s one last example that we can use to practice using the random digits table.

Researchers want to know the average length of a great white shark. Since it would be impossible to measure every great white shark in the world, they recorded the measurements of the lengths of 24 great whites who entered a research facility. The data is summarized on the next slide, where the whales have been numbered, and their weights are given in feet.

Page 18: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Lengths of 24 Sharks

Shark #

Length

Shark #

Length

Shark #

Length

Shark #

Length

01 18.7 07 12.3 13 18.6 19 16.4

02 15.7 08 18.3 14 14.6 20 15.8

03 14.9 09 17.6 15 12.1 21 16.4

04 16.7 10 17.8 16 16.2 22 12.6

05 17.8 11 13.8 17 12.2 23 15.2

06 14.7 12 12.4 18 13.2 24 15.8

Page 19: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

Here’s what we need to do.

First, let’s use the random number table to choose 7 sharks. So that we can check our work, let’s start at line 116. Remember to continue on to the next line if you need to.

You should get in order: 14, 03, 10, 22, 06, 11, 16

The mean of our sample would be the average of these lengths. If you add up the lengths of these 7 sharks and divide by 7 you should get 14.93 inches.

Page 20: Statistical Analysis Notes. Vocabulary  Statistic: a number that describes a sample  Parameter: a number that describes a population  We will study.

In summary,

The random number table can be time-consuming, especially if we need to use a large sample size.

We also need to be able to identify a large number of characteristics of populations, and the random number process limits how much information we can accurately infer.

In our next lesson, we will discuss ways to expand our use of information from a sample in order to more accurately describe an entire population.