Chapter 5 Describing Distributions Numerically

15
Chapter 5 Describing Distributions Numerically AP Statistics

description

Chapter 5 Describing Distributions Numerically. AP Statistics. Center and Spread. In this chapter, we will eliminate much of the vagueness of finding the center and spread of a distribution The values we use will be determined by the shape of the distribution. Skewed Distribution . - PowerPoint PPT Presentation

Transcript of Chapter 5 Describing Distributions Numerically

Page 1: Chapter 5 Describing Distributions Numerically

Chapter 5Describing Distributions Numerically

AP Statistics

Page 2: Chapter 5 Describing Distributions Numerically

Center and Spread

• In this chapter, we will eliminate much of the vagueness of finding the center and spread of a distribution

• The values we use will be determined by the shape of the distribution

Page 3: Chapter 5 Describing Distributions Numerically

Skewed Distribution

• Center: Median• Spread: Interquartile Range (IQR)

• Both of these are “resistant”• Both should include units

Page 4: Chapter 5 Describing Distributions Numerically

Skewed Distribution

How to find the IQR 1. Find median 2. Find the median of both halves of data

the lower median is 1st Quartilethe upper median is 3rd Quartile

3. Subtract the two quartile scores** 1st Quartile = 25th percentile** 3rd Quartile = 75th percentile

Page 5: Chapter 5 Describing Distributions Numerically

Outliers

• Really just data that seems unusual• Formally we compute fences and if data point

is outside the fences, we consider it an outlier• Always use common sense

• Upper fence:• Lower fence: IQRQ

IQRQ5.15.1

1

3

Page 6: Chapter 5 Describing Distributions Numerically

5-Number Summary and Boxplots

The five number summary is used to create a boxplot

Page 7: Chapter 5 Describing Distributions Numerically

Boxplots

Page 8: Chapter 5 Describing Distributions Numerically

Boxplot

• Complement histograms by providing more specific information

• Look at histogram and boxplot together

• Most useful when comparing distributions

Page 9: Chapter 5 Describing Distributions Numerically

Symmetric Distributions

• Center: Mean

• Spread: Standard Deviation

• Both are not “resistant”• Both should include units

ny

y

Page 10: Chapter 5 Describing Distributions Numerically

Standard Deviation

• Takes into account how far each value in a data set is from the mean

Formula:

Find the standard deviation:10, 14, 15, 16, 20

1

2

n

yys

Page 11: Chapter 5 Describing Distributions Numerically

Properties of standard deviation1. Only use with mean2. If s = 0, there is no spread and all data pieces

are same—other wise s>0 and s gets larger as data pieces get more spread out.

3. A few outliers can really change the value of the standard deviation

Page 12: Chapter 5 Describing Distributions Numerically

Distributions with Outliers

• Tricky situation• Since outliers affect mean and standard deviation,

it is usually better to use median and IQR• However, if the distribution is unimodal—use

mean and median and just report outliers separately

• However, if you find a simple reason for outlier (input), eliminate it and use mean and standard devation—if symmetric

Page 13: Chapter 5 Describing Distributions Numerically

Other information

• If distribution is symmetric, then mean=median• If skewed right, mean>median• If skewed left, mean<median• Spread of distribution is just as important as

the center• How accurate: one or two decimal points more

than original data• Reexpress data???

Page 14: Chapter 5 Describing Distributions Numerically

Computer Printout Analysis

Page 15: Chapter 5 Describing Distributions Numerically