Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve...

53
Math 1710 Class 2 V2u If You’ve Studied Stats ... A Simple Median and Quartile Example Describing the Dist. of Quant. Data Outliers Math 1710 Class 2 Dr. Allen Back Aug. 26, 2016

Transcript of Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve...

Page 1: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Math 1710 Class 2

Dr. Allen Back

Aug. 26, 2016

Page 2: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

AP Stats vs. 1710

Some different emphases.

But generally comparable.So retaking not recommended.Though many people do.If you do, pls be sure towork/attend regularly.Our exams quite different.(And not that close to textbook.)Lecture a good guide to exam priorities.(Some explanations will go beyondwhat you will be responsible for.)

Page 3: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

AP Stats vs. 1710

Some different emphases.But generally comparable.

So retaking not recommended.Though many people do.If you do, pls be sure towork/attend regularly.Our exams quite different.(And not that close to textbook.)Lecture a good guide to exam priorities.(Some explanations will go beyondwhat you will be responsible for.)

Page 4: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

AP Stats vs. 1710

Some different emphases.But generally comparable.So retaking not recommended.

Though many people do.If you do, pls be sure towork/attend regularly.Our exams quite different.(And not that close to textbook.)Lecture a good guide to exam priorities.(Some explanations will go beyondwhat you will be responsible for.)

Page 5: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

AP Stats vs. 1710

Some different emphases.But generally comparable.So retaking not recommended.Though many people do.

If you do, pls be sure towork/attend regularly.Our exams quite different.(And not that close to textbook.)Lecture a good guide to exam priorities.(Some explanations will go beyondwhat you will be responsible for.)

Page 6: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

AP Stats vs. 1710

Some different emphases.But generally comparable.So retaking not recommended.Though many people do.If you do, pls be sure towork/attend regularly.

Our exams quite different.(And not that close to textbook.)Lecture a good guide to exam priorities.(Some explanations will go beyondwhat you will be responsible for.)

Page 7: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

AP Stats vs. 1710

Some different emphases.But generally comparable.So retaking not recommended.Though many people do.If you do, pls be sure towork/attend regularly.Our exams quite different.

(And not that close to textbook.)Lecture a good guide to exam priorities.(Some explanations will go beyondwhat you will be responsible for.)

Page 8: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

AP Stats vs. 1710

Some different emphases.But generally comparable.So retaking not recommended.Though many people do.If you do, pls be sure towork/attend regularly.Our exams quite different.(And not that close to textbook.)

Lecture a good guide to exam priorities.(Some explanations will go beyondwhat you will be responsible for.)

Page 9: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

AP Stats vs. 1710

Some different emphases.But generally comparable.So retaking not recommended.Though many people do.If you do, pls be sure towork/attend regularly.Our exams quite different.(And not that close to textbook.)Lecture a good guide to exam priorities.

(Some explanations will go beyondwhat you will be responsible for.)

Page 10: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

AP Stats vs. 1710

Some different emphases.But generally comparable.So retaking not recommended.Though many people do.If you do, pls be sure towork/attend regularly.Our exams quite different.(And not that close to textbook.)Lecture a good guide to exam priorities.(Some explanations will go beyondwhat you will be responsible for.)

Page 11: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

Units are metric tons per person per year.

Page 12: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

8 Most Populous Countries in the World (a few years ago):

Country tons/yr.China 2.3India 1.1

US 19.7Indonesia 1.2

Brazil 1.8Russia 9.8

Pakistan .7? .2

Page 13: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

8 Most Populous Countries in the World:

Country tons/yr.China 2.3India 1.1

US 19.7Indonesia 1.2

Brazil 1.8Russia 9.8

Pakistan .7Bangladesh .2

Page 14: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

In order:

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8 19.7

Page 15: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

In order with positions: (n = 8)

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8 19.7Posn. : 1 2 3 4 5 6 7 8

Page 16: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

In order with positions: (n = 8)

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8 19.7Posn. : 1 2 3 4 5 6 7 8

The median is the middle value.A basic measure of center.When the sample size is even, we average the two middlevalues.

Page 17: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The median is the middle value.A basic measure of center.When the sample size is even, we average the two middlevalues.Sample size (n = 8)

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8 19.7Posn. : 1 2 3 4 5 6 7 8

median =1.8 + 1.2

2= 1.5.

Page 18: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8 19.7

Histogram of all 8

Page 19: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8 19.7

Histogram of all 8

The US value of 19.7 is not in keeping with the rest if the data.Such a value is called an outlier.

Page 20: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The US value of 19.7 is not in keeping with the rest if the data.Such a value is called an outlier.

Histogram of all but US. (n=7)

Page 21: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The US value of 19.7 is not in keeping with the rest if the data.Such a value is called an outlier.

Histogram of all but US. (n=7)

Removal of the outlier gives a much more revealing histogram.

Page 22: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

Without the outlier: (n = 7)

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8Posn. : 1 2 3 4 5 6 7

With n odd, the median is just the middle value of 1.2.

Page 23: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

Without the outlier: (n = 7)

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8Posn. : 1 2 3 4 5 6 7

With n odd, the median is just the middle value of 1.2.

Page 24: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The first quartile Q1 or 25th percentile is defined to be themedian of the bottom half of our data.

Page 25: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The first quartile Q1 or 25th percentile is defined to be themedian of the bottom half of our data.For a data set of odd sample size, we do not include themedian in the bottom half:

Page 26: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The first quartile Q1 or 25th percentile is defined to be themedian of the bottom half of our data.For a data set of odd sample size, we do not include themedian in the bottom half:Without the outlier: (n = 7)

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8Posn. : 1 2 3 4 5 6 7

Q1 = .7

Page 27: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The first quartile Q1 or 25th percentile is defined to be themedian of the bottom half of our data.For a data set of odd sample size, we do not include themedian in the bottom half:Without the outlier: (n = 7)

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8Posn. : 1 2 3 4 5 6 7

Q1 = .7

Page 28: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The first quartile Q1 or 25th percentile is defined to be themedian of the bottom half of our data.For a data set of odd sample size, we do not include themedian in the bottom half:Without the outlier: (n = 7)

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8Posn. : 1 2 3 4 5 6 7

Q1 = .7

(And Q3 = 2.3.)The convention about not including the middle changed in the3rd edition of (a sibling by the same authors of) our text.

Page 29: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

For the original data set: (n = 8)

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8 19.7Posn. : 1 2 3 4 5 6 7 8

Q1 =.7 + 1.1

2= .9 and Q3 =

2.3 + 9.8

2= 6.05

Page 30: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The 5-number summary:

min, Q1, median, Q3, maxall 8 .2, .9, 1.5, 6.05, 19.7

w/o US .2, .7, 1.2, 2.3, 9.8

Page 31: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The 5-number summary:

min, Q1, median, Q3, maxall 8 .2, .9, 1.5, 6.05, 19.7

w/o US .2, .7, 1.2, 2.3, 9.8

Boxplot - Graphical form of the 5 number summary:

all - n=8

Page 32: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

The 5-number summary:

min, Q1, median, Q3, maxall 8 .2, .9, 1.5, 6.05, 19.7

w/o US .2, .7, 1.2, 2.3, 9.8

Boxplot - Graphical form of the 5 number summary:

Without the Outlier - n=7

Page 33: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

Interquartile Range: IQR=Q3 − Q1.A basic measure of spread.

Page 34: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

Interquartile Range: IQR=Q3 − Q1.A basic measure of spread.The median and IQR are usually little affected by outliers.“Resistant to Outliers”

Page 35: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

Interquartile Range: IQR=Q3 − Q1.A basic measure of spread.The median and IQR are usually little affected by outliers.“Resistant to Outliers”

Here:

n Q1 med Q3 IQR

with outlier 8 .9 1.5 6.05 5.15w/o outlier 7 .7 1.2 2.3 1.6

It is mostly because of the small sample size that the medianand IQR change as much as they do here.

Page 36: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Per Capita CO2 Emissions

Spreadsheets do“wild” things when computing quartiles:

tons/yr : .2 .7 1.1 1.2 1.8 2.3 9.8 19.7

Open Office, an Excel Clone

One way to get such numbers:With 8 numbers there are 7 intervals in between..7 is the 100

7 %ile.1.1 is the 200

7 %ile.25 = 1

4 ·1007 + 3

4 ·2007

So the 25th %ile is 14 · .7 + 3

4 · 1.1 = 1.

Page 37: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Describing the Distribution of Quantitative Data

To write a verbal description:Comment on the shape, center, spread, and any unusualfeatures of the distribution.

Page 38: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Describing the Distribution of Quantitative Data

To write a verbal description:Comment on the shape, center, spread, and any unusualfeatures of the distribution.

Histogram of all but US. (n=7)

Per Capita CO 2 Emissions:n Q1 med Q3 IQR

with outlier 8 .9 1.5 6.05 5.15w/o outlier 7 .7 1.2 2.3 1.6

Page 39: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Describing the Distribution of Quantitative Data

Histogram of all but US. (n=7)

Per Capita CO 2 Emissions:n Q1 med Q3 IQR

with outlier 8 .9 1.5 6.05 5.15w/o outlier 7 .7 1.2 2.3 1.6

Page 40: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Describing the Distribution of Quantitative Data

The distribution of per capita CO2 emissions of the 8 mostpopulous countries is skewed to the right with a centersomewhere between 1 and 2 tons per year. The U.S. andpossibly Russia are outliers on the big side. The interquartilerange of 1.6 tons per year reflects the fact that aside from thetwo biggest producers, the six other countries produce less than2.3 tons per person per year.

Page 41: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Describing the Distribution of Quantitative Data

The distribution of per capita CO2 emissions of the 8 mostpopulous countries is skewed to the right with a centersomewhere between 1 and 2 tons per year. The U.S. andpossibly Russia are outliers on the big side. The interquartilerange of 1.6 tons per year reflects the fact that aside from thetwo biggest producers, the six other countries produce less than2.3 tons per person per year.

Note skewed to the right or skewed positive, means stretchedout towards the right; not the convention most newcomers findinstinctive.

Page 42: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Forbes

790 CEO Salaries 1994

Page 43: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Forbes

790 CEO Salaries 1994

Page 44: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Forbes

CEO Salaries 1994 Boxplot

Page 45: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Forbes

All But Top 9 CEO Salaries 1994

Page 46: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Forbes

All But Top 9 CEO Salaries 1994

Page 47: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Forbes

CEO Salaries 1994 Boxplot

Page 48: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Comparing Measures of Center

n Mean x̄ MedianAll 790 2.82M 1.304MWithout top 9 781 2.24M 1.296M

Means heavily affected by outliers.

Medians resistant to outliers.

Page 49: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Comparing Measures of Center

n Mean x̄ MedianAll 790 2.82M 1.304MWithout top 9 781 2.24M 1.296M

Means heavily affected by outliers.

Medians resistant to outliers.

Page 50: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Comparing Measures of Center

n Mean x̄ MedianAll 790 2.82M 1.304MWithout top 9 781 2.24M 1.296M

Means heavily affected by outliers.

Medians resistant to outliers.

Page 51: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Comparing Measures of Spread

n Std. Dev.s IQRAll 790 8.32M 1.731MWithout top 9 781 2.724M 1.662M

Std. Dev heavily affected by outliers.

IQR resistant to outliers.

Page 52: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Comparing Measures of Spread

n Std. Dev.s IQRAll 790 8.32M 1.731MWithout top 9 781 2.724M 1.662M

Std. Dev heavily affected by outliers.

IQR resistant to outliers.

Page 53: Math 1710 Class 2pi.math.cornell.edu/~web1710/slides/aug26_v2u.pdf · Class 2 V2u If You’ve Studied Stats... A Simple Median and Quartile Example Describing the Dist. of Quant.

Math 1710Class 2

V2u

If You’veStudied Stats. . .

A SimpleMedian andQuartileExample

Describing theDist. ofQuant. Data

Outliers

Comparing Measures of Spread

n Std. Dev.s IQRAll 790 8.32M 1.731MWithout top 9 781 2.724M 1.662M

Std. Dev heavily affected by outliers.

IQR resistant to outliers.