1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 =...

38
1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter “Presenting Data in Tables and Charts” (PDITAC) Agenda: 1) Go over quiz on Homework 2

Transcript of 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 =...

Page 1: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

1

Business 90: Business Statistics

Professor David Mease

Sec 03, T R 7:30-8:45AM BBC 204

Lecture 8 = Finish Chapter “Presenting Data in Tables and Charts”

(PDITAC)

Agenda:1) Go over quiz on Homework 22) Reminder about Homework 3 (due Tuesday)3) Lecture over rest of Chapter PDITAC

Page 2: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

2

Homework 3 - Due Tuesday 2/23 1) The dataset at http://www.cob.sjsu.edu/mease_d/sec4lettergrades.xls gives the letter grades for a quiz I gave once.

a) Make a summary table for the letter grades using the PivotTable in Excel. In your summary table list the grades in the order A+, A, A-, B+, etc. Double check a few of your answers by hand.

b) Make the bar chart using Excel with the grades in the same order as in part A.

c) Make the pie chart using Excel.

d) Make the pareto diagram using Excel.

2) The dataset http://www.cob.sjsu.edu/mease_d/America_West_Flights.xls contains flight status information for America West flights departing from four major West Coast airports. Make a contingency table for this data using the PivotTable feature in Excel.

3) Do textbook problem number 48 in Chapter “Presenting Data in Tables and Charts”.

4) The dataset at http://www.cob.sjsu.edu/mease_d/gpa-data.xls contains data from 20 San Jose State University graduating seniors who were asked to report their high school GPA (first column) and their current college GPA (second column).

a) Make a scatter plot of this data with High School GPA on the X-axis and College GPA on the Y-axis using Excel.

b) Give the equation of the least squares regression line using Excel.

c) What is the slope of the least squares regression line?

d) Interpret the slope of the least squares regression line.

e) What is the coefficient of correlation?

f) What is the value of R-squared?

g) Use the least squares regression line to predict the college GPA of a student who had a high school GPA of 2.7.

Important: Again, be sure to print out your solutions and bring them with you to class for the quiz.

Page 3: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

3

Presenting Data in Tables and Charts

Statistics for ManagersUsing Microsoft® Excel

4th Edition

Page 4: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

4

Chapter Goals

After completing this chapter, you should be able to: Create an ordered array Construct and interpret a frequency distribution,

histogram, and polygon for numerical data Construct and interpret a cumulative percentage

distribution and ogive for numerical data Create and interpret contingency tables, bar charts,

and pie charts for categorical data Create and interpret a scatter diagram and a least

squares regression line (in other chapter p. 387-398) Describe appropriate and inappropriate ways to

display data graphically

Page 5: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

5

Graphs and Tables for Two Variables (Bivariate Data)

Two Numerical Variables:

Scatter Diagram

Two Categorical Variables:

Contingency Table (also called cross-classification table or two-way table)

Side-by-Side Bar Chart

Page 6: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

6

Contingency Tables Using Excel

Just like with summary tables, to make a contingency table in Excel, it is often useful to use a “Pivot Table” to count the frequencies of the different categories, especially for large datasets. This is done by selecting “Data” and then “PivotTable and PivotChart Report”. Next go to “Layout” and drag the name of one variable into the row and the other into the column. Pick either one and also drag it into the data area. (Be sure you name the two columns where you have the data first.)

Page 7: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

7

Contingency Tables Using Excel

Page 8: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

8

Contingency Tables Using Excel

Page 9: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

9

Contingency Tables Using Excel

Page 10: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

10

Contingency Tables Using Excel

Page 11: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

11

Contingency Tables Using Excel

Page 12: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

12

Contingency Tables Using Excel

Page 13: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

13

Contingency Tables Using Excel

Page 14: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

14

In class exercise #25:The file http://www.cob.sjsu.edu/mease_d/gender_and_major.xls lists the genders and majors for Bus 90 students from a previous term. Make a contingency table using the Pivot Table feature in Excel. Put Gender along the side and major along the top.

Page 15: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

15

In class exercise #25:The file http://www.cob.sjsu.edu/mease_d/gender_and_major.xls lists the genders and majors for Bus 90 students from a previous term. Make a contingency table using the Pivot Table feature in Excel. Put Gender along the side and major along the top.

ANSWER: MajorGender Business Non-Business UndeclaredFemale 49 6 2Male 48 13 2

Page 16: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

16

Graphs and Tables for Two Variables (Bivariate Data)

Two Numerical Variables:

Scatter Diagram

Two Categorical Variables:

Contingency Table (also called cross-classification table or two-way table)

Side-by-Side Bar Chart

Page 17: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

17

In class exercise #26:Make a side-by-side bar chart for the data from ICE #25 by hand. Put the major along the x-axis and use different colored bars for the two genders.

Page 18: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

18

In class exercise #26:Make a side-by-side bar chart for the data from ICE #25 by hand. Put the major along the x-axis and use different colored bars for the two genders.

ANSWER:Gender and Major

0

5

10

15

20

25

30

35

40

45

50

Business Non-Business Undeclared

Major

Fre

qu

ency

.

.

Female

Male

Page 19: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

19

Graphs and Tables for Two Variables (Bivariate Data)

Two Numerical Variables:

Scatter Diagram

Two Categorical Variables:

Contingency Table (also called cross-classification table or two-way table)

Side-by-Side Bar Chart

Page 20: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

20

Scatter Diagrams are used for bivariate numerical data Bivariate data consists of paired

observations taken from two numerical variables

The Scatter Diagram: one variable is measured on the vertical

axis and the other variable is measured on the horizontal axis

Scatter Diagrams

Page 21: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

21

Scatter Diagram Example

Cost per Day vs. Production Volume

0

50

100

150

200

250

0 10 20 30 40 50 60 70

Volume per Day

Cos

t per

Day

Volume per day

Cost per day

23 125

26 140

29 146

33 160

38 167

42 170

50 188

55 195

60 200

Page 22: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

22

Scatter Diagrams in Excel

Select “Insert” > “Chart”

1

2Select XY(Scatter) option,

then click “Next”

The data range is the y values and the x values go under the “Series” tab

Important: Don’t include column names

3

Page 23: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

23

In class exercise #27:The file http://www.cob.sjsu.edu/mease_d/football.xls gives the total number of wins for each of the 117 Division 1A college football teams for the 2003 and 2004 seasons. Use Excel to make a scatter diagram for this data. Put 2003 wins on the x-axis.

Page 24: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

24

In class exercise #27:The file http://www.cob.sjsu.edu/mease_d/football.xls gives the total number of wins for each of the 117 Division 1A college football teams for the 2003 and 2004 seasons. Use Excel to make a scatter diagram for this data. Put 2003 wins on the x-axis.

ANSWER:

Football

0

2

4

6

8

10

12

0 2 4 6 8 10 12

2003 Wins

2004

Win

s

Page 25: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

25

Described on pages 387-398 (in a different chapter)

It is the line that fits the data the best as determined by minimizing squared vertical differences

The coefficient of correlation (r) measures the strength and direction of the linear relationship (positive=up, negative=down)

R-squared also measures the strength of the linear relationship, but not the direction

The Least Squares Regression Line

Page 26: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

26

Scatter Plots of Data with Various Correlation Coefficients

Y

X

Y

X

Y

X

Y

X

r = -1 r = -.6

r = +.3r = +1

Page 27: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

27

Adding the Least Squares Regression Line Using Excel

Click once on the graph1

Page 28: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

28

Adding the Least Squares Regression Line Using Excel

From the “Chart” menu select “Add Trendline”2

Page 29: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

29

Adding the Least Squares Regression Line Using Excel

Choose the first choice (“Linear”) and press “OK”3

Page 30: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

30

Adding the Least Squares Regression Line Using ExcelThe line should now appear on your scatter diagram. Double click on the line then under the “Options” tab check the last two

boxes.4

Page 31: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

31

In class exercise #28:A) Graph the least squares regression line for the football data on the scatter diagram using Excel.

B) Give the equation of the least squares regression line using Excel.

C) What is the slope of the least squares regression line?

D) Interpret the slope of the least squares regression line.

E) What is the coefficient of correlation?

F) What is the value of R-squared?

G) Use the least squares regression line to predict the number of 2004 wins for a team that won 12 games in 2003.

Page 32: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

32

In class exercise #28:A) Graph the least squares regression line for the football data on the scatter diagram using Excel.

ANSWER for Part A:

Football Data

y = 0.5716x + 2.1623

R2 = 0.4274

0

2

4

6

8

10

12

0 2 4 6 8 10 12

2003 Win Total

2004

Win

Tot

al

Page 33: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

33

Principles of Graphical Excellence

Present data in a way that provides substance, statistics and design

Communicate complex ideas with clarity, precision and efficiency

Give the largest number of ideas in the most efficient manner

Excellence almost always involves several dimensions

Tell the truth about the data

Page 34: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

34

Using “chart junk” Failing to provide a relative

basis in comparing data

between groups Compressing or distorting the vertical axis Providing no zero point on the vertical axis

Errors in Presenting Data

Page 35: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

35

Compressing Vertical Axis

Good Presentation

Quarterly Sales Quarterly Sales

Bad Presentation

0

25

50

Q1 Q2 Q3 Q4

$

0

100

200

Q1 Q2 Q3 Q4

$

Page 36: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

36

No Zero Point On Vertical Axis

Monthly Sales

0

39

42

45

J F M A M J

$

36

0

20

40

60

J F M A M J

$

Good Presentations

Monthly Sales

Bad Presentation

36

39

42

45

J F M A M J

$

Graphing the first six months of sales

or

Page 37: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

37

No Relative Basis

Good PresentationA’s received by

students.A’s received by

students.

Bad Presentation

0

200

300

FR SO JR SR

Freq.

10%

30%

FR SO JR SR

FR = Freshmen, SO = Sophomore, JR = Junior, SR = Senior

100

20%

0%

%

Page 38: 1 Business 90: Business Statistics Professor David Mease Sec 03, T R 7:30-8:45AM BBC 204 Lecture 8 = Finish Chapter Presenting Data in Tables and Charts.

38

Chart Junk

Good Presentation

1960: $1.00

1970: $1.60

1980: $3.10

1990: $3.80

Minimum Wage Minimum Wage

0

2

4

1960 1970 1980 1990

$

Bad Presentation