Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and...

17
Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness

Transcript of Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and...

Page 1: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Chapter 1 Data PresentationStatistics and Data

Measurement LevelsSummarizing Data

Symmetry and Skewness

Page 2: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Statistics and Data

• Statistics – collection of techniques used in analyzing data – numbers produced in the analysis (eg. Average)

• Data – collection of measurements made on a number of subjects.

• Subjects – where information are drawn- experimental units

Page 3: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Data are usually stored in a row-and-column display called a spreadsheet.

From page 2 of the textbook

Row represents a subject and columns represent measure of variables.

Page 4: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Measurement Levels of Data

• Categorical Data – variables that yield categorical dataNominal – possible values are just names of categories

– no apparent ordering between the possible values examples: Gender, Major, College

Ordinal – there is an obvious ordering of the possible values example: Year level (Freshman, Sophomore …) , Military ranking

• Numerical Data - variables that yield numerical dataInterval – Interval exists but not ratios

– zero does not mean absence of that variableexamples: Temperature, IQ60 F vs 30 F, there is 30 degrees difference between the two temperatures but it does not mean that 60 F is twice as warm as 30F

Ratio – ratio exist examples: Age, Height, Number of classes taken this semester

Types of Data

Page 5: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Ratio : there are 2 other levels under ratio

Discrete: result of a counting processexample: number of classes being taken,

number of students in a class

Continuous: result of a measuring process.example: height, age, weight, velocity

Page 6: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Summary of Data type and Levels

Page 7: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Summarizing Data

Summarizing Categorical Data

1. Relative Frequency Table - represents the frequency of each type of categorical variable

2. Bar Chart - plot of the relative frequency table; order of categories is arbitrary

3. Pie Chart - also a plot of the relative frequency table, except in a circular shape

Page 8: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Relative Frequency Table

Major Frequency Rel. Freq. ACT 16 0.29GBS 28 0.5MGT 8 0.14MKT 4 0.07Total 56 1

Page 9: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Bar Chart of the Relative Frequency Table

0

5

10

15

20

25

30

ACT GBS MGT MKT

Frequency

Frequency

Using Frequency

Using Relative Frequency

0.29

0.5

0.14

0.07

0

0.1

0.2

0.3

0.4

0.5

0.6

ACT GBS MGT MKT

Percent Distribution of Major

Rel. Freq.

Page 10: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Pie Chart

0.29

0.5

0.14

0.07

Percent Distribution of Major

ACT

GBS

MGT

MKT

Page 11: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Summarizing Numerical Data

Stem and Leaf PlotRelative Frequency Table and Histogram

similar concept with the categorical datadetermine the following: number of classes, class widthFor example: MIN, MAX , number of classes, width = (MAX -MIN) /(classes-1)The intervals in each class should be mutually exclusive.The histogram will just be the graphical presentation of the RTF

Box-and-Whisker Plot

a graphical picture of the distribution of quarters of the data.Useful for comparing distributions of two or more variables

MinimumQ1 (first quartile) – the upper boundary of the first quarterMedian – divides the data into lower and upper halves.Q3 (third quartile) – the upper boundary of the third quarterMaximum

Dotplot

similar to the histogram but used for moderately large datathis can also be used in studying outliers in the data

Page 12: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Stem-and-leaf Display

Summer 2 Quiz Data:8, 11, 13, 19, 21, 23, 25, 25, 25, 28, 31, 35, 39, 47

0 81 1 3 92 1 3 5 5 5 83 1 5 94 7

Stemplot of Summer 2 Quiz

Page 13: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Relative Frequency Table and Histogram

Summer 2 Quiz Data:8, 11, 13, 19, 21, 23, 25, 25, 25, 28, 31, 35, 39, 47

For example, 4 classes is desired. MIN=8, MAX=47Class width = (47-8)/(4-1)=39/3=13

Class Freq. Rel.Freq-5 - 8 1 7%8 - 21 4 29%21 - 34 6 43%34 - 47 3 21%

14 100%Note: intervals include the right endpoint but not the left endpoint.

Page 14: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Histogram of the Summer 2 Quiz Data

Page 15: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Boxplot or Box-and-Whisker Plot

Minimum = 8Q1 (first quartile) =19Median = 25Q3 (third quartile) = 31Maximum = 47

Summer 2 Quiz Data:8, 11, 13, 19, 21, 23, 25, 25, 25, 28, 31, 35, 39, 47

Page 16: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

Symmetry and Skewness

Examining symmetry and skewness determines the shape of the data

If the left tail is longer than the right tail, then the data is left-skewed.If the right tail is longer than the left tail, then the data is right-skewed.If the left tail is almost the same as the right tail, then the data is symmetric.

Stem-and-leaf display, Histogram and Boxplot can be used to examine symmetry and skewness.

Page 17: Chapter 1 Data Presentation Statistics and Data Measurement Levels Summarizing Data Symmetry and Skewness.

The left tail is longer than the right tail, hence the data is left-skewed.