Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics...
Transcript of Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics...
![Page 1: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/1.jpg)
CIVL 3103
Introduction to Descriptive Statistics
![Page 2: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/2.jpg)
Learning Objectives
• To understand the goal of statistical methods of data analysis
• To apply multiple techniques (summary statistics, tables, graphics) to describe data, and to understand the benefits of each.
![Page 3: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/3.jpg)
Uncertainty in Engineering
• Uncertainty is inherent in all real world problems.
• Two types of uncertainty: – Natural randomness of underlying phenomena – Inaccurate estimation of real-world parameters
and conditions
• Statistics and probability are needed for describing information and forming the bases for design and
decision making under uncertainty.
![Page 4: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/4.jpg)
Purpose of Probability/Statistics in Engineering
• The purpose of probability and statistics is to deal with uncertainty.
• Probability and statistics are widely used in engineering contexts to develop models or information that can be used in engineering decision-making so that the associated risks and costs are effectively managed.
![Page 5: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/5.jpg)
Introduction to Descriptive Statistics
Descriptive vs. Inferential Statistics
DEFINITIONS – Population – all members of a class or category of interest – Parameter – a summary measure of the population (e.g.
average) – Sample – a portion or subset of the population collected as
data – Observation – an individual member of the sample (i.e., a
data point) – Statistic – a summary measure of the observations in a
sample
![Page 6: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/6.jpg)
Populations and Samples
Population Sample
Statistics
Inference
Parameters
![Page 7: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/7.jpg)
Summary Statistics
• Measures of Central Tendency – Arithmetic mean – Median – Mode
• Measures of Dispersion – Variance – Standard deviation – Coefficient of variation (COV)
![Page 8: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/8.jpg)
3 4 4 5 6 8 8 8 10
The sample median is given by:
Measures of Central Tendency 8 6 4 10 3 8 4 8 5
The sample mean is given by:
![Page 9: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/9.jpg)
3 4 4 5 6 8 8 8 10
Measures of Central Tendency
3 4 4 4 6 8 8 8 10
The mode of the sample is the value that occurs most frequently.
Bimodal
![Page 10: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/10.jpg)
Measures of Dispersion The most common measure of dispersion is the sample variance:
The sample standard deviation is the square root of sample variance:
![Page 11: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/11.jpg)
Measures of Dispersion
Coefficient of variation (COV):
This is a good way to compare measures of dispersion between different samples whose values don’t necessarily have the same magnitude (or, for that matter, the same units!).
![Page 12: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/12.jpg)
Frequency Distribution
Speeds (mph) Vehicles Counted
40–45 1
45–50 9
50–55 15
55–60 10
60–65 7
65–70 5
70–75 3
TOTAL 50
Vehicle Speeds on Central Avenue
A frequency distribution is a tabular summary of sample data organized into categories or classes
![Page 13: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/13.jpg)
Histogram
0
5
10
15
30-35 35-40 40-45 45-50 50-55 55-60 60-65 65-70 70-75 75-80 Speeds (mph)
Freq
uenc
y
0 40 45 50 55 60 65 70 75
A histogram is a graphical representation of a frequency distribution. Each class includes those observations who’s value is greater than the lower bound and less than or equal to the upper bound of the class.
![Page 14: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/14.jpg)
Histogram
0
5
10
15
30-35 35-40 40-45 45-50 50-55 55-60 60-65 65-70 70-75 75-80 Speeds (mph)
Freq
uenc
y
0
0
5
10
15
32.5 37.5 42.5 47.5 52.5 57.5 62.5 67.5 72.5 77.5 Speeds (mph)
Freq
uenc
y
0
Range
Class mark
![Page 15: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/15.jpg)
Symmetry and Skewness
![Page 16: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/16.jpg)
Relative Frequency Distribution
Speeds (kph) Vehicles Counted Pecentage of Sample
40–45 1 2
45–50 9 18
50–55 15 30
55–60 10 20
60–65 7 14
65–70 5 10
70–75 3 6
TOTAL 50 100
Vehicle Speeds on Poplar Avenue
![Page 17: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/17.jpg)
Relative Frequency Histogram
0%
10%
20%
30%
30-35 35-40 40-45 45-50 50-55 55-60 60-65 65-70 70-75 75-80 Speeds (mph)
Rel
ativ
e Fr
eque
ncy
0 40 45 50 55 60 65 70 75
![Page 18: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/18.jpg)
Cumulative Frequency Distributions
Speeds (mph) Vehicles Counted Percentage of Sample Cumulative Percentage
40–45 1 2 2
45–50 9 18 20
50–55 15 30 50
55–60 10 20 70
60–65 7 14 84
65–70 5 10 94
70–75 3 6 100
TOTAL 50 100
Vehicle Speeds on Poplar Avenue
![Page 19: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/19.jpg)
Cumulative Frequency Diagram
0%
25%
50%
75%
100%
30-35 35-40 40-45 45-50 50-55 55-60 60-65 65-70 70-75 75-80
Speeds (mph)
Cum
ulat
ive
Freq
uenc
y
0 40 45 50 55 60 65 70 75
A good rule of thumb is that the number of classes should be approximately equal to the square root of the number of observations.
![Page 20: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/20.jpg)
Cumulative Frequency Distributions
![Page 21: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/21.jpg)
Boxplots
• A boxplot is a graphic that presents the median, the first and third quartiles, and any outliers present in the sample.
• The interquartile range (IQR) is the difference between the third and first quartile. This is the distance needed to span the middle half of the data.
![Page 22: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/22.jpg)
Boxplots
Steps in the Construction of a Boxplot:
Compute the median and the first and third quartiles of the sample. Indicate these with horizontal lines. Draw vertical lines to complete the box.
Find the largest sample value that is no more than 1.5 IQR above the third quartile, and the smallest sample value that is not more than 1.5 IQR below the first quartile. Extend vertical lines (whiskers) from the quartile lines to these points.
Points more than 1.5 IQR above the third quartile, or more than 1.5 IQR below the first quartile are designated as outliers. Plot each outlier individually.
![Page 23: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/23.jpg)
Boxplots
![Page 24: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/24.jpg)
Boxplots – Example
![Page 25: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/25.jpg)
Boxplots - Example Notice there are no outliers in these data.
Looking at the four pieces of the boxplot, we can tell that the sample values are comparatively densely packed between the median and the third quartile.
The lower whisker is a bit longer than the upper one, indicating that the data has a slightly longer lower tail than an upper tail.
The distance between the first quartile and the median is greater than the distance between the median and the third quartile.
This boxplot suggests that the data are skewed to the left.
![Page 26: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/26.jpg)
Excel Tools
![Page 27: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/27.jpg)
Analysis Toolpack in Excel
![Page 28: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/28.jpg)
Other Statistics Software
U of M UMWare service: https://tstumware.memphis.edu/
![Page 29: Descriptive Statistics - Memphis Statistics.pdf · Introduction to Descriptive Statistics Descriptive vs. Inferential Statistics DEFINITIONS – Population – all members of a class](https://reader030.fdocuments.net/reader030/viewer/2022033121/5ec663746cd3db2e297285bf/html5/thumbnails/29.jpg)
SigmaPlot