Descriptive statistics Describing data with numbers: measures of location.
-
Upload
randell-ferguson -
Category
Documents
-
view
227 -
download
0
Transcript of Descriptive statistics Describing data with numbers: measures of location.
![Page 1: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/1.jpg)
Descriptive statistics
Describing data with numbers:measures of location
![Page 2: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/2.jpg)
What to describe?
• What is the “location” or “center” of the data? (“measures of location”)
• How do the data vary? (“measures of variability”)
![Page 3: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/3.jpg)
Measures of Location
• Mean
• Median
• Mode
![Page 4: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/4.jpg)
Mean
• Another name for average.
• If describing a population, denoted as , the greek letter “mu”.
• If describing a sample, denoted as , called “x-bar”.
• Appropriate for describing measurement data.
• Seriously affected by unusual values called “outliers”.
![Page 5: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/5.jpg)
Calculating Sample Mean
ni
XX
Formula:
That is, add up all of the data points and divide by the number of data points.
Data (# of classes skipped): 2 8 3 4 1
Sample Mean = (2+8+3+4+1)/5 = 3.6
Do not round! Mean need not be a whole number.
![Page 6: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/6.jpg)
Median
• Another name for 50th percentile.
• Appropriate for describing measurement data.
• “Robust to outliers,” that is, not affected much by unusual values.
![Page 7: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/7.jpg)
Calculating Sample Median
Order data from smallest to largest.
If odd number of data points, the median is the middle value.
Data (# of classes skipped): 2 8 3 4 1
Ordered Data: 1 2 3 4 8
Median
![Page 8: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/8.jpg)
Calculating Sample Median
Order data from smallest to largest.
If even number of data points, the median is the average of the two middle values.
Data (# of classes skipped): 2 8 3 4 1 8
Ordered Data: 1 2 3 4 8 8
Median = (3+4)/2 = 3.5
![Page 9: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/9.jpg)
Mode
• The value that occurs most frequently.
• One data set can have many modes.
• Appropriate for all types of data, but most useful for categorical data or discrete data with only a few number of possible values.
![Page 10: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/10.jpg)
In Minitab:
Variable N Mean Median TrMean StDev SE MeanPhone 139 121.6 60.0 88.1 217.7 18.5
Variable Minimum Maximum Q1 Q3Phone 2.0 2000.0 30.0 120.0
N = number of data points
Sample mean
Sample median
![Page 11: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/11.jpg)
In Minitab:
• Select Stat.
• Select Basic Statistics.
• Select Display Descriptive Statistics.
• Select variable(s) of interest.
• Select OK.
![Page 12: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/12.jpg)
The most appropriate measure of location depends on …
the shape of the data’s distribution.
![Page 13: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/13.jpg)
Most appropriate measure of location
• Depends on whether or not data are “symmetric” or “skewed”.
• Depends on whether or not data have one (“unimodal”) or more (“multimodal”) modes.
![Page 14: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/14.jpg)
Symmetric and Unimodal
2.0 2.2 2.4 2.6 2.8 3.0 3.2 3.4 3.6 3.8 4.0
0
10
20
GPAs
Per
cent
![Page 15: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/15.jpg)
Symmetric and Unimodal
2 3 4
GPA
![Page 16: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/16.jpg)
Symmetric and UnimodalDescriptive Statistics
Variable N Mean Median TrMean StDev SE MeanGPA 92 3.0698 3.1200 3.0766 0.4851 0.0506
Variable Minimum Maximum Q1 Q3GPA 2.0200 3.9800 2.6725 3.4675
![Page 17: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/17.jpg)
Symmetric and Bimodal
![Page 18: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/18.jpg)
Symmetric and Bimodal
Variable N Mean Median TrMean StDev Males 84 70.048 70.000 70.092 3.030 Females 89 64.798 65.000 64.753 2.877 All 176 67.313 67.000 67.291 4.017
Variable SE Mean Min Max Q1 Q3Males 0.331 63.0 76.0 68.0 72.0Females 0.305 56.0 77.0 63.0 67.0All 0.303 56.0 77.0 64.0 70.0
![Page 19: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/19.jpg)
Symmetric and Bimodal
![Page 20: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/20.jpg)
Skewed Right
0 100 200 300 400
0
10
20
Number of Music CDs
Fre
quen
cyNumber of Music CDs of Spring 1998 Stat 250 Students
![Page 21: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/21.jpg)
Skewed Right
0 100 200 300 400
Number of CDs
![Page 22: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/22.jpg)
Skewed Right
Descriptive Statistics
Variable N Mean Median TrMean StDev SE MeanCDs 92 61.04 46.50 52.93 62.90 6.56
Variable Minimum Maximum Q1 Q3CDs 0.00 400.00 21.50 83.00
![Page 23: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/23.jpg)
Skewed Left
50 55 60 65 70 75 80 85 90 95 100
0
10
20
30
grades
Per
cent
![Page 24: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/24.jpg)
Skewed Left
![Page 25: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/25.jpg)
Skewed Left
Variable N Mean Median TrMean StDev SE Meangrades 22 89.18 93.50 90.60 12.92 2.76
Variable Minimum Maximum Q1 Q3grades 50.00 100.00 87.00 98.00
![Page 26: Descriptive statistics Describing data with numbers: measures of location.](https://reader036.fdocuments.net/reader036/viewer/2022062322/56649ea15503460f94ba413a/html5/thumbnails/26.jpg)
Choosing Appropriate Measure of Location
• If data are symmetric, the mean, median, and mode will be approximately the same.
• If data are multimodal, report the mean, median and/or mode for each subgroup.
• If data are skewed, report the median.