Descriptive Statistics And related matters. Two families of statistics Descriptive statistics –...
-
Upload
garey-bond -
Category
Documents
-
view
240 -
download
3
Transcript of Descriptive Statistics And related matters. Two families of statistics Descriptive statistics –...
Descriptive Statistics
And related matters
Two families of statistics
• Descriptive statistics – procedures for summarizing, organizing, graphing, and, in general, describing quantitative information– Mean, standard deviation, # of items, etc.
• Inferential statistics – statistics that allow one to draw conclusions or inferences from the data– ANOVA, t-test, correlation, etc.
Vogt, W. P. (1999). Dictionary of statistics & methodology (2nd ed.). Thousand Oaks, CA: Sage Publications.
Scales of measurement
• Nominal– Used to name or categorize things– Female = 1,Male = 2, correct = 1,incorrect =0– Often used for coding variables in research
• Ordinal– Used to order things– Gives relative position but not amount– Rankings are ordinal
Shavelson, R. J. (1996). Statistical reasoning for the behavioral sciences (Third ed.). Needham Heights, MA: Allyn & Bacon.
Scales of measurement (2)
• Interval– Each scale unit represents an equal distance of
the attribute being measured– Most test scores are considered interval scales– Rating scales are often treated as interval
• Ratio– Interval scales with a meaningful zero point
where zero indicates the absence of the attribute– examples: weight, height, length
Scale summary
• Nominal scales categorize but do not order.
• Ordinal scales categorize and order.
• Interval scales categorize, order, and establish an equal unit in the scale.
• Ratio scales categorize, order, establish an equal unit, and contain a true zero point.
Wiersma, W., & Jurs, S. G. (1990). Educational measurement and testing (2nd ed.). Needham Heights, MA: Allyn and Bacon, p. 13.
Frequency information
• Frequency distribution – how many students received each score
• Cumulative frequency – how many students scored at or below the score in question
• Cumulative percentage – what percent of students scored at or below the score in question
• Useful for seeing patterns in the data
TOTAL
42.5
40.0
37.5
35.0
32.5
30.0
27.5
25.0
22.5
20.0
17.5
15.0
12.5
10
8
6
4
2
0
Std. Dev = 8.12
Mean = 29.4
N = 40.00
Output for SPSS
Measures of Central Tendency
• The four “M”s– Mean– Mode– Median– Midpoint
Think about it…
• Scores: 18, 19, 20, 21, 87• Which give a more accurate picture of this
data, the mean or the median?• Mean = 33• Median = 20• The median is usually more appropriate as
a measure of central tendency when there is an outlier.
For a norm-referenced test
(Henning, 1987, p. 39)
Measures of Dispersion
• Range– High score– Low score
• Standard Deviation
• Variance
Conceptualizing variance
• Imagine a set of scores
8 10 13 9 7 11 10 12 10 9 11Picture those scores on a number line
Williams & Monge (2001) Reasoning with statistics
Conceptualizing variance (2)
• Imagine those scores as deviations from the mean (how far are they from the mean?)
Conceptualizing variance (3)
• Variance: the mean of the squared deviation scores about the mean of a distribution
73.211
94110001149
Variance formula
N
XXS
2
2 )(
Standard Deviation formula
N
XXS
2)(
The Normal Distribution
(Brown, 1996, p. 130)
Tail Tail
Peak
Sample versus population
Describing distributions
Leptokurtic Platykurtic
Think “Leprechaun” Think “Platypus”
Skewed distributions
(Brown, 1996, p. 141)
Tail
Tail
Standardized scores
• A transformation of raw scores into a measure of relative standing based on the mean and standard deviation
• Useful for comparing performance on tests of different lengths, different forms, etc.
• The most often used standardized scores are z-scores, T-scores, and CEEB scores.
• Relative standing is usually based on the norm group (for a norm-referenced test)
Standard score comparison
(Brown, 1996, p. 135)
Practice
A1. 85 is what percentile?
16 (15.9)
A2. What percent between 70 and 115?
82 (81.85)
A3. How many SD is Iliana (177)?
About 5 (5.13)
A4. Iliana’s z =? T=?
CEEB =?
z = (x – m) / sd
T = 10z + 50
CEEB = 100z + 500
5 (5.13)100 (101.3)
1000 (1013)
Application exercises
Student Raw z-score T-score CEEB
A 64 70
B 50
C -1
D -1.5 350
2 700
0 50 500
43 40 400
39.5 35
Raw score mean = 50, raw score standard deviation = 7
Population and Sample
Population S1 S2 S3 S4 S5 S6 S7 S8 S9 S10
3 3 3 3 3 3
6 6 6 6 6 6 6
6 6 6 6 6 6
9 9 9 9 9
12 12 12 12 12 12 12
15 15 15 15 15 Average
Mean 8.5 5 12 7 7 10 10 7 9 8 8 8.3
SD(N) 4.03 1.41 2.45 3.74 1.41 5.10 3.74 3.74 2.45 5.10 2.83 3.20
SD(N-1) 1.73 3.00 4.58 1.73 6.24 4.58 4.58 3.00 6.24 3.46 3.92