Post on 13-Dec-2015
REVIEW
• Test (& types of tests)
• Item response scoring paradigms
• Data paradigm of test theory (typical)
DATA PARADIGM

Responses are arranged as an N × J data matrix X = (Xnj):
Persons n = 1, …, N (rows); Items j = 1, …, J (columns); Categories k = 0, …, K.

• The row sum for person n is the person's total score.
• The column sum for item j is the item's total score, which reflects item difficulty.
• The corresponding matrix of response probabilities is P = (pnj), n = 1, …, N; j = 1, …, J.
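Under this data paradigm, person and item totals are simply the row and column sums of the response matrix. A minimal sketch (the matrix values and variable names are illustrative, not from the source):

```python
# Hypothetical 0/1 response matrix: N = 4 persons (rows) x J = 5 items (columns)
X = [
    [1, 1, 1, 0, 0],
    [1, 1, 0, 0, 0],
    [1, 1, 1, 1, 0],
    [1, 0, 0, 0, 0],
]

person_totals = [sum(row) for row in X]              # row sums: person total scores X+n
item_totals = [sum(col) for col in zip(*X)]          # column sums: item total scores
item_difficulty = [t / len(X) for t in item_totals]  # proportion correct per item

print(person_totals)   # [3, 2, 4, 1]
print(item_totals)     # [4, 3, 2, 1, 0]
```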
REVIEW: IRF
• Item Response Function (IRF)
  – Dichotomous response:
    P_j(θ) = Pr[X_j = 1 | θ]
           = Pr[Correct Response to item j | θ]
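For a concrete (illustrative) case, a two-parameter logistic (2PL) IRF — one of the models named later in this review — makes Pr[correct | θ] a logistic function of θ. A minimal sketch; the parameter values are made up:

```python
import math

def irf_2pl(theta, a, b):
    """2PL item response function: Pr[X_j = 1 | theta].
    a = discrimination, b = difficulty (illustrative parameters)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

# The probability of a correct response is non-decreasing in theta
p_low = irf_2pl(theta=-2.0, a=1.0, b=0.0)
p_mid = irf_2pl(theta=0.0, a=1.0, b=0.0)   # at theta == b, probability is 0.5
p_high = irf_2pl(theta=2.0, a=1.0, b=0.0)
print(p_low, p_mid, p_high)
```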
REVIEW: IRF
• Item Response Function (IRF)
  – Polychotomous response:
    P_jk(θ) = Pr[X_j > k | θ]
            = Pr[Exceed category k of item j | θ]
REVIEW: IRF
• Item Response Function (IRF)
  – Dichotomous or polychotomous response:
    E_j(θ) = E[Rating for item j | θ]
    0 < E_j(θ) < K
REVIEW: SCALES
• The unweighted total score X+n stochastically orders the latent trait θ (Huynh, 1994; Grayson, 1988).
REVIEW
• Conjoint Measurement
  – Row Independence Axiom
    • Property: Ordinal Scaling and unidimensionality of θ (test score)
INDEPENDENCE AXIOM (row)

                      ITEMS (Hard → Easy)
  Test Score Group    j = 1    j = 2    j = 3
  3 (i = 1)           P11      P12      P13
  4 (i = 2)           P21      P22      P23
  5 (i = 3)           P31      P32      P33

  Legend (cells marked in the original figure): W1 Premise, W1 Implication.
REVIEW
• Conjoint Measurement
  – Row Independence Axiom
    • Property: Ordinal Scaling and unidimensionality of θ (test score)
    • IRF: Non-decreasing over θ
    • Models: MH, 2PL, 3PL, 4PL, True Score, Factor Analysis
REVIEW
• Conjoint Measurement
  – Column Independence Axiom (adding)
    • Property: Ordinal Scaling and unidimensionality of both θ (test score) and item difficulty (item score)
INDEPENDENCE AXIOM (column)

                      ITEMS (Hard → Easy)
  Test Score Group    j = 1    j = 2    j = 3
  3 (i = 1)           P11      P12      P13
  4 (i = 2)           P21      P22      P23
  5 (i = 3)           P31      P32      P33

  Legend (cells marked in the original figure): W1 Premise, W1 Implication, W2 Premise, W2 Implication.
REVIEW
• Conjoint Measurement
  – Column Independence Axiom (adding)
    • Property: Ordinal Scaling and unidimensionality of both θ (test score) and item difficulty (item score)
    • IRF: Non-decreasing and non-intersecting over θ
    • Models: DM, ISOP
DM/ISOP (Scheiblechner 1995)

[Figure: item response functions under DM/ISOP — Pr[Correct Response] plotted against θ (−5 to 8).]
REVIEW
• Conjoint Measurement
  – Thomsen Condition (adding)
    • Property: Interval Scaling and unidimensionality of both θ (test score) and item difficulty (item score)
Thomsen condition (e.g., double cancellation)

                      ITEMS (Hard → Easy)
  Test Score Group    j = 1    j = 2    j = 3
  3 (i = 1)           P11      P12      P13
  4 (i = 2)           P21      P22      P23
  5 (i = 3)           P31      P32      P33
REVIEW
• Conjoint Measurement
  – Thomsen Condition (adding)
    • Property: Interval Scaling and unidimensionality of both θ (test score) and item difficulty (item score)
    • IRF: Non-decreasing and parallel (non-intersecting) over θ
    • Models: Rasch Model, ADISOP
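The Rasch model listed above gives a concrete case of parallel, non-intersecting IRFs: every item has the same slope, and curves differ only by a horizontal shift in difficulty. A minimal sketch (the item difficulties are illustrative):

```python
import math

def rasch_irf(theta, b):
    """Rasch item response function: Pr[correct | theta] for item difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

difficulties = [-1.0, 0.0, 1.5]              # illustrative: easy -> hard
thetas = [x / 2.0 for x in range(-10, 11)]   # grid of theta values from -5 to 5

# At every theta the easier item has the strictly higher probability,
# so the curves never cross (non-intersecting), as required above.
for theta in thetas:
    probs = [rasch_irf(theta, b) for b in difficulties]
    assert probs[0] > probs[1] > probs[2]
```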
REVIEW
• 5 Challenges of Latent Trait Measurement
• Test Theory attempts to address these challenges
TRUE SCORE MODEL
• Theory: Test score is a random variable.
  X+n  Observed test score of person n
  Tn   True test score (unknown)
  en   Random error (unknown)

  X+n = Tn + en
TRUE SCORE MODEL
• The observed person test score X+n is a random variable (according to some distribution) with mean Tn = E(X+n) and variance σ²(X+n) = σ²(en).
• Random error en = X+n − Tn is distributed with mean E(en) = E(X+n − Tn) = 0 and variance σ²(en) = σ²(X+n).
TRUE SCORE MODEL
• True Score:
  Tn     true score of person n
  E(Xn)  expected score of person n
  s      possible score, s ∈ {0, 1, …, S}
  pns    Pr[person n has test score s]

  Tn = E(Xn) = Σ_{s=1}^{S} s · pns
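The true score defined above is just the mean of person n's score distribution. A minimal sketch with a made-up distribution p_ns:

```python
# Hypothetical score distribution for one person over possible scores s = 0..4
p_ns = {0: 0.05, 1: 0.15, 2: 0.30, 3: 0.35, 4: 0.15}  # probabilities sum to 1

# True score T_n = E(X_n) = sum over s of s * p_ns
T_n = sum(s * p for s, p in p_ns.items())
print(T_n)  # approximately 2.4
```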
TRUE SCORE MODEL
• 3 Assumptions:
  1) Over the population of examinees, error has a mean of 0: E[e] = 0.
  2) Over the population of examinees, true scores and error scores have zero correlation: ρ[T, e] = 0.
TRUE SCORE MODEL
• 3 Assumptions:
  3) For a set of persons, the correlation of the error scores between two testings is zero: ρ[e1, e2] = 0.
  – "Two testings": when a set of persons take two separate tests, or complete two testing occasions with the same form.
  – The two sets of person scores are assumed to be randomly chosen from two independent distributions of possible observed scores.
TRUE SCORE ESTIMATION

  ρ²_XT = σ²_T / σ²_X ,   0 ≤ ρ²_XT ≤ 1

ρ²_XT is test reliability: the proportion of variance of observed scores that is explained by the variance of the true scores.

Estimated true score:

  T̂n = ρ²_XT · X+n + (1 − ρ²_XT) · X̄
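The estimate above shrinks the observed score toward the sample mean in proportion to the unreliability of the test. A minimal sketch (the score, reliability, and mean are illustrative values):

```python
def estimate_true_score(x_n, reliability, mean_x):
    """Estimated true score: weight the observed score by the reliability
    and pull it toward the sample mean by (1 - reliability)."""
    return reliability * x_n + (1.0 - reliability) * mean_x

# Illustrative values: observed score 40, test reliability 0.75, sample mean 30
t_hat = estimate_true_score(x_n=40.0, reliability=0.75, mean_x=30.0)
print(t_hat)  # 37.5
```

With perfect reliability the estimate equals the observed score; with zero reliability it collapses to the sample mean.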
TEST RELIABILITY

  σe = σX · √(1 − ρ²_XT)

σe is the standard error of measurement (random error).

Estimated ((1 − α) · 100)% confidence interval around the test score:

  X+n ± Z_{α/2} · σX · √(1 − ρ²_XT)
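The standard error of measurement and the confidence interval above can be sketched as follows (the standard deviation and reliability are illustrative values):

```python
import math

def sem(sd_x, reliability):
    """Standard error of measurement: sigma_X * sqrt(1 - reliability)."""
    return sd_x * math.sqrt(1.0 - reliability)

def confidence_interval(x_n, sd_x, reliability, z=1.96):
    """Approximate 95% CI (z = 1.96) around an observed test score."""
    half_width = z * sem(sd_x, reliability)
    return (x_n - half_width, x_n + half_width)

# Illustrative values: score SD of 10 points, reliability 0.75
print(sem(sd_x=10.0, reliability=0.75))            # 5.0
print(confidence_interval(30.0, 10.0, 0.75))       # roughly (20.2, 39.8)
```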
TEST RELIABILITY
• Reliability – the degree to which the respondents' test scores are consistent over repeated administrations of the same test.
• Indicates the precision of a set of test scores in the sample.
• Random and systematic error can affect the reliability of a test.
• Test developers have a responsibility to demonstrate the reliability of scores obtained from their tests.
ESTIMATING RELIABILITY

  Cronbach's α = [J / (J − 1)] · [1 − (Σ_{j=1}^{J} σ̂²_j) / σ̂²_X]

  σ̂²_j : estimated item variance
  σ̂²_X : estimated total test score variance
ESTIMATING RELIABILITY

  Cronbach's α = [J / (J − 1)] · [(Σ_{i≠j} σ̂_ij) / σ̂²_X]

  σ̂_ij : estimated covariance between items i and j
  σ̂²_X : estimated total test score variance
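The two formulas are algebraically equivalent, because the total score variance decomposes into the item variances plus the inter-item covariances. A minimal sketch that computes both forms on a small hypothetical response matrix (the data are made up):

```python
def variance(values):
    """Population variance of a list of scores."""
    m = sum(values) / len(values)
    return sum((v - m) ** 2 for v in values) / len(values)

def covariance(xs, ys):
    """Population covariance of two lists of scores."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / len(xs)

# Hypothetical 0/1 responses: rows = persons, columns = items
X = [
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
]
J = len(X[0])
items = list(zip(*X))               # item score vectors (columns)
totals = [sum(row) for row in X]    # total test scores X+n

# Form 1: item variances
alpha_var = (J / (J - 1)) * (1 - sum(variance(i) for i in items) / variance(totals))

# Form 2: inter-item covariances (i != j)
cov_sum = sum(covariance(items[i], items[j])
              for i in range(J) for j in range(J) if i != j)
alpha_cov = (J / (J - 1)) * (cov_sum / variance(totals))

print(alpha_var, alpha_cov)  # both approximately 0.8 for this matrix
```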
OTHER FORMS OF RELIABILITY
• Test-Retest Reliability: the correlation between persons' test scores over two administrations of the same test.
OTHER FORMS OF RELIABILITY
• Split-Half Reliability (using Spearman-Brown correction for test length):

    ρ_split = 2 · ρ_AB / (1 + ρ_AB)

  ρ_AB : correlation between scores of Test A and Test B
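The Spearman-Brown correction above steps the half-test correlation up to full-test length; a minimal sketch (the half-test correlation is an illustrative value):

```python
def spearman_brown(r_ab):
    """Spearman-Brown corrected split-half reliability:
    steps the correlation between the two test halves up to full length."""
    return 2.0 * r_ab / (1.0 + r_ab)

# Illustrative: the two halves correlate 0.6
print(spearman_brown(0.6))  # ~0.75
```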
TEST VALIDITY
• VALIDITY: A test is valid if it measures what it claims to measure.
• Types: Face, Content, Concurrent, Predictive, Construct.
TEST VALIDITY
• Face validity: When the test items appear to measure what the test claims to measure.
• Content validity: When the content of the test items, according to experts, adequately represents the latent trait that the test intends to measure.
TEST VALIDITY
• Concurrent validity: When the test, measuring a particular latent trait, correlates highly with another test that measures the same trait.
• Predictive validity: When the scores of the test predict some meaningful criterion.
TEST VALIDITY
• Construct validity: A test has construct validity when the results of using the test fit hypotheses concerning the nature of the latent trait. The better the fit, the higher the construct validity.
RELIABILITY & VALIDITY
• Up to a point, reliability and validity increase together, but beyond that any further increase in reliability (over ~.96) decreases validity.
• For example, with perfect reliability (perfect correlations between items), the test items are essentially paraphrases of each other.
RELIABILITY & VALIDITY
"If the reliability of the items were increased to unity, all correlations between items would also become unity, and a person passing one item would pass all items and another failing one item would fail all the other items. Thus all the possible scores would be a perfect score of one or zero… Is the dichotomy of scores the best that would be expected for items with equal difficulty?"
(Tucker, 1946, on the attenuation paradox)
(see also Loevinger, 1954)