Education 795 Class Notes
description
Transcript of Education 795 Class Notes
![Page 1: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/1.jpg)
Education 795 Class Notes
P-Values, Partial Correlation, Multi-Collinearity
Note set 4
![Page 2: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/2.jpg)
Today’s Agenda
Announcements (ours and yours)
Q/A?
Leveraging what we already know
Partial Correlation and Multi-Collinearity
![Page 3: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/3.jpg)
P-Values
“p-value refers to the probability of the evidence having arisen as a result of sampling error given that the null hypothesis is true” (Pedhazur & Pedhazure, 1991)
What is inherently wrong the p-values?
Why do we use them?
![Page 4: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/4.jpg)
P-Values
“Even though I am very critical of statistical inference… I shall probably continue to pay homage to “tests of significance” in the papers I submit to psychological journals. My rationale for this admitted hypocrisy is straightforward: until the rules of the science game are changed, one must abide by at least some of the old rules, or drop out of the game” (Mahoney, 1976, p. xiii)
![Page 5: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/5.jpg)
What to do?
“Perhaps p values are like mosquitos. They have an evolutionary niche somewhere and no amount of scratching, swatting, or spraying will dislodge them” (Campbell, 1982, p 698)
![Page 6: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/6.jpg)
Statistical Significance vs. Practical Significance
We should refrain from what Tukey calls “statistical sanctification.” Concern with practical significance is addressed through effect sizes or relational magnitudes (betas in regression).
“A difference is a difference only if it makes a difference” (Huff, 1954, p. 58)
![Page 7: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/7.jpg)
Introduction to Effect Size
Effect sizes imply strength of meaningfulness or importance
General Rule set forth by Cohen (1988) for small, medium, large ES
We will address how effect sizes are computed later in the course
![Page 8: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/8.jpg)
Transition Back to Multiple Regression
1. Multiple predictors typically yield better technical solutions (e.g., higher R2)
2. Multiple predictors provide opportunities to test more realistic models (e.g., why is nothing as simple as it should be?)
3. Multiple regression models allow for an examination of more complex research hypotheses than is possible with simple regression / correlation approaches
![Page 9: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/9.jpg)
Regression
Raw score depiction:
where each b:is the unique and independent contribution of that predictor to the modelfor quantitative IVs, the expected direction and amount of change in the DV for each unit change in the IV, holding all other IVs constantFor dichotomous IVs, the direction and amount of group mean difference on DV, holding all other IVs constant
![Page 10: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/10.jpg)
Revisit ’s
Example: Dependent Variable: Promote Racial UnderstandingIndependent Variable: Sex, Race
sex = rsex,promote if sex and race are not correlated. These are population based estimates and they are “effect sizes” because we can compare relative strength of predictors in the model
In the Venn diagram on the following slide, note X1 and X2 are not correlated but X2 and X3 are
![Page 11: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/11.jpg)
Venn Diagram Depiction
CorrelationRegression Coefficients
![Page 12: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/12.jpg)
Warning
Pedhazur believes that the topics of partial correlations and semi-partial correlations can be confusing and lead to misinterpretations of regression coefficients. Why talk about them?
Awareness and enough knowledge to evaluate research where partials are used
![Page 13: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/13.jpg)
Partial Correlations
A variation on the idea of residualization (removal of the predictable part of y from y)
First-order partial correlations:
correlation of variable 1 and 2 partialling variable 3 from 1 and 2
![Page 14: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/14.jpg)
Plug and Chugr Quiz Exam Speed Motiv
Quiz 1.00
Exam .40 1.00
Speed .35 .45 1.00
Motiv .25 .30 .15 1.00
1. What is the correlation between quiz and exam score, controllingfor test taking speed?
2. What is the correlation between exam score and motivation, controlling for test taking speed?
![Page 15: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/15.jpg)
Semi-Partial Correlations
r1(2.3)=correlation of variables 1 and 2 after having partialed variable 3 only from variable 2. (semi-partial)
VS
r12.3=correlation of variables 1 and 2 after having partialed variable 3 from both variable 1 and variable 2 (partial)
![Page 16: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/16.jpg)
Before Jumping Into Regression
Examine the data using common-sense (e.g., are the data appropriate for producing interpretable correlation coefficients?) as well as standard diagnostic procedures
Review the r among the predictors for collinearity problems
![Page 17: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/17.jpg)
Multicollinearity
Multicollinearity refers to correlations among the independent variables only
Multicollinearity is measured by the tolerance statistic, defined as 1 – R2 predicting each predictor using all other predictors
(values close to 1 are better, values close to 0 are bad)
Excessive collinearity (even singularity – perfect correlation between two or more IVs) suggests that predictors have extensive overlaps, and we may need to be selective in picking predictors or combining them (through factor analytic techniques)
![Page 18: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/18.jpg)
Dangers
Multicollinearity has adverse effects on regression analysis
High multicollinearity leads to a reduction in the magnitude of the b’s
High multicollinearity leads to inflated se’s, reducing the t-ratios for the coefficients
![Page 19: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/19.jpg)
Solutions
Be selective in choosing variables that are related
Combine like variables into an index using scales or ‘factor analysis’ which we will talk about soon
![Page 20: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/20.jpg)
Suppressors
When a partial correlation is larger than the original r, it is considered to be the result of a suppressor effect
Suppressor variables effectively mask (suppress) the relationship between other variables
This effect occurs when there is an unbalanced mix of +/- correlations between the DV and the IVs
![Page 21: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/21.jpg)
Project Activity
Dataset: Chose a dataset and run a multiple regression
Dependent variable: SATC=SATM+SATVIndependent variables: sex, family income,
mother’s education and father’s education
Use syntax to get the tolerance statistic
Rerun the regression summing mothers and fathers education into one variable. Compare the tolerance statisticfor mothers and fathers education with the summed index.
![Page 22: Education 795 Class Notes](https://reader036.fdocuments.net/reader036/viewer/2022062322/568145b7550346895db2bf40/html5/thumbnails/22.jpg)
For Next Week
Read Pedhazur Ch 10 p211-216
Read Pedhazur Ch 14 p304-310
Read Pedhazur Ch 19 p464-466
Read Pedhazur Ch 21 p545-558, p567-579