HSAG Report - Alison Chapman and Allison Makowski_PDF

EVALUATING GEOLOGY TOPICS IN STANDARDIZED TESTING

A STATISTICAL CONSULTING EXPERIENCE

GRAND VALLEY STATE UNIVERSITY STA 319-02: STATISTICS PROJECT

4/21/15

ALISON CHAPMAN AND ALLISON MAKOWSKI

1

Table of Contents:

Abstract 2

Introduction 3

Methods 5

Results 7

Hypothesis 1 12

Hypothesis 2 13

Hypothesis 3 14

Hypothesis 4 15

Hypothesis 5 16

Discussion/Recommendations 17

Appendix 19

2

Abstract: The purpose of this project was to evaluate geology topics in standardized testing. This

report analyzes the High School Advanced Geology (HSAG) exam scores. A GVSU Geology

professor named Dr. Steve Mattox created the HSAG exam. The HSAG exam covers 12-15

geology topics is composed of four parts: Multiple Choice, Rocks and Minerals, Maps, and an

Essay portion. The data was collected from high school seniors in nine different schools between

2012-2014. Analyses were conducted to recognize the areas where the students struggled. After

the analyses were completed, we found that the student’s total score on the HSAG exam differed

between the nine schools. Also, there was a difference in the student’s score between the four

portions of the HSAG exam for five of the nine schools: Dream Academy, Grosse Point South,

Grand Haven, Hudsonville, Multicultural Academy. Our findings will allow Dr. Mattox to look

at specific parts of the exam where student’s struggled and modify the exam to aid in more

students passing. The results will also let Dr. Mattox inform the teachers where his/her students

struggled, so he/she can adjust their lectures so more students in the school can pass.

3

Introduction: The High School Advanced Geology (HSAG) exam was designed high school seniors

that that were currently taking a geology course. The HSAG exam acts like an Advanced

Placement (AP) exam, where students can pass and earn college credit. Our client, Dr. Steve

Mattox, developed the HSAG exam. Dr. Mattox is a GVSU Geology professor and he was

assisted in this project by a GVSU Earth Science student named Christina Sobolak. This project

was funded by two National Science Foundation (NSF) grants, which allowed Dr. Mattox to

expand the project by adding more schools and getting teachers supplies to use in their classroom

lectures. The schools that were ideal for this project were those high in diversity and had a

teacher with an Earth Science minor, Geology major, or Masters in Science education. This

project offers summer training for the teachers along with materials to supply the classroom.

Our role in this study was to act as lead statistical consultants. Dr. Mattox and Christina

collected all of the necessary data, while we ran the statistical analyses. Dr. Mattox wanted to

identify specific topics that caused the most difficulty for students. These topics could prevent

students from passing the exam. The ultimate goal of this program is to help increase the number

of high school students taking a geology course and the level of geology knowledge of

participating teachers and students. Dr. Mattox would like to make this HSAG course to be as

qualified as current AP science courses. Our findings will be able to aid Dr. Mattox in the

continuation of this project. Dr. Mattox would be able to provide the teachers with additional

training and other resources to help future students learn better. The findings will also aid in

increasing the number of students passing the HSAG exam/earning credit, as well as increasing

the number of students enrolling in advanced college geoscience courses. Having more students

4

enroll in geoscience courses will hopefully aid in more students pursing a geoscience career,

which is a field of study where there is a projected shortage of graduates.

In order to assist Dr. Mattox with his project, we conducted statistical analyses to find

significant results. We did this by answering specific hypotheses that we created.

The five hypotheses that we tested are as follows:

1. Is there is a difference in the students’ mean section percent between the four portions of the HSAG exam?

2. Is there is a difference in the students’ total mean percent on the HSAG exam between the years?

3. Is there is a difference in the students’ total mean percent on the HSAG exam

between the nine schools?

4. Is there is a difference in the FOCUS students’ mean section percent between the 4 portions of the HSAG exam?

5. Looking at each individual school, is there a difference in the students’ mean section

percent between the four portions of the HSAG exam

5

Methods: The HSAG exam consisted of four separate portions: Multiple Choice, Rocks and

Minerals, Maps, and an Essay portion. The HSAG exam is often taken in the spring, but each of

the four portions may be taken at separate times. The HSAG exam consists of approximately 12

to 15 general geology topics. There were a total of 196 questions, worth 377 points, on the

HSAG exam. The Multiple Choice portion had the most questions out of all the portions, and the

Essay portion was the shortest portion. Table 1 below, shows the number of questions on each

portion and the amount of geology topics covered within the portion.

Table 1: Questions, Topics, and Points for Each Portion of the HSAG Exam

Portion of the HSAG Exam

Number of Questions

Number of Geology Topics

Total Number of Points

Multiple Choice 77 13 152 Rocks and Minerals 50 4 100

Maps 40 10 80 Essay 29 6 45

Point distribution between the four portions can be seen below in Figure 1. There are

currently nine Michigan colleges and universities that have awarded students with college if the

student gets a 70% or better on the exam. If the student gets a passing score (70% or better), they

will receive up to four college credits. The data collected by Dr. Mattox and Christina ranges

from 2012 to 2014, with approximately four to eight schools participating each year. This data

includes every student who took the exam along with their total number of points they received

for each portion of the exam.

6

Figure 1. Point Distribution of Each Portion of the HSAG Exam

The data was collected from nine different schools in Michigan between 2012 and 2014.

Table 2 shows which schools were involved and the year they participated. There were only

three schools that participated in all three years (Grosse Point South, Hudsonville, and Okemos).

Table 2. The Years the Schools Were Sampled

Year Black River

Cody DIT

Dream Academy

Grand Haven

Grosse Point South

Henry Ford Hudsonville

Multicultural Academy Okemos

2012 X X X X 2013 X X X X X X 2014 X X X X X X X X

Dr. Mattox recorded the collected data and entered it into Excel files that were separated

by school and by year. The students’ total score on the exam, as well as their score for each

portion were recorded within the Excel files. In our study, we analyzed the sample of 354 high

school students.

7

Results: We were given the data in 18 excel files, so we had to restructure the data. Because of

how our data was originally formatted, we needed to combine the multiple Excel files into a

single Excel file that could easily be inputted into a software package. We did this by “copying

and pasting” the data that was needed from each Excel spreadsheet into a single Excel

spreadsheet. Upon doing this, we discovered a few missing values. We replaced the missing

values with zeros because a lack of a value indicated that the student did not take a portion of the

exam, or the complete HSAG exam. Not taking the exam resulted in that student receiving a 0%

on the portion or the exam. No other data cleaning technique was needed. Any errors in the data

would have been from a mistake in the inputting of the original data values by Dr. Mattox. We

would not be able to discover these errors. After restructuring and cleaning the data, we used

SPSS 20 and SAS 9.3 to conduct our analyses.

We began by creating new variables in SPSS and SAS. First, we had SPSS calculate the

percent that a student earned on each of the four portions of the exam. SPSS also calculated the

percent that a student earned on the HSAG exam in total. The following variables were created

by SPSS: Multiple Choice Percentage, Maps Percentage, Rocks and Minerals Percentage, Essay

Percentage, and Total Exam Percentage.

Dr. Mattox also wanted to examine the students that scored between 40-69.99% on the

HSAG exam to see where they struggled. We called these students the Focus students. In order

to accomplish this, we had SAS create a new variable for the focus students by selecting the

students that scored between a 40% and 69.99% on the HSAG Exam. We did not need any

further data reconstructing. As a whole, our dataset did not require much SPSS or SAS

manipulation, besides the creation of new variables.

8

Our project consisted of seven key variables of interest: Portion Percent, School, Year,

Portion of the HSAG Exam, Total Exam Percent, Students, and Focus Students. Information,

about the key variables such as the type, values, and description, is shown in Table 3.

Table 3. Information on the Key Variables in Our Study

Variable Name Variable Type

Range/Values the Variable Can Take On Variable Description

Portion Percent

Quantitative 0 % - 100 % The percentage that a student received for a portion of the HSAG Exam

School Categorical

Black River

Cody DIT

Dream Academy

Grand Haven

Grosse Point South

Henry Ford

Hudsonville

Multicultural Academy

Okemos

The school where the student took the HSAG Exam

Year Categorical 2012 2013 2014 The year that a student took the HSAG Exam

Portion of the HSAG Exam

Categorical Multiple Choice

Rocks and Minerals

Maps

Essay The portion of the HSAG Exam

Total Exam Percent

Quantitative 0 % - 100 % The percentage that a student received on the HSAG Exam

Students Categorical Each student had a specific ID Any student that took the HSAG Exam

Focus Students Categorical

0 or 1 (NO or YES)

“0” = “NO”—Not a focus student “1” = “YES”—Is a focus student

Any student that took the HSAG Exam and scored between a 40 % - 69.99 %

9

For the quantitative key variables, the maximum, minimum, and mean values were

calculated in SPSS. The values can be found in Table 4. Between all nine schools and all three

years, the highest average percent was for the Essay portion at 60.39%. The lowest average

percent belonged to the Maps portion at 55.17%. As a whole, the average percent on the HSAG

Exam was 58.17%.

Table 4. Descriptive Statistics of the Quantitative Variables (Portion Percent and Total Exam Percent)

Variable Name Mean Minimum Maximum

Portion Percent

Multiple Choice Percent

59.02 % 0 % 99 %

Rocks and Minerals Percent

58.27 % 0 % 100 %

Maps Percent

55.17 % 0 % 98 %

Essay Percent

60.39 % 0 % 98 %

Total Exam Percent

HSAG Exam Percent

58.17 % 0 % 95 %

For the categorical variables, the frequencies were calculated. The frequencies for the

variable Year is shown in Figure 2. As shown in Figure 2, the number of students sampled per

year increased between 2012-2014. This is because each year had more schools participating

than the previous. The sampling of the schools is shown in Table 2.

10

Figure 2. Frequencies of the Variable Year

The frequencies for the variable School are shown in Figure 3. As shown in Figure 3, the

school with the most students sampled is Hudsonville with 138 students. On the other hand,

Multicultural Academy and Henry Ford had the lowest amounts, with 4 and 5 students,

respectively.

Figure 3. Frequencies of the Variable School

86

120 148

0 20 40 60 80

100 120 140 160

2012 2013 2014

Number of Students Sampled Per Year (all schools)

138 49

47 40

35 19

17 5 4

0 20 40 60 80 100 120 140 160

Hudsonville Grosse Point South

Okemos Grand Haven

Black River Dream Academy

Cody DIT Henry Ford

Multicultural Academy

Number of Students Sampled for Each School (all years)

11

For our variable, Focus Student, there were a total of 101 out of the 354 students that

scored between a 40% - 69.99%. If the student was a Focus Student, they had a value equal to

“Yes”. Otherwise, the student’s value was equal to “No”. Figure 5 shows the frequencies of the

Focus Students variable.

Figure 4. Frequencies of Variable Focus Students

For each of our hypotheses (except for hypothesis three), a statistical test called an

Analysis of Variance (ANOVA) was used. An ANOVA test is a statistical test of whether or not

the means of several groups are equal or not. We can conclude that there is significant evidence

of a difference in the means if our p-value is less than 0.05. There are three assumptions for an

ANOVA test to be valid: (1) we have independent observations, (2) the samples come from a

normal distribution, and (3) the variances of the data in the groups are equal.

If the assumptions are not met, like in hypothesis three, a nonparametric equivalent to an

ANOVA test, called a Kruskal-Wallis test, is used. A Kruskal-Wallis test ranks the data to help

determine if there is significant evidence of a difference in at least one of the distributions. It is a

significant result if our p-value is less than 0.05. However, the Kruskal-Wallis test is not as

powerful as an ANOVA, but it is useful when the assumptions are not met for an ANOVA test.

12

Hypothesis 1: Is there a difference in the students’ mean section percent between the four portions of the HSAG exam?

First, we tested to see what portion of the exam caused the students the most difficulty.

We ran a One-Way ANOVA test to address this hypothesis. The average students’ scores on

each portion of the exam is shown in Table 5. Because our p-value was 0.084, we were not able

to conclude that there was a difference in the students’ mean scores between the four portions of

the HSAG exam. The Map portion of the HSAG exam had the lowest mean percent, but we

cannot conclude that it was significantly different than the other portions of the exam. The

complete test that was conducted in SPSS for Hypothesis 1 can be seen in the Appendix A.

Table 5. Descriptive Statistics of Student’s Mean Percentage for Each Portion of the Exam

Name Mean Standard Deviation

Multiple Choice Percent 59.02 % 27.38 %

Rocks and Minerals Percent 58.27 % 27.95 %

Maps Percent 55.17 % 26.65 %

Essay Percent 60.39 % 29.96 %

13

Hypothesis 2: Is there a difference in the students’ total mean percent on the HSAG exam between the years?

Next, we tested to see if the students’ total mean percent changed over the course of the

three years. For this hypothesis, we also ran a One-Way ANOVA. The average students’ scores

for each year is shown in Table 6. The lowest mean score on the HSAG exam was in 2014 with a

score of 55.47%. The highest mean score on the HSAG exam for all the students was in 2012

with a score of 63.87%. However, our p-value was 0.056 so there was not significant evidence to

indicate that the students’ mean score differs between the years. We found these results to be

surprising because the average scores actually went down as time went on. Before running the

test, we assumed the scores would increase because teachers could change their teaching styles

to better prepare future students. However, over the course of the three years, the program

expanded the number of schools participating each year. The results could have been “diluted”

by adding schools that were not as prepared to take the HSAG exam unlike the other schools that

have been in the program before. There were several schools that only took this exam in one of

the years so these teachers were not given the opportunity to change and enhance their teaching

strategies yet. The complete test for Hypothesis 2 can be seen in the Appendix B.

Table 6. Descriptive Statistics of the Student’s Total Percentage in Each Year

Year Mean Number of Students

2012 63.87 % 86

2013 57.41 % 120

2014 55.47 % 148

14

Hypothesis 3: Is there a difference in the students’ total mean percent on the HSAG exam between the nine schools?

Our next test was to determine if the students’ total mean percent differed between the

nine schools. Our first choice was to run a One-Way ANOVA, to address this hypothesis, but our

assumptions were not met. Therefore, we decided to run the nonparametric equvialnt to an

ANOVA called a Kruskal-Wallis test. With a p-value less than 0.001, we found significant

evidence of a difference in the students’ total mean percent on the HSAG exam between the nine

schools. The total sample size for each school, as well as the students’ mean rank on the HSAG

exam can be seen in Table 7. You can see that Hudsonville had the highest mean rank of 269.88.

However, Hudsonville has taken this exam for over 10 years, so the teacher has perfected his

teaching style. We can than compare this to Cody DIT who had the lowest mean rank of 49.76

and took the exam only in 2013 and 2014. The complete test for hypothesis 3 can be seen in the

Appendix C.

Table 7. Descriptive Statistics of Each School’s Total Exam Percentage Ranked

School Number of Students Average Rank

Black River 35 122.77

Cody DIT 17 49.76

Dream Academy 19 26.58

Gross Point South 49 180.98

Grand Haven 40 163.36

Henry Ford 5 67

Hudsonville 138 269.88

Multicultural Academy 4 78.38

Okemos 47 82.81

15

Hypothesis 4: Is there a difference in the Focus students’ mean section percent between the four portions of the HSAG exam?

We also wanted to test to see what portion of the HSAG exam caused the Focus students

the most difficulty. We ran a One-Way ANOVA test to address this hypothesis. The average

Focus Students’ scores for each portion of the HSAG exam and the exam in total are shown in

Table 8. With a p-value less than 0.001, we found significant evidence to show that the Focus

Students’ mean score differs between the four portions of the exam. The Essay’s mean

percentage was significantly greater than the other three portions of the exam at 60.89%, on

average. The lowest mean score for the Focus Students group was the Multiple Choice portion

with an average score of 53.30%. The complete analysis for Hypothesis 4 can be found in the

Appendix D.

Table 8. Descriptive Statistics of Focus Student’s Mean Percentage (Each Portion Percent and Total Percent Overall)

Portion Mean

Multiple Choice Percent 53.30 %

Rocks and Minerals Percent 54.87 %

Maps Percent 55.54 %

Essay Percent 60.89 %

HSAG Exam Percent TOTAL 56.14 %

16

Hypothesis 5: Looking at each individual school, was there a difference in the students’ mean section percent between the four portions of the HSAG exam?

Lastly, we wanted to look at each individual school to see what portion of the HSAG

exam caused their students the most difficulty. We ran a One-Way ANOVA test to address our

hypothesis. We found insufficient evidence that there was a difference in students’ mean

percentage between the 4 portions of the HSAG exam for the following schools: Black River,

Cody DIT, Henry Ford, and Okemos. However, we did find significant evidence that there was a

difference in students’ mean percentage between the 4 portions of the HSAG exam for Dream

Academy, Grosse Point South, Grand Haven, Hudsonville, and Multicultural Academy. In Table

9, the portion of the exam that caused those five schools the most difficulty is shown. The

complete test conducted in SPSS for hypothesis 5 can be seen in Appendix E.

Table 9. Schools’ Lowest Mean Portion of Exam for the Significant Results

School Multiple Choice

Rocks and Minerals Maps Essay

Dream Academy X X

Grosse Point South X X X

Grand Haven X

Hudsonville X

Multicultural Academy X X

17

Discussion/Recommendations: Our data analysis allowed Dr. Mattox to look at specific portions of the exam instead of

the entire HSAG exam. Dr. Mattox wanted to focus most of his attention on the students that

scored between a 40% - 69.99%, since they are the students that were close to passing, but

missed the mark. After running our analyses, we would recommend that Dr. Mattox re-look at

the multiple choice portion of the exam, since there was significant evidence that this portion

caused those students that scored between a 40% - 69.99% the most difficulty. This will help Dr.

Mattox to spend his time, energy, and resources on that specific portion, instead of spreading it

across the whole exam.

We also were able to show Dr. Mattox how each individual school was performing,

which will allow him to adjust the schools that he has in the program. By showing how each

school is performing, Dr. Mattox will be able to decide if he will drop a school from his program

and invest his resources in a school that is better suited for the program. This will help the

schools that are dedicated to the program by helping those schools continue to do better.

Our analyses will also allow Dr. Mattox to contact the teachers within the program to

show them where their particular students struggled. This will help the teachers in the program

adjust their teaching to better prepare future students. The teachers will be able to adapt based on

this data and change their strategies to fit the students’ needs.

Dream Academy should focus on improving the topics in the rocks and minerals portion

as well as the essay portion. Grosse Point South should focus on improving all the topics in all

the portions besides the essay portion. On the other hand, Grand Haven and Hudsonville only

need to focus on the topics in the multiple choice and map portion; respectively. Lastly,

Multicultural Academy should concentration on the topics in the Map and Essay portions.

18

There are a few limitations with the data since the same schools were not consistently

sampled for each of the years, but this is due to the program being expanded and allowing more

schools to participate. A few suggestions for further research would be to continue to add more

schools to the program and let the current teacher’s know where he or she can better focus his or

her teaching.

19

Appendix:

Appendix A: Hypothesis 1 20

Appendix B: Hypothesis 2 23

Appendix C: Hypothesis 3 27

Appendix D: Hypothesis 4 30

Appendix E: Hypothesis 5 33

School 1: Black River 33

School 2: Cody DIT 35

School 3: Dream Academy 37

School 4: Grosse Point 39

School 5: Grand Haven 41

School 6: Henry Ford 43

School 7: Hudsonville 45

School 8: Multicultural Academy 47

School 9: Okemos 49

20

Appendix A: Hypothesis 1 Goal/Hypothesis:

To determine if there is a difference in the students’ mean portion score between the four portions of the HSAG exam

Figure A.1 shows the variability of the student’s mean score for each portion. Each boxplot shows the same amount of variability and all four are very similar.

Figure A.1. Boxplots of Each Portion of the HSAG Exam

Figure A.2. SPSS Output for the Average Percent for Each Portion of the HSAG Exam

21

We are analyzing a quantitative dependent variable (Portion Percent) by a qualitative independent variable with four categories (Portion of the HSAG Exam), so we will be using a One-Way ANOVA test for this hypothesis. Assumptions for a One-Way ANOVA:

1. Independent samples This assumption is met because each student’s score on one portion of the

HSAG Exam is independent of the other portions.

2. Approximately Normal Populations From Figure A.3, each portion has a skewness and kurtosis value between

(-1,1) so we can accept normality, which means the students’ mean score on each portion of the HSAG exam is from a normal population. This assumption is met.

Figure A.3. Skewness and Kurtosis SPSS Output

3. Equal Variances

In order to check for equal variances, we will be using a statistical test called the Levene’s Test, which will tell us if the variances for each of the portions of the exam are approximately equal.

From Figure A.4, we see the test produces a test statistic of 2.019 and a p-value of 0.109, so we can conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam.

This assumption is met.

Figure A.4. Levene’s Test SPSS Output

22

Since all three of the assumptions are met, we will proceed with the One-Way ANOVA.

Figure A.5. SPSS Output for the One-Way ANOVA Test for Hypothesis 1

From Figure A.5, we see that our One-Way ANOVA test results in a test statistic of F = 2.219, which produces a p-value = 0.084. Our p-value is large, so we do not have a significant result. This means that there is not significant evidence that there is a difference in students’ mean scores between the 4 portions of the HSAG exam.

23

Appendix B: Hypothesis 2 Goal/Hypothesis: To determine if there is a difference in the students’ total mean percent on the HSAG exam between the years Figure B.1 shows the variability of the student’s total mean score for each year. The boxplots show approximately the same shape, but we will check the assumptions.

Figure B.1. Boxplots of Each Year

Figure B.2. SPSS Output for the Average Total Percent for Each Year

24

We are analyzing a quantitative dependent variable (Total Percent) by a qualitative independent variable with three categories (Year), so we will be using a One-Way ANOVA test for this hypothesis. Assumptions for a One-Way ANOVA:

1. Independent samples This assumption is met because each student’s total score on the HSAG

Exam from one year is independent of the other years.

2. Approximately Normal Populations From Figure B.3, the year 2012 has a skewness and kurtosis value

between (-1,1) so we can accept normality for this year, which means the students’ mean score on the HSAG exam in 2012 is from a normal population.

Figure B.3. Skewness and Kurtosis SPSS Output for 2012

From Figure B.4, the year 2013 has a skewness value between (-1,1), but the kurtosis value is outside the “normal” range. At a -1.005, the kurtosis value is not extremely far off what we would consider “normal”, so we will proceed on assuming normality for this year. This means the students’ mean score on the HSAG exam in 2013 is from an approximately normal population.


25

From Figure B.5, the year 2014 has a skewness value between (-1,1), but the kurtosis value is outside the “normal” range. At a -1.240, the kurtosis value is close to what we would consider “normal”, so we will proceed with caution on accepting normality for this year. This means the students’ mean score on the HSAG exam in 2014 is from an approximately normal population.


Combining all of the information from figure B.3, B.4, and B.5, we will continue with assuming normality since the skewness and kurtosis values for all three years are not extreme and we also have a large sample size. This assumption is met.

3. Equal Variances In order to check for equal variances, we will be using a statistical test

called the Levene’s Test, which will tell us if the variances for each of the portions of the exam are approximately equal.

From Figure B.5, we see the test produces a test statistic of 0.740 and a p-value of 0.478, so we can conclude that there is significant evidence of equal variances across the three years.

This assumption is met.

Figure B.5. Levene’s Test SPSS Output


26

From Figure B.6, we see that our One-Way ANOVA test results in a test statistic of F = 2.904, which produces a p-value of 0.056. Our p-value is above our significance level of 0.05, so we do not have a significant result. This means that there is not significant evidence to show that the students’ mean score differs between the years.

Figure B.6. SPSS Output for the One-Way ANOVA Test for Hypothesis 2

27

Appendix C: Hypothesis 3 Goal/Hypothesis: To determine if there is a difference in the students’ total mean percent on the HSAG exam between the nine schools Figure C.1 shows the variability of the student’s total mean score for each school and there is an enormous difference in variability between the nine schools.

Figure C.1. Boxplots of Each School

We are analyzing a quantitative dependent variable (Total Percent) by a qualitative independent variable with nine categories (School), so we will be using a One-Way ANOVA test for this hypothesis if the assumptions are met.

28

Assumptions for a One-Way ANOVA:

1. Independent samples This assumption is met because each student’s total score on the HSAG

Exam from one school is independent of the other schools.

2. Approximately Normal Populations This assumption is met for the most part because we have a large sample

size for most of the school, but there are a few schools that have a smaller sample size. We will continue checking the assumptions with reservations.

Figure C.3. Descriptive Statistics

3. Equal Variances

In order to check for equal variances, we will be using a statistical test called the Levene’s Test, which will tell us if the variances for each of the portions of the exam are approximately equal.

From Figure B.4, we see the test produces a test statistic of 13.298 and a very small p-value of less than 0.000, so we CANNOT conclude that there are equal variances across the nine schools. This assumption is NOT met.

Figure C.3. Levene’s Test SPSS Output

29

Since all the assumptions are NOT met, we cannot proceed with the One-Way ANOVA.

We will be conducting a Kruskal-Wallis Test instead. From Figure C.4, we see that our Kruskal-Wallis test results in a test statistic of

χ2(8) = 240.944 which gave us a p-value less than 0.000. Therefore, we have significant evidence of difference in the students’ total percent between the nine different schools,

Figure C.4. SPSS Output for the Kruskal-Wallis Test for Hypothesis 3

There is a mean rank score of 26.58 for Dream Academy, 49,76 for Cody DIT, 67 for Henry Ford, 78.38 for Multicultural Academy, 82.81 for Okemos, 122.77 for Black River, 163.36 for Grand Haven, 180.98 for Grosse Point South, and 269.88 for Hudsonville. The results are shown in Figure C.5.

Figure C.5. SPSS Output for Ranks for Each School

30

Appendix D: Hypothesis 4 Goal/Hypothesis:

To determine if there is a difference in the Focus Students’ mean section percent between the 4 portions of the HSAG exam

Figure D.1. Shows the variability of the Focus Students’ mean score for each portion of the HSAG Exam.

Figure D.1. Boxplots of Each Portion of the HSAG Exam for the Focus Students

Figure D.2. SPSS Output for the Focus Students’ Average Percent for Each Portion of the

HSAG Exam

31

We are analyzing a quantitative dependent variable (Focus Students’ Portion Percent) by a qualitative independent variable with four categories (Portion of the HSAG Exam), so we will be using a One-Way ANOVA test for this hypothesis. Assumptions for a One-Way ANOVA:

1. Independent samples This assumption is met because each focus student’s score on one portion

of the HSAG Exam is independent of the other portions.

2. Approximately Normal Populations From Figure D.3, we have p-values for the Shapiro-Wilk Test for

Normality that are all significant (greater than 0.05). Therefore, we can assume normality. Thus, this assumption is met.

Figure D.3. Shapiro Wilk Test for Normality Results

3. Equal Variances In order to check for equal variances, we will be using a statistical test

called the Levene’s Test, which will tell us if the variances for each of the portions of the exam are approximately equal.

From Figure D.4, we see the test produces a test statistic 2.173 and a p-value of 0.091, so we can conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam. This assumption is met.

Figure D.4. Levene’s Test SPSS Output

32


From Figure D.5, we see that our One-Way ANOVA test results in a test statistic of

F = 6.175, which produces a p-value less than 0.000. Our p-value is very small, so we have a significant result. This means that there is significant evidence to that the focus students’ mean score differs between the four portions of the exam.

Figure D.5. SPSS Output for the One-Way ANOVA Test for Hypothesis 4

Figure D.6 shows the portions of the exams are significantly different from each other. The essay portion has a higher mean score than the multiple choice, rocks and minerals, and maps portions. We can conclude that the focus students’ scored significantly higher on the essay portion of the HSAG exam.

Figure D.6. SPSS Output Focus Students

33

Appendix E: Hypothesis 5 Goal/Hypothesis: Is there a difference in the students’ mean section percent between the four portions of the HSAG exam

School 1: Black River

Figure E.1 shows the variability of Black River’s students’ mean score for each portion. Each boxplot has a different shape, and they do not look similar. We suspect that our ANOVA assumptions will not be met.

Figure E.1. Boxplots of Each Portion of the HSAG Exam

We will only check for equal variances since we presume this assumption will not be met. From Figure E.2, we see the test produces a test statistic 3.875 and a p-value of 0.011, so we cannot conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam for the students of Black River.

Figure E.2. Levene’s Test SPSS Output for Black River

34

Since this assumption was NOT met, we cannot proceed with the One-Way ANOVA. We

will be conducting a Kruskal-Wallis Test instead.

From Figure E.3, we see that our Kruskal-Wallis test results in a test statistic of χ2(3) = 1.776 which gave us a p-value of 0.620. Therefore, there is not significant evidence to show that there is a difference in Black River’s students’ mean section score between the four portions of the HSAG exam.

Figure E.3. Kruskal-Wallis Test for Black River

35

School 2: Cody DIT Figure E.4 shows the variability of Cody DIT students’ mean score for each portion.

Figure E.4. Boxplots of Each Portion of the HSAG Exam for Cody DIT

We will only check for equal variances since we have a large sample to assume normality and independent samples. From Figure E.5, we see the Levene’s test produces a test statistic of 2.054 and a p-value of 0.115, so we conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam for the students of Cody DIT.

Figure E.5. Levene’s Test SPSS Output for Cody DIT Since all three of the assumptions are met, we will proceed with the One-Way ANOVA.

36

From Figure E.6, we see that our One-Way ANOVA test results in a test statistic of F = 1.019, which produces a p-value of 0.390. This means that there not significant evidence that there is a difference in Cody DIT’s students’ mean section score between the four portions of the HSAG exam.

Figure E.6. SPSS Output for the One-Way ANOVA Test for Cody DIT

37

School 3: Dream Academy Figure E.7 shows the variability of Dream Academy’s students’ mean score for each portion.

Figure E.7. Boxplots of Each Portion of the HSAG Exam for Dream Academy

We will only check for equal variances since we have a large sample to assume normality and independent samples. From Figure E.8, we see that the Levene’s test produces a test statistic of 1.680 and a p-value of 0.179, so we conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam for the students of Dream Academy.

Figure E.8. Levene’s Test SPSS Output for Dream Academy Since all three of the assumptions are met, we will proceed with the One-Way ANOVA.

38

From Figure E.9 we see that our One-Way ANOVA test results in a test statistic of F = 6.326, which produces a p-value of 0.001. This means that there is significant evidence that there is a difference in Dream Academy’s students’ mean section score between the four portions of the HSAG exam.

Figure E.9. SPSS Output for the One-Way ANOVA Test for Dream Academy

In order to see which portion causes the school the most difficulty, we ran a Tukey HSD post-hoc procedure so we can analyze which portion results in a significant difference from the other portions. Figure E.10 shows that the lowest mean value is essay and rocks and minerals for the Dream Academy students.

Figure E.10. Tukey HSD Test for Dream Academy

39

School 4: Grosse Point South Figure E.11 shows the variability of Grosse Point South’s students’ mean score for each portion.

Figure E.11. Boxplots of Each Portion of the HSAG Exam for Grosse Point South

We will only check for equal variances since we have a large sample to assume normality and independent samples. From Figure E.12, we see that the Levene’s test produces a test statistic of 1.040 and a p-value of 0.376, so we conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam for the students of Grosse Point South.

Figure E.12. Levene’s Test SPSS Output for Grosse Point South Since all three of the assumptions are met, we will proceed with the One-Way ANOVA.

40

From Figure E.13 we see that our One-Way ANOVA test results in a test statistic of F = 3.102, which produces a p-value of 0.028. This means that there is significant evidence that there is a difference in Grosse Point South’s students’ mean section score between the four portions of the HSAG exam.

Figure E.13. SPSS Output for the One-Way ANOVA Test for Grosse Point South

In order to see which portion causes the school the most difficulty, we ran a Tukey HSD post-hoc procedure so we can analyze which portion results in a significant difference from the other portions. Figure E.14 shows that the lowest mean value are rocks and minerals, maps, and multiple choice for the Grosse Point South’s students.

Figure E.14. Tukey HSD Test for Grosse Point South

41

School 5: Grand Haven

Figure E.15 shows the variability of Grand Haven’s students’ mean score for each portion. Each boxplot has a different shape, and they do not look similar. We suspect that our ANOVA assumptions will not be met.

Figure E.15. Boxplots of Each Portion of the HSAG Exam for Grand Haven

We will only check for equal variances since we presume this assumption will not be met. From Figure E.16, we see the Levene’s test produces a test statistic 3.048 and a p-value of 0.030, so we cannot conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam for the students of Grand Haven.

Figure E.16. Levene’s Test SPSS Output for Grand Haven

Since this assumption was NOT met, we cannot proceed with the One-Way ANOVA. We will be conducting a Kruskal-Wallis Test instead.

42

From Figure E.17, we see that our Kruskal-Wallis test results in a test statistic of χ2(3) = 10.188 which gave us a p-value of 0.017. Therefore, there is significant evidence to show that there is a difference in Grand Haven’s students’ mean section score between the four portions of the HSAG exam.

Figure E.17. Kruskal-Wallis Test for Grand Haven

In order to see which portion causes the school the most difficulty, we ran a post-hoc procedure so we can analyze which portion results in a significant difference from the other portions. Figure E.18 shows that the lowest mean value is multiple choice for the Grand Haven’s students.

Figure E.18. Post Hoc for Grand Haven

43

School 6: Henry Ford Figure E.19 shows the variability of Henry Ford’s students’ mean score for each portion.

Figure E.19. Boxplots of Each Portion of the HSAG Exam for Henry Ford

We will only check for equal variances since we assume high variability in the boxplots. From Figure E.20, we see that the Levene’s test produces a test statistic of 2.971 and a p-value of 0.063, so we can conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam for the students of Henry Ford.

Figure E.20. Levene’s Test SPSS Output for Henry Ford Since all three of the assumptions are met, we will proceed with the One-Way ANOVA.

44

From Figure E.21 we see that our One-Way ANOVA test results in a test statistic of F = 0.148, which produces a p-value of 0.929. This means that there is not significant evidence that there is a difference in Henry Ford’s students’ mean section score between the four portions of the HSAG exam.

Figure E.21. SPSS Output for the One-Way ANOVA Test for Henry Ford

45

School 7: Hudsonville

Figure E.22 shows the variability of Hudsonville’s students’ mean score for each portion. Each boxplot has a different shape, and they do not look similar. We suspect that our ANOVA assumptions will not be met.

Figure E.22. Boxplots of Each Portion of the HSAG Exam for Hudsonville

We will only check for equal variances since we presume this assumption will not be met. From Figure E.23, we see the Levene’s test produces a test statistic 3.220 and a p-value of 0.022, so we cannot conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam for the students of Hudsonville.

Figure E.23. Levene’s Test SPSS Output for Hudsonville

Since this assumption was NOT met, we cannot proceed with the One-Way ANOVA. We will be conducting a Kruskal-Wallis Test instead.

46

From Figure E.24, we see that our Kruskal-Wallis test results in a test statistic of χ2(3) = 62.992 which gave us a p-value less than 0.000. Therefore, there is significant evidence to show that there is a difference in Hudsonville’s students’ mean section score between the four portions of the HSAG exam.

Figure E.24. Kruskal-Wallis Test for Hudsonville

In order to see which portion causes the school the most difficulty, we ran a post-hoc procedure so we can analyze which portion results in a significant difference from the other portions. Figure E.25 shows that the lowest mean value is maps for the Hudsonville’s students, and the average rank for each portion.

Figure E.25. Post Hoc for Hudsonville and Ranks

47

School 8: Multicultural Academy Figure E.26 shows the variability of Multicultural Academy’s students’ mean score for each portion.

Figure E.26. Boxplots of Each Portion of the HSAG Exam for Grosse Point South

We will only check for equal variances since we have a large sample to assume normality and independent samples. From Figure E.27, we see that the Levene’s test produces a test statistic of 0.849 and a p-value of 0.493, so we conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam for the students of Multicultural Academy.

Figure E.27. Levene’s Test SPSS Output for Grosse Point South Since all three of the assumptions are met, we will proceed with the One-Way ANOVA.

48

From Figure E.28 we see that our One-Way ANOVA test results in a test statistic of F = 8.177, which produces a p-value of 0.003. This means that there is significant evidence that there is a difference in Multicultural Academy’s students’ mean section score between the four portions of the HSAG exam.

Figure E.28. SPSS Output for the One-Way ANOVA Test for Multicultural Academy

In order to see which portion causes the school the most difficulty, we ran a Tukey HSD post-hoc procedure so we can analyze which portion results in a significant difference from the other portions. Figure E.29 shows that the lowest mean value are essay and maps for the Multicultural Academy students.

Figure E.29. Tukey HSD Test for Multicultural Academy

49

School 9: Okemos

Figure E.30 shows the variability of Black River’s students’ mean score for each portion. Since there are a few extreme values, we suspect that our ANOVA assumptions will not be met.

Figure E.30. Boxplots of Each Portion of the HSAG Exam for Okemos

We will only check for equal variances since we presume this assumption will not be met. From Figure E.31, we see the test produces a test statistic 3.611 and a p-value of 0.014, so we cannot conclude that there is significant evidence of equal variances across the four portions of the HSAG Exam for the students of Okemos.

Figure E.31. Levene’s Test SPSS Output for Okemos Since this assumption was NOT met, we cannot proceed with the One-Way ANOVA. We will be conducting a Kruskal-Wallis Test instead.

50

From Figure E.32, we see that our Kruskal-Wallis test results in a test statistic of χ2(3) = 3.290 which gave us a p-value of 0.349. Therefore, there is not significant evidence to show that there is a difference in Okemos’s students’ mean section score between the four portions of the HSAG exam.

Figure E.32. Kruskal-Wallis Test for Okemos

HSAG Report - Alison Chapman and Allison Makowski_PDF

Documents

Transcript of HSAG Report - Alison Chapman and Allison Makowski_PDF