admin
Tue, 02/10/2026 - 20:26
Edited Text
Retention and Graduation Rate
Analysis
Prepared for Clarion University of Pennsylvania
April 2013
In the following report, Hanover Research analyzes the retention and graduation rates of
students entering Clarion University of Pennsylvania between 2006 and 2011. More
specifically, we investigate various academic, institutional, and demographic factors that
significantly influence retention and graduation rates.
© 2013 Hanover Research | District Administration Practice
1
Hanover Research | April 2013
TABLE OF CONTENTS
Executive Summary and Key Findings ............................................................................... 3
Introduction ...........................................................................................................................3
Discussion ..............................................................................................................................3
Structure of the Report..........................................................................................................4
Key Findings ...........................................................................................................................4
Factors Promoting Retention.............................................................................................4
Factors Promoting Retention for One Additional Year......................................................5
Factors Correlated with Graduation in Four Years ............................................................5
Factors Correlated with Graduation in Five Years .............................................................6
Section I: Methodology .................................................................................................... 7
Regression Analysis................................................................................................................7
Dependent (Outcome) Variable ........................................................................................8
Independent (Explanatory) Variables ..............................................................................12
Section II: Regression Analysis ........................................................................................ 19
Regression Results: Set 1 Models ........................................................................................19
Key Findings Across Set 1 Models....................................................................................20
Model 1 (2nd Year Retention) ...........................................................................................20
Model 2 and 3 (3rd and 4th Year Retention) .....................................................................20
Model 4 (5th Year Retention) ...........................................................................................21
Model 5 (6th Year Retention) ...........................................................................................21
Regression Results: Set 2 Models ........................................................................................23
Key Findings Across Set 2 Models....................................................................................23
Model 6 (3rd Year Conditional Retention)........................................................................23
Model 7 (4th Year Conditional Retention)........................................................................24
Model 8 (5th Year Conditional Retention)........................................................................24
Model 9 (6th Year Conditional Retention)........................................................................24
Regression Results: Set 3 Models ........................................................................................26
Model 10 (Graduation within Four Years) .......................................................................26
Model 11 (Graduation within Five Years) ........................................................................27
Appendix A: Retention Rates .......................................................................................... 29
Appendix B: Graduation Rates ........................................................................................ 32
© 2013 Hanover Research | District Administration Practice
2
Hanover Research | April 2013
EXECUTIVE S UMMARY AND KEY FINDINGS
INTRODUCTION
In the following report, Hanover Research investigates the factors contributing to bachelor’s
degree-seeking students’ retention and graduation rates at Clarion University of
Pennsylvania (Clarion). In particular, we use the data supplied by Clarion to assess which
demographic, institutional, and academic variables proved the most useful predictors of
four-year and five-year graduation and second, third, fourth, fifth, and sixth year retention
for students entering in fall 2006 through 2011.
In our analysis of Clarion’s data, we developed three sets of models. The first set analyzes
factors associated with higher retention, with a focus on factors Clarion could know before a
student begins their studies at the university. Next, the second set of models analyzes
factors that can help a student be retained for an additional year, given that they were
retained through the previous year. Lastly, the third set of models analyzes factors
associated with increased graduation rates within four years and five years of enrollment at
the university.
DISCUSSION
In terms of the most notable outcomes of our analysis, we find that a student’s high school
ranking is a strong predictor of retention and graduation. A higher high school rank leads
to higher retention, and ultimately a higher likelihood of graduation within four or five years
of enrollment. This finding suggests that as Clarion seeks to boost retention and graduation
rates, the university should strive to increase its admission rates of higher ranked students.
As Clarion explained to Hanover, the university has found high school GPA to be a helpful
predictor of retention and graduation in the past. In the datasets provided to Hanover, an
extremely large number of students had missing values for the high school GPA variable
(listed as “0” in the files) and therefore high school GPA was excluded from our analysis.
Nevertheless, the finding that high school ranking (a reasonable proxy for high school GPA)
is a strong predictor of retention aligns well with Clarion’s experience.
Unfortunately, we also find that minorities, particularly black students, have a lower
likelihood of retention and graduation within four or five years at Clarion. This finding
holds true even after controlling for high school ranking, age, gender, and other observable
variables. Based on this result, Hanover recommends a review of why black students are not
being retained at higher rates at the university, with the goal of uncovering strategies to
better support these students.
Other notable takeaways from our analysis include the finding that being a student athlete
increases a student’s likelihood of retention in their third and fourth year, as well as
© 2013 Hanover Research | District Administration Practice
3
Hanover Research | April 2013
improves their chances of graduating within five years at Clarion. Perhaps unsurprisingly,
having a higher first year GPA also boosts a student’s chances of retention and graduation.
With regard to coursework, we find that if a student attempts a higher number of credits
during the spring semester, they are more likely to be retained the following year.
Additionally, if a student fails particular courses – ENG 111, MATH 050, MATH 110, and
MATH 112 – they are less likely to be retained by the university in the next year. All of these
findings offer insight into concrete predictors of retention and graduation at the university
and should assist in identifying groups of students who may need additional support while
progressing toward graduation.
STRUCTURE OF THE REPORT
The report is organized as follows: In Section I, we provide a description of the methodology
and data used to evaluate the factors affecting graduation and retention. In Section II, we
discuss and present our findings from the regression models that quantify the effects of
individual academic, institutional, and demographic factors on students’ likelihood of
retention and graduation. Further, in Appendix A and B, we present cross-tabulations of
retention and graduation rates broken down by students’ gender, ethnicity, and high school
class rank.
Before proceeding to the body of the report, below we offer a more detailed breakdown of
the key findings of our analysis.
KEY FINDINGS
FACTORS PROMOTING RETENTION
The following factors were found to be key predictors of retention:
1) Ranking in the top quarter of a high school class – For example, a student ranked in
the top 10 percent of his or her class is 19 percent more likely to be retained in their
second year at Clarion compared to a student whose ranking is within the 50-75
percent range of their high school class.1
2) Achieving higher SAT scores – While higher SAT scores were linked to an increased
likelihood of being retained, the magnitude of this effect was fairly small. For
instance, for every 100-point increase in total SAT score (Math, Reading, and Writing
combined), a student’s probability of being retained in his or her second year is
expected to increase by less than 1 percent.
Additionally, minorities, particularly black students, are found to have lower retention
rates. Holding other factors equal, our models indicate that a black student would be 12 to
19 percent less likely to be retained than a white student.
1
Note that the high school class rank measure calculates a given student’s rank as a percentage of the number of
nd
students in his or her high school class. For example, a student who is ranked 32 out of a class of 100 would
have a value of 32 percent on this measure.
© 2013 Hanover Research | District Administration Practice
4
Hanover Research | April 2013
FACTORS PROMOTING RETENTION FOR ONE ADDITIONAL YEAR
Described as “conditional retention” in our report, the following factors are associated with
an increased likelihood of being retained for an additional year if a student has completed
their previous year of study at Clarion.
1) Achieving a higher first year GPA at Clarion – A one-point increase in first year GPA
(e.g., 2.0 to 3.0) is associated with a roughly 3-4 percent increase in the likelihood of
being retained for an additional year.
2) Being a student athlete in the previous year – The chances of an athlete being
retained in the third or fourth year at Clarion is approximately 3-6 percent higher
than that of a non-athlete.
3) Attempting a higher number of credits in the previous semester – For example, a
one-credit increase in the number of credits attempted in the previous spring
semester is associated with a 4 percent increase in the likelihood of being retained
in the third year. Further, a three-credit increase would be associated with a 12
percent boost in the likelihood of being retained.
Additionally, failing the following subjects is associated with a decreased likelihood of
retention for an additional year (if the student has completed the previous year). The
estimated magnitude of this decline in likelihood of retention is 8-18 percent depending on
the course/year of retention.
1) Failing MATH 110 adversely affects third year retention.
2) Failing ENG 111 or MATH 050 adversely affects third and fourth year retention
3) Failing MATH 112 adversely affects third and fifth year retention.
FACTORS CORRELATED WITH GRADUATION IN FOUR YEARS
The following factors are associated with a higher likelihood of graduating in four years.
1) Having a higher high school ranking – For example, a student ranked in the top 10
percent of his or her high school class is roughly 12 percent more likely to graduate
in four years than a student whose ranking falls between 25-50 percent of the class.
2) Having a higher first year GPA – For every one-point increase in first year GPA, a
student’s probability of graduating within four years increases by 15 percent.
Further, being black was found to decrease a student’s likelihood of graduation in four
years by 11 percent, as compared to being white.
© 2013 Hanover Research | District Administration Practice
5
Hanover Research | April 2013
FACTORS CORRELATED WITH GRADUATION IN FIVE YEARS
The following factors are associated with a higher likelihood of graduating in five years.
1) Being female – Increases a student’s likelihood of graduating in five years by 10
percent, as compared to being a male.
2) Ranking in the top half of a high school class – For example, a student whose rank
falls between 50 and 75 percent of the high school class is 17 percent less likely to
graduate within five years than a student in the top 10 percent.
3) Being an athlete – Student athletes are nearly 9 percent more likely to graduate
within five years, as compared to non-athletes.
Finally, similar to our previous findings on retention and four-year graduation, being black
decreases the probability of a student graduating within five years by 13 percent, as
compared to being white.
© 2013 Hanover Research | District Administration Practice
6
Hanover Research | April 2013
SECTION I: METHODOLOGY
In this section, we offer an overview of the data and methodology used in our analysis.
Clarion provided Hanover with seven different datasets (a separate dataset for each year),
containing demographic, academic, and institutional information of students between the
years 2006 and 2012. More specifically, the data included the following:
Demographic data – Students’ gender, age, and race
Institutional data – Students’ starting year, graduation year, and athlete status
Academic data
o Pre-university (high school) – Students’ high school class ranking and SAT
scores
o University – Students’ first GPA, performance in specific courses (ENG 110,
ENG 111, MATH 050, MATH 110, MATH 112, and MGMT 120), number of
credits earned in each fall semester, and number of credits attempted in
each spring semester.
Our analysis is based on a master dataset that was created by combining the seven datasets
provided by Clarion. From each of the datasets, we selected students who enrolled at
Clarion in that year (either freshmen or transfer students). This provided us with a list of
9,131 unique students, who enrolled at Clarion between 2006 and 2011. For each of these
students we checked whether they appeared in the subsequent year’s dataset. Students
who appeared in the subsequent year’s dataset were marked as retained while students
who did not graduate and did not appear in the subsequent year’s dataset were marked as
not retained. Note that 299 students were dropped from our combined dataset as these
students continued to appear in the Clarion datasets even though they had been identified
as graduates of the previous year. Our final dataset available for analysis therefore contains
8,832 unique students.
REGRESSION ANALYSIS
In order to examine the impact of the various demographic, academic, and institutional
factors on retention and graduation rates, we constructed a series of linear probability
models (LPMs). Overall, we estimated 11 models grouped as Sets 1, 2, and 3, based on the
dependent (i.e., outcome) variable used.2 LPMs represent a variation of the standard
ordinary least squares (OLS) regression model used to analyze dichotomous dependent
variables. Dichotomous variables assume one of two values; in the context of the present
analysis, we assign a value of 1 to our dependent variable in cases where students meet a
specific criterion (e.g., having been retained or having graduated) and a value of 0 otherwise
(having not been retained or having not graduated).
2
As described in greater detail below, the dependent variables examined in this analysis include retention
(corresponding to Set 1), “conditional” retention (Set 2), and graduation (Set 3).
© 2013 Hanover Research | District Administration Practice
7
Hanover Research | April 2013
For each of our regression models, we provide coefficient estimates, as well as an indication
of which coefficients proved statistically significant. The coefficients of our models reveal
how much we expect the dependent variable to change when the independent variable
increases by one unit, holding all of the other independent variables in the model constant.
In the context of LPMs, the coefficients capture the change in the likelihood of retention or
graduation, depending on the model estimated. More specifically, multiplying the
coefficients by 100 percent indicates the extent to which the likelihood changes due to a
one-unit increase in the associated independent variable. For example, as seen in Figure 2.2,
a one-unit increase in first GPA leads to a 4.21 percent rise in the probability of first year
retention in Model 6.
Lastly, when presenting our results, we also include the R-squared value, which reveals the
percentage of variation in the dependent variable accounted for by the model. Using Figure
2.2 and Model 6 as an example once again, the R-squared value of 0.3284 indicates that the
model can explain 32.84 percent of the variation in retention.
DEPENDENT (OUTCOME) VARIABLE
As discussed previously, we used three types of dependent, or outcome, variables in our
regression analysis. We describe the dependent variables as follows:
Set 1 (Models 1, 2, 3, 4, and 5): The dependent variable in this set of models
indicates whether or not a student (who did not yet graduate) was retained in a
particular year of study. For instance, the second year retention of a student who
initially enrolled at Clarion in 2006 indicates whether or not the student was
retained in 2007. Similarly, the third year retention of that same student indicates
whether he or she was retained in 2008. Please note that a particular year’s
retention, as defined by this outcome variable, is independent of the students’
retention in the immediate previous year.3 Figure 1.1 below helps understand what
a particular year’s retention means for students who enrolled at Clarion between
2006 and 2011. We estimate separate models for different years of retention.
STARTING
3
Figure 1.1: Retention Years
2 YEAR
3RD YEAR
4TH YEAR
5TH YEAR
ND
6TH YEAR
YEAR
RETENTION
RETENTION
RETENTION
RETENTION
RETENTION
2006
2007
2008
2009
2007
2008
2009
2010
2008
2009
2010
2011
2009
2010
2011
2012
2010
2011
2012
--
2011
2012
---
In other words, if a student initially enrolled in 2006 but did not re-enroll in 2007, they would be marked as retained
under “second year retention.” However, if this same student re-enrolled in 2008, they would then be marked as
retained under “third year retention.” As discussed in greater detail below, the Set 2 models focus on “conditional
retention,” a measure that takes into account whether a student had been enrolled in the previous year at Clarion
in addition to whether they were enrolled in the current year.
© 2013 Hanover Research | District Administration Practice
8
Hanover Research | April 2013
STARTING
2ND YEAR
3RD YEAR
4TH YEAR
5TH YEAR
6TH YEAR
YEAR
RETENTION
RETENTION
RETENTION
RETENTION
RETENTION
2010
2011
2011
2012
2012
--
---
---
---
In Figure 1.2, we provide descriptive statistics of the dependent variables used for each
model in Set 1. For each of the student cohorts (e.g., students beginning in fall 2006,
students beginning in fall 2007, etc.), we provide the percentage of students who were
retained and not retained. The table indicates that for all cohorts, the retention rate
declines over the years. Please note that we have also provided a breakdown of retention
rates by various subgroups in Appendix A.
Figure 1.2: Distribution of Dependent Variables in Set 1 – Descriptive Statistics
STARTING
TERM
Fall 2006
Fall 2007
Fall 2008
Fall 2009
Fall 2010
Fall 2011
STATUS
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Total
2ND YEAR
3RD YEAR
4TH YEAR
5TH YEAR
6TH YEAR
RETENTION
RETENTION
RETENTION
RETENTION
RETENTION
32%
68%
1,436
29%
71%
1,437
31%
69%
1,416
31%
69%
1,582
30%
70%
1,527
31%
69%
1,418
8,816
44%
56%
1,414
39%
61%
1,421
43%
57%
1,388
41%
59%
1,561
42%
58%
1,495
---7,279
50%
50%
1,346
44%
56%
1,365
49%
51%
1,326
48%
52%
1,499
------5,536
69%
31%
1,013
67%
33%
979
71%
29%
989
---------2,981
92%
8%
792
92%
8%
744
------------1,536
Set 2 (Models 6, 7, 8, and 9): The dependent variable for the models in Set 2
assumes a value of 1 whenever a student who was retained in a particular year is
enrolled in the following year and a value of 0 whenever a student who was retained
in a particular year is not enrolled in the following year. This is described as
“conditional” retention, as being retained in a given year is conditioned on whether
a student was retained in the previous year. For instance, in the case of a student
who started in 2006, fourth year conditional retention refers to that student being
© 2013 Hanover Research | District Administration Practice
9
Hanover Research | April 2013
retained in 2009 given that he/she was also retained in 2008. The table below
provides brief clarification of what a particular year’s conditional retention indicates.
Figure 1.3: Conditional Retention
STARTING
YEAR
2006
2007
2008
2009
2010
RD
3 YEAR CONDITIONAL
4TH YEAR CONDITIONAL
5TH YEAR CONDITIONAL
6TH YEAR CONDITIONAL
RETENTION
RETENTION
RETENTION
RETENTION
Students retained in
2007, did not graduate,
and were retained
again in 2008
Students retained in
2008, did not graduate,
and were retained
again in 2009
Students retained in
2009, did not graduate,
and were retained
again in 2010
Students retained in
2010, did not graduate,
and were retained
again in 2011
Students retained in
2011, did not graduate,
and were retained
again in 2012
Students retained in
2008, did not graduate,
and were retained
again in 2009
Students retained in
2009, did not graduate,
and were retained
again in 2010
Students retained in
2010, did not graduate,
and were retained
again in 2011
Students retained in
2011, did not graduate,
and were retained
again in 2012
Students retained in
2009, did not graduate,
and were retained
again in 2010
Students retained in
2010, did not graduate,
and were retained
again in 2011
Students retained in
2011, did not graduate,
and were retained
again in 2012
Students retained in
2010, did not graduate,
and were retained
again in 2011
Students retained in
2011, did not graduate,
and were retained
again in 2012
--
--
--
--
--
--
As the above table illustrates, conditional retention only looks at the group of students who
had been retained in the previous year. For example, if a student is marked as “not
retained” in the third year, they would not be included in the calculation of fourth year
retention (only students who had been retained in the third year would be included). This
differs from the dependent variable used in the Set 1 models, where the measure of
retention is not conditioned on whether the student was enrolled in the previous year.
Figure 1.4 on the following page provides descriptive statistics of the dependent variables
used in the Set 2 models. In general, between the third year and fifth year, the conditional
retention rate increases slightly. Note that we have also provided a breakdown of the
conditional retention rates by various sub-groups in Appendix A.
© 2013 Hanover Research | District Administration Practice
10
Hanover Research | April 2013
Figure 1.4: Distribution of Dependent Variables in Set 2 - Descriptive Statistics
STARTING
YEAR
Fall 2006
Fall 2007
Fall 2008
Fall 2009
Fall 2010
STATUS
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Total
3RD YEAR
4TH YEAR
5TH YEAR
6TH YEAR
CONDITIONAL
CONDITIONAL
CONDITIONAL
CONDITIONAL
RETENTION
RETENTION
RETENTION
RETENTION
21%
79%
956
17%
83%
1,002
19%
81%
952
18%
82%
1,074
19%
81%
1,039
5,023
11%
89%
726
9%
91%
808
10%
90%
729
10%
90%
853
---3,116
15%
85%
347
18%
82%
381
17%
83%
336
------1,064
42%
58%
96
45%
55%
95
---------191
Set 3 (Models 10 and 11): The dependent variable in this set of models indicates
whether or not a student graduated either within four years of enrollment (Model
10) or five years of enrollment (Model 11). Figure 1.5 on the following page provides
descriptive statistics of this dependent variable. As expected, the graduation rate
within five years is higher than the graduation rate within four years, as students
have had more time to complete their studies. We have also provided a breakdown
of the graduation rates by subgroups in Appendix B.
Please note that we do not have complete four-year data for students who enrolled
in 2009, 2010, and 2011. Furthermore, we also do not have five-year data for
students who enrolled in 2008. For most of these students, we do not know
whether they graduated within these years. For students in these years, we take
into account the students who are listed as graduates only. Other students are
excluded from the analysis as they are missing full four- and five-year data.
© 2013 Hanover Research | District Administration Practice
11
Hanover Research | April 2013
Figure 1.5: Distribution of Dependent Variables in Set 3 – Descriptive Statistics
STARTING
YEAR
Fall 2006
Fall 2007
Fall 2008
Fall 2009
Fall 2010
Fall 2011
STATUS
GRADUATION
WITHIN 4 YEARS
GRADUATION
WITHIN 5 YEARS
Did not graduate
Graduated
Count
Did not graduate
Graduated
Count
Did not graduate
Graduated
Count
Graduated
Count
Graduated
Count
Graduated
Count
Total
70%
30%
1,438
68%
32%
1,437
70%
30%
1,418
100%
131
100%
57
100%
11
4,492
55%
45%
1,438
52%
48%
1,437
-100%
542
100%
131
100%
57
100%
11
3,616
INDEPENDENT (EXPLANATORY) VARIABLES
We included a series of independent, or explanatory, variables in the models in order to
control for factors affecting the dependent variables of interest (retention, conditional
retention, and graduation). We provide a complete list of the independent variables used in
Figure 1.6. The table defines each independent variable, indicates the model(s) in which the
variable appears, and the variable type. Note that in general, if a variable was not found to
have a statistically significant effect on the outcome variable of interest (retention or
graduation), it was excluded from the model. For example, the “Female” variable did not
have a significant relationship with retention/graduation in Models 1-10 but did have a
significant effect on likelihood of graduation within five years. Therefore, the “Female”
variable was only included in our final version of Model 11.
Figure 1.6: Summary of Independent Variables
VARIABLES
Female
SUMMARY
Gender of students: (0) Male (1) Female
Age
Age of students
Age squared
Ethnicity
The squared value of the age of the students. This
allows us to examine whether age has a non-linear
effect on the dependent variable.
Ethnicity of students, selected from three categories:
Black, White, and “other or two or more races.”
© 2013 Hanover Research | District Administration Practice
MODEL
Model 11
Models 3, 5, 7,
and 10
VARIABLE TYPE
Categorical
Models 3 and 10
Continuous
Models 1, 2, 3, 4,
6, 8, 10, and 11
Categorical
Continuous
12
Hanover Research | April 2013
VARIABLES
High School Rank
SAT score total
SAT Math,
Reading, Writing
First GPA
ENG 111*
MATH 050*
MATH 110*
MATH 112*
SUMMARY
Student’s high school rank as a percentage of class
size. In other words, if a student was ranked 29th out
of a high school class of 100, their value for this
variable would be 29 percent. A student with a lower
ranking in the same high school class, say 45th out of
100, would have a value of 45 percent. As such,
higher values for this variable represent lower class
ranks (e.g., 45th) while lower values represent higher
class rankings (e.g., 29th).
Student’s total SAT scores (includes math, reading
and writing). We divided SAT score by 100 so that
our models show the effect of a 100 point increase in
SAT rather than a single point increase.
In cases where total SAT score was not found to a
statistically significant effect on retention or
graduation, we entered individual Math, Reading,
and/or Writing scores into the models. These scores
remained in the final models if they were found to
have a statistically significant effect on the outcome
of interest.
Student's first GPA at Clarion.
Whether student passed, failed, or did not attempt
this course.
Whether student passed, failed, or did not attempt
this course.
Whether student passed, failed, or did not attempt
this course.
Whether student passed, failed, or did not attempt
this course.
Athlete
Whether students is an athlete: (0) No (1) Yes
Credits attempted
in previous Spring
Credits earned in
previous Fall
Number of credits attempted in the previous Spring
semester
Number of credits earned in the previous Fall
semester
MODEL
VARIABLE TYPE
Models 1, 2, 3, 4,
9, 10, and 11
Categorical/Continuous
Models 1, 2, 3, 4,
6, 7, 8 , 9, and 10
Continuous
Models 5, 10,
and 11
Continuous
Models 6, 7, 8,
10, and 11
Continuous
Models 6 and 7
Categorical
Model 6 and 7
Categorical
Models 6 and 7
Categorical
Model 6, 7, and
8
Model 6, 7, and
11
Models 6, 7, 8,
and 9
Model 6
Categorical
Categorical
Continuous
Continuous
*Note that ENG 110 and MGMT 120 were excluded from our models as an extremely high percentage of students in our dataset
(90 percent or more for most cohorts) did not attempt these courses. In general, when selecting variables for inclusion in our
models, Hanover sought to use variables for which a substantial number of students had non-zero values.
Figures 1.7-1.12 present descriptive statistics for each independent variable used in the
models. While Figure 1.7 shows the distribution of students’ gender, ethnicity, and high
school ranking, Figure 1.8 and 1.9 shows descriptive statistics for some measures of
academic performance.
In our models, we used two forms of the independent variable that indicates whether a
student is an athlete. For the models in Set 2, the athlete variable refers to a student being
© 2013 Hanover Research | District Administration Practice
13
Hanover Research | April 2013
an athlete in the year immediately prior to the year in which retention is measured. For
instance, Model 7, which analyzes fourth year conditional retention, indicates the effect of a
student being an athlete in his/her third year only. By contrast, the athlete variable used in
the Set 3 models (Model 11), indicates whether the student was an athlete at any point
within the four or five years taken into account. Descriptive statistics of these alternate
forms of the athlete variable are provided in Figures 1.10 and 1.11. Finally, Figure 1.12
provides the average number of credits earned (fall) and attempted (spring) for each year of
our dataset.
© 2013 Hanover Research | District Administration Practice
14
Hanover Research | April 2013
Figure 1.7: Gender, Ethnicity, and High School Ranking – Descriptive Statistics
GENDER
FALL 2006
FALL 2007
FALL 2008
FALL 2009
FALL 2010
FALL 2011
Female
57% (n=803)
55% (n=789)
56% (n=793)
58% (n=924)
62% (n=941)
60% (n=850)
Male
43% (n=604)
45% (n=648)
44% (n=625)
42% (n=661)
38% (n=589)
40% (n=574)
Black
Other or two or more races4
6% (n=85)
3% (n=37)
7% (n=98)
3% (n=40)
9% (n=125)
4% (n=63)
9% (n=135)
4% (n=55)
9% (n=130)
5% (n=70)
9% (n=122)
6% (n=84)
White
91% (n=1281)
90% (n=1287)
87% (n=1225)
88% (n=1354)
86% (n=1276)
85% (n=1198)
0% to 10%
8% (n=108)
9% (n=111)
9% (n=114)
8% (n=104)
9% (n=117)
8% (n=98)
10% to 25%
19% (n=243)
15% (n=189)
19% (n=229)
19% (n=254)
20% (n=260)
18% (n=214)
25% to 50%
35% (n=445)
35% (n=443)
36% (n=443)
35% (n=476)
34% (n=446)
35% (n=413)
50% to 75%
27% (n=349)
29% (n=371)
27% (n=333)
26% (n=349)
26% (n=335)
25% (n=292)
75% to 100%
10% (n=132)
12% (n=156)
9% (n=109)
12% (n=162)
11% (n=145)
13% (n=157)
ETHNICITY
HIGH SCHOOL RANKING
Figure 1.8: Age, High School Rank, SAT Score, and First GPA- Descriptive Statistics
VALUES
Average of Age
Average of Rank as Percentage of Class Size
Average of SAT Total*
Average of SAT Math*
Average of SAT Reading*
Average of SAT Writing*
Average of First GPA
FALL 2006
18.93
43%
1381
462
466
455
2.44
FALL 2007
19.13
45%
1419
483
477
462
2.49
FALL 2008
19.20
42%
1412
479
474
460
2.51
FALL 2009
19.60
44%
1407
477
471
458
2.64
FALL 2010
19.70
43%
1419
478
477
464
2.66
FALL 2011
19.76
44%
1402
478
475
459
2.63
TOTAL
19.39
43%
1408
477
474
460
2.56
*Note that all SAT scores listed as “0” were recoded as missing and are not reflected in these averages.
4
Includes students who are listed as Asian, Hispanic, Native American, Pacific Islander, and from two or more races.
© 2013 Hanover Research | District Administration Practice
15
Hanover Research | April 2013
Figure 1.9: University Course Performance – Descriptive Statistics
COURSE
ENG 110
ENG 111
MATH 050
MATH 110
MATH 112
MGMT 120
VALUES
Did not attempt
Did not pass
Passed
Did not attempt
Did not pass
Passed
Did not attempt
Did not pass
Passed
Did not attempt
Did not pass
Passed
Did not attempt
Did not pass
Passed
Did not attempt
Did not pass
Passed
FALL 2006
99% (n=1429)
0% (n=)
1% (n=9)
27% (n=393)
10% (n=137)
63% (n=908)
79% (n=1135)
5% (n=75)
16% (n=228)
83% (n=1200)
4% (n=64)
12% (n=174)
76% (n=1099)
4% (n=63)
19% (n=276)
92% (n=1327)
1% (n=15)
7% (n=96)
© 2013 Hanover Research | District Administration Practice
FALL 2007
99% (n=1425)
0% (n=3)
1% (n=9)
21% (n=304)
11% (n=161)
68% (n=972)
81% (n=1165)
4% (n=62)
15% (n=210)
81% (n=1168)
4% (n=64)
14% (n=205)
77% (n=1112)
4% (n=57)
19% (n=268)
89% (n=1277)
2% (n=27)
9% (n=133)
FALL 2008
91% (n=1288)
2% (n=22)
8% (n=108)
25% (n=351)
10% (n=148)
65% (n=919)
80% (n=1141)
5% (n=64)
15% (n=213)
83% (n=1176)
5% (n=71)
12% (n=171)
76% (n=1075)
5% (n=72)
19% (n=271)
88% (n=1251)
1% (n=18)
11% (n=149)
FALL 2009
95% (n=1499)
1% (n=15)
4% (n=71)
26% (n=407)
8% (n=129)
66% (n=1049)
80% (n=1263)
4% (n=60)
17% (n=262)
87% (n=1375)
4% (n=62)
9% (n=148)
73% (n=1159)
4% (n=71)
22% (n=355)
92% (n=1458)
1% (n=13)
7% (n=114)
FALL 2010
85% (n=1304)
2% (n=37)
12% (n=189)
31% (n=481)
8% (n=120)
61% (n=929)
78% (n=1193)
5% (n=78)
17% (n=259)
89% (n=1361)
4% (n=54)
8% (n=115)
72% (n=1105)
5% (n=75)
23% (n=350)
90% (n=1375)
2% (n=31)
8% (n=124)
16
FALL 2011
84% (n=1192)
1% (n=20)
15% (n=212)
32% (n=459)
7% (n=105)
60% (n=860)
84% (n=1191)
3% (n=47)
13% (n=186)
89% (n=1263)
4% (n=51)
8% (n=110)
76% (n=1081)
4% (n=63)
20% (n=280)
90% (n=1281)
2% (n=26)
8% (n=117)
TOTAL
3,495
675
4,662
8,137
97
598
2,395
800
5,637
7,543
366
923
6,631
401
1,800
7,969
130
733
Hanover Research | April 2013
Figure 1.10: Athlete in the Year Prior to Retention – Descriptive Statistics
2006
2007
2008
2009
2010
ATHLETE STATUS
Athlete prior to 2nd year
retention
Athlete prior to 3rd year
retention
Athlete prior to 4th year
retention
Athlete prior to 5th year
retention
NO
YES
TOTAL
NO
YES
TOTAL
NO
YES
TOTAL
NO
YES
TOTAL
NO
YES
TOTAL
92%
8%
978
92%
8%
1,018
91%
9%
980
92%
8%
1,095
92%
8%
1,071
94%
6%
792
93%
7%
864
91%
9%
788
93%
7%
914
93%
7%
861
96%
4%
675
94%
6%
766
91%
9%
670
93%
7%
780
--
--
--
98%
2%
311
97%
3%
322
96%
4%
289
--
--
--
--
--
--
© 2013 Hanover Research | District Administration Practice
17
Hanover Research | April 2013
Figure 1.11: Athlete Before Graduation – Descriptive Statistics
Fall 2006
Fall 2007
Fall 2008
Fall 2009
Fall 2010
Fall 2011
Total
GRADUATION WITHIN 4 YEARS
YES
NO
TOTAL
8%
92%
1,024
8%
92%
1,063
10%
90%
1,003
8%
92%
1,134
8%
92%
1,089
---8%
92%
5,313
GRADUATION WITHIN 5 YEARS
YES
NO
TOTAL
26%
74%
322
25%
75%
342
30%
70%
323
12%
88%
760
10%
90%
863
0%
100%
1,424
11%
89%
4,034
Figure 1.12: Average Credits Earned and Attempted
2007
2008
2009
2010
2011
2012
Fall Credits Earned
14.74
14.81
14.68
14.48
14.31
14.11
Spring Credits Attempted
13.22
13.42
13.34
13.18
12.89
12.32
© 2013 Hanover Research | District Administration Practice
18
Hanover Research | April 2013
SECTION II: R EGRESSION A NALYSIS
In this section, we present the results of our regression models that help us understand the
factors that affect student retention and graduation rates at Clarion. As noted previously,
the models included in Set 1 and Set 2 examine students’ retention rates between the
second and sixth years at the university, while the models in Set 3 examine students’ fouryear and five-year graduation rates.
The results of each set of regression models are presented in Figures 2.1, 2.2., and 2.3. Each
set of models focuses on a different combination of variables. More specifically,
The models included in Set 1 (Figure 2.1) were constructed to examine the effects of
a variety of factors Clarion would be aware of prior to a student’s initial enrollment
at the university (e.g., gender, age, ethnicity, high school rank, and SAT). The
outcome variable for these Set 1 models is retention.
The Set 2 models (Figure 2.2) analyze the effects of a range of demographic, preuniversity, and university characteristics (e.g., age, gender, high school rank, SAT,
first GPA at Clarion, passage/failure of specific courses, athlete status, credits
attempted and earned) on conditional retention. As mentioned previously,
conditional retention takes into account (or is “conditioned on”) whether a student
was retained in the previous year. In other words, it only measures the retention of
students in a given year if they were also retained in the previous year.
Finally, the Set 3 models (Figure 2.3) analyze the effects of demographic, preuniversity, and university characteristics on graduation within four years and
graduation within five years.
REGRESSION RESULTS: SET 1 MODELS
We recall that Set 1, which covers Models 1 to 5, included a dependent variable showing the
proportion of students who re-enrolled at Clarion in each of the five years. We developed
separate models for the different years of retention (e.g., second year retention, third year
retention, fourth year retention, etc.) with only demographic and pre-university data
included in the models. Further, in general, the models only include variables that have a
statistically significant effect on our dependent variables5 and in the points below, we only
highlight findings that are statistically significant.
5
In the figures, statistical significance is denoted with asterisks:
*** denotes statistical significance at 1 percent – meaning that there is a less than 1 percent likelihood that the
observed effect is due to chance.
** denotes statistical significance at 5 percent – meaning that there is a less than 5 percent likelihood that the
observed effect is due to chance.
* denotes statistical significance at 10 percent – meaning that there is a less than 10 percent likelihood that the
observed effect is due to chance.
© 2013 Hanover Research | Academy Administration Practice
19
Hanover Research | April 2013
KEY FINDINGS ACROSS SET 1 MODELS
The following subsections highlight findings relative to each individual model. In terms of
broader points that can be drawn across the Set 1 models, we find that in general, the
following are associated with a higher likelihood of being retained:
Being ranked in the top quarter of a high school class
Having higher SAT scores
By contrast, minorities, and particularly black students, are less likely to be retained than
white students.
MODEL 1 (2ND YEAR RETENTION)
Relative to white students, black students and students from other racial/ethnic
backgrounds are less likely to be retained in their second year at Clarion.
Students ranked below the top 25 percent of their high school classes are less likely
to be retained in their second year.6 In fact, outside of the top 25 percent, the lower
the student’s ranking, the less likely the student is to be retained (e.g., students
whose ranking is within 50-75 percent of their high school class are less likely to be
retained than students whose ranking is within 25-50 percent of their high school
class).
Second year retention has a positive relationship with SAT scores. With every 100point increase in SAT scores, a student’s chances of being retained increase slightly
(by 0.78 percent).
MODEL 2 AND 3 (3RD AND 4TH YEAR RETENTION)
6
Black students and students from other ethnic backgrounds are less likely to be
retained in their third and fourth years than white students.
Students ranked below the top 25 percent of their high school classes are less likely
to be retained in their third and fourth years than students ranked in the top 10
percent. Similar to Model 1, outside of the top 25 percent, the lower a students’
high school rank, the less likely they are to be retained.
Note that we make this conclusion based on a combination of results. First, high school rank was separated into a
series of dummy variables where the reference category was a high school rank within the top 10 percent of a
students’ high school class. For example, the negative coefficient in Model 1 for the variable listed as “High School
Rank (25%-50%)” indicates that students whose ranking was among 25-50 percent of their high school class were
significantly less likely to be retained than students in the top 10 percent. The same could be said for students in
the 50-75 percent range and the 75-100 percent range. However, the variable measuring “High School Rank (10%25%)” was not found to a have a statistically significant effect (i.e., students in the 10-25 percent range were not
significantly less likely to be retained than students in the top 10 percent). Taken together, these results suggest
that students ranked within the top 25 percent of their high school class are more likely to be retained than
students with lower rankings.
© 2013 Hanover Research | Academy Administration Practice
20
Hanover Research | April 2013
Students with higher SAT scores are more likely to be retained in their third year at
Clarion. Once again, a 100-point increase in SAT scores increases a student’s chances
of being retained by a relatively small margin (i.e., a boost of 0.84 percent in their
likelihood of being retained in their third year). This effect was not observed with
regard to fourth year retention.
Older students are less likely to be retained in their fourth year at Clarion than
younger students. This finding did not hold true with regard to third year retention.
MODEL 4 (5TH YEAR RETENTION)
Similar to the previous models, black students and students of other ethnicities are
less likely to be retained in their fifth year than their white classmates.
Students ranked in the lower half of their high school classes (50-100 percent) are
less likely to be retained compared to students in the top half of their high school
classes.
MODEL 5 (6TH YEAR RETENTION)
Similar to Model 3 (fourth year retention), older students are less likely to be
retained in their sixth year than younger students.
While total SAT scores were not found to have an effect on sixth year retention,
higher SAT Reading scores were associated with a higher likelihood of retention and
higher SAT Writing scores were associated with a lower likelihood of retention.
Note, however, these effects appear fairly small in terms of magnitude. A 100-point
increase in Reading scores corresponds to a 3.43 percent increase in the likelihood
of being retained in the sixth year. A 100-point increase in Writing scores
corresponds to a 4.42 percent decrease in the likelihood of being retained.
© 2013 Hanover Research | Academy Administration Practice
21
Hanover Research | April 2013
Figure 2.1: Regression Output Results - Set 1
MODEL 1
ND
2 YEAR
MODEL 2
RD
3 YEAR
MODEL 3
TH
4 YEAR
MODEL 4
TH
5 YEAR
MODEL 5
TH
6 YEAR
RETENTION
RETENTION
RETENTION
RETENTION
RETENTION
Age
Age squared
-0.0584***
-0.0281*
(0.0121)
(0.017)
0.0008***
(0.0002)
Black
7
Other or two or more races
7
High School Rank (10%-25%)
8
8
High School Rank (25%-50%)
8
High School Rank (50%-75%)
High School Rank (75%-100%)
SAT Total
8
-0.1362***
-0.1864***
-0.1961***
-0.1231***
(0.022092)
(0.0261)
(0.0268)
(0.0329)
-0.0681**
-0.0711*
-0.0766*
-0.1194**
(0.032796)
(0.0409)
(0.0459)
(0.0588)
-0.0124
-0.0218
-0.0250
0.0379
(0.023772)
(0.0284)
(0.0297)
(0.0465)
-0.0718***
-0.0950***
-0.1150***
-0.0096
(0.022793)
(0.0272)
(0.0272)
(0.0424)
-0.1857***
-0.2156***
-0.2487***
-0.0722*
(0.024141)
(0.0288)
(0.0279)
(0.0425)
-0.2757***
-0.3569***
-0.3317***
-0.1442***
(0.028049)
(0.0337)
(0.0323)
(0.0464)
0.0078**
0.0084**
(0.003156)
(0.0038)
SAT Read
0.0343**
(0.0158)
SAT Write
-0.0442***
(0.0165)
Constant
0.7294***
0.6387***
1.5181***
0.3791***
0.6320**
(0.055672)
(0.0664)
(0.1580)
(0.0396)
(0.3174)
Number of observations
5,949
4,907
4,786
2,558
986
R-squared
0.05398
0.0671
0.06993
0.0211
0.009382
Coefficients estimated using Ordinary Least Squares with a linear regression model, with standard errors in
parenthesis. Statistical significance indicator using asterisk next to the coefficients, with * = significant at 10%, ** =
significant at 5%, and *** = significant at 1%.
7
8
Reference category for ethnicity is white
Reference category for high school rank is 0% to 10%
© 2013 Hanover Research | Academy Administration Practice
22
Hanover Research | April 2013
REGRESSION RESULTS: SET 2 MODELS
Figure 2.2, which includes Models 6, 7, 8, and 9, illustrates the relationship of various
factors on the conditional retention of students in their third, fourth, fifth, and sixth year at
Clarion. Once again, recall that conditional retention only looks at the retention of students
in a given year if they were retained in the previous year.
KEY FINDINGS ACROSS SET 2 MODELS
In terms of broad findings across the Set 2 models, we note that the following factors are
associated with a higher likelihood of being retained for one additional year (if the student
had been retained in the previous year):
Having a higher first year GPA
Being a student athlete in the previous year
Having attempted a higher number of credits in the previous semester
Additionally, having failed specific courses is associated with a lower likelihood of retention
for one additional year. These include ENG 111 (third and fourth year retention), MATH 050
(third and fourth year retention), MATH 110 (third year retention), and MATH 112 (third and
fifth year retention).
MODEL 6 (3RD YEAR CONDITIONAL RETENTION)
Compared to white students, black students who were retained in their second year
at Clarion are less likely to be retained in their third year at Clarion.
SAT score was found to have a negative relationship with conditional retention.
However, similar to our other findings surrounding SATs, the effect appears small. A
100-point increase in SAT score corresponds to a 1.05 percent decrease in the
likelihood of third year retention.
Students with higher first year GPAs at Clarion are more likely to be retained in their
third year than students with lower GPAs.
Students who failed ENG 111, MATH 050, MATH 110, and MATH 112 in the previous
year are less likely to be retained than students who did not attempt these courses.
Students who were listed as athletes in their second year at Clarion are more likely
to be retained in their third year than non-athletes.
Attempting more credits during the previous spring semester increases a student’s
likelihood of being retained in the third year. A one-credit increase is associated with
a 4.07 percent boost in the likelihood of retention (so a three-credit increase would
translate to a 12 percent boost). By contrast, the number of credits earned in the
previous fall semester was found to have a negative impact on likelihood of
retention. However, this effect was small (a one-credit increase in the number of
credits earned in the fall was associated with a 0.61 percent decrease in the
© 2013 Hanover Research | Academy Administration Practice
23
Hanover Research | April 2013
likelihood of retention), and this finding did not hold for retention in the fourth,
fifth, and sixth years (Models 7-9).
MODEL 7 (4TH YEAR CONDITIONAL RETENTION)
Among students who were retained in their third year, older students are less likely
to be retained in their fourth year than younger students.
SAT score also has a negative effect on the likelihood of retention, though once
again, the magnitude of this effect appears small in practical terms. A 100-point
increase in SAT scores is associated with a 0.63 percent decrease in the likelihood a
student will be retained in their fourth year.
Students with higher first year GPAs are more likely to be retained in their fourth
year.
Compared to students who did not attempt ENG 111 and MATH 050, students who
failed these courses have a lower chance of being retained in their fourth year.
Students who passed MATH 110 and MATH 112 in the previous year are expected to
have a higher retention rate than students who did not attempt these courses.
Students who were listed as athletes in their third year are more likely to be
retained in their fourth year.
Students who attempted more credits in the previous spring semester are more
likely to be retained in their fourth year.
MODEL 8 (5TH YEAR CONDITIONAL RETENTION)
Black students are less likely to be retained in their fifth year than white students.
First year GPA has a positive impact on students’ fifth year conditional retention. As
a student’s first GPA increases, he or she is more likely to be retained in the fifth
year.
Students who failed MATH 112 are less likely to be retained in their fifth year than
students who did not attempt the course.
Students who attempted more credits in the previous spring semester are more
likely to be retained in their fifth year.
Similar to Model 6, higher SAT scores are associated with a lower likelihood of being
retained in the fifth year. A 100-point increase in SATs is associated with a 1.57
percent decrease in the likelihood of retention at this stage.
MODEL 9 (6TH YEAR CONDITIONAL RETENTION)
An increase in high school rank as a percentage of class size is associated with a
decrease in the likelihood that a student would be retained in the sixth year. In
other words, lower ranked students (e.g., a student ranked 75th out of 100) are less
© 2013 Hanover Research | Academy Administration Practice
24
Hanover Research | April 2013
likely to be retained in the sixth year than students with higher ranks (e.g., a student
ranked at 25th out of 100).
Number of credits attempted in the previous spring semester has a positive impact
on students’ sixth year retention. Students who attempted more credits in the
previous spring are more likely to be retained in their sixth year.
Figure 2.2: Regression Output Results - Set 2
MODEL 6
RD
3 YEAR
CONDITIONAL
MODEL 7
TH
4 YEAR
CONDITIONAL
MODEL 8
TH
5 YEAR
CONDITIONAL
MODEL 9
TH
6 YEAR
CONDITIONAL
RETENTION
RETENTION
RETENTION
RETENTION
Age
-0.0286***
(0.0091)
Black
9
Other of two or more races
9
SAT
-0.0595***
-0.1337***
(0.021)
(0.044)
-0.0140
-0.0624
(0.0301)
(0.073)
-0.0105***
-0.0063**
-0.0157**
(0.003)
(0.003)
(0.0061)
High School Rank as percentage of class size
-0.4786***
(0.1566)
First GPA
ENG 111 in the previous year: Failed
10
ENG 111 in the previous year: Passed
10
MATH 050 in the previous year: Failed
11
MATH 050 in the previous year: Passed
MATH 110 in the previous year: Failed
12
MATH 110 in the previous year: Passed
MATH 112 in the previous year: Failed
11
12
13
0.0421***
0.0281***
0.0461***
(0.0082)
(0.0077)
(0.0162)
-0.1564***
-0.1074***
(0.0289)
(0.0331)
-0.0184
0.0089
(0.0168)
(0.0197)
-0.1854***
-0.1779***
(0.0364)
(0.0397)
-0.0151
-0.0059
(0.0145)
(0.0136)
-0.1475***
-0.0367
(0.0283)
(0.0259)
-0.0186
0.0353***
(0.0155)
(0.0135)
-0.0837***
-0.0256
-0.1296**
9
Reference category for ethnicity is white
Reference category for ENG 111 is did not attempt
11
Reference category for MATH 050 is did not attempt
12
Reference category for MATH 110 is did not attempt
13
Reference category for MATH 112 is did not attempt
10
© 2013 Hanover Research | Academy Administration Practice
25
Hanover Research | April 2013
MODEL 6
RD
3 YEAR
CONDITIONAL
MODEL 7
TH
4 YEAR
CONDITIONAL
MODEL 8
TH
5 YEAR
CONDITIONAL
MODEL 9
TH
6 YEAR
CONDITIONAL
RETENTION
RETENTION
RETENTION
RETENTION
(0.0302)
(0.0282)
(0.0549)
0.0173
0.0452***
0.036
(0.0132)
(0.0114)
(0.0266)
0.0641***
0.0355*
(0.0182)
(0.0182)
0.0407***
0.0303***
0.0294***
0.0301***
(0.0012)
(0.0011)
(0.0026)
(0.0057)
0.4165***
1.0009***
0.5613***
0.5099***
(0.0588)
(0.1801)
(0.0917)
(0.1067)
Number of Observations
3,904
2,538
841
173
R-squared
0.3284
0.278
0.1793
0.1809
MATH 112 in the previous year: Passed
Athlete in the previous year
13
14
Credits attempted in the last spring semester
Credits earned in the last fall semester
-0.0061**
(0.0028)
Constant
Coefficients estimated using Ordinary Least Squares with a linear regression model, with standard errors in
parenthesis. Statistical significance indicator using asterisk next to the coefficients, with * = significant at 10%, ** = significant
at 5%, and *** = significant at 1%.
REGRESSION RESULTS: SET 3 MODELS
Finally, we present the results to the regression models that estimate the variations in
students’ four-year and five-year graduation rates.
MODEL 10 (GRADUATION WITHIN FOUR YEARS)
14
In contrast to the relationship of age with retention, older students are more likely
to graduate within four years than younger students.
Black students are less likely to graduate within four years than their white
classmates.
Students in the top 25 percent of their high school class are more likely to graduate
within four years than other students. Once again, outside of the top 25 percent, the
lower a student ranks, the less likely they are to graduate within four years.
First year GPA is positively associated with the likelihood of graduating. In fact, a
one-point increase in GPA (e.g., 2.0 to 3.0) is associated with a 15.21 percent
increase in the likelihood of graduating within four years.
Total SAT score did not have a significant effect on graduation. However, individual
tests did have such an effect. More specifically, SAT Math and SAT Writing scores
Reference category is not an athlete
© 2013 Hanover Research | Academy Administration Practice
26
Hanover Research | April 2013
had a positive relationship (a 100-point increase in Math is associated with a 2.34
percent increase in likelihood of graduation, while a 100-point increase in Writing is
expected to yield a 3.96 percent increase in the likelihood of graduation). By
contrast, a 100-point increase in SAT Reading scores is associated with a 6.25
percent decrease in the probability of graduating in four years.
MODEL 11 (GRADUATION WITHIN FIVE YEARS)
Female students are 10.13 percent more likely to graduate within five years than
male students. Notably, this was the only model in which gender was found to have
a significant effect on graduation (or retention).
Black students are less likely to graduate within five years than white students.
First year GPA of students has a positive impact on the probability of five-year
graduation. A one-point increase in first GPA is associated with an 8.27 percent
increase in the likelihood of graduating within five years.
SAT Reading scores have a negative relationship with the likelihood of graduating
within five years. A 100-point increase in SAT Reading scores is associated with a
4.71 percent decrease in the probability of graduation.
Athletes are 8.6 percent more likely to graduate within five years than non-athletes.
Students in the top half of their high school class are more likely to graduate within
five years compared to students in the bottom 50 percent of their class. Outside of
the top 50 percent, the lower a student’s high school rank, the lower their likelihood
of graduating within five years.
Figure 2.3: Regression Output Results - Set 3
MODEL 10
MODEL 11
GRADUATION WITHIN 4 YEARS
GRADUATION WITHIN 5 YEARS
Female
0.1013***
(0.0332)
Age
-1.4003***
(0.377)
Age squared
0.0388***
(0.0101)
Black
15
Other or two or more races
15
High School Rank (10%-25%)
15
16
16
-0.1153***
-0.1331*
(0.0316)
(0.0699)
-0.0445
-0.0582
(0.0491)
(0.1141)
-0.0507
-0.0794
Reference category for ethnicity is white
Reference category for high school rank is 0% to 10%
© 2013 Hanover Research | Academy Administration Practice
27
Hanover Research | April 2013
High School Rank (25%-50%)
16
High School Rank (50%-75%)
16
High School Rank (75%-100%)
SAT Math
16
MODEL 10
MODEL 11
GRADUATION WITHIN 4 YEARS
GRADUATION WITHIN 5 YEARS
(0.0328)
(0.0659)
-0.1218***
-0.0945
(0.0317)
(0.064)
-0.1872***
-0.168**
(0.0347)
(0.0708)
-0.2362***
-0.2406***
(0.0414)
(0.0864)
0.0234*
(0.0125)
SAT Read
SAT Write
-0.0625***
-0.0471**
(0.0154)
(0.0219)
0.0396**
(0.0153)
First GPA
0.1521***
0.0827***
(0.009)
(0.0258)
Athlete
0.086**
(0.0371)
Constant
12.7073***
0.853***
(3.5365)
(0.15)
Number of Observations
2,839
604
R-squared
0.2112
0.1118
Coefficients estimated using Ordinary Least Squares with a linear regression model, with standard errors in
parenthesis. Statistical significance indicator using asterisk next to the coefficients, with * = significant at 10%, ** =
significant at 5%, and *** = significant at 1%.
© 2013 Hanover Research | Academy Administration Practice
28
Hanover Research | April 2013
APPENDIX A: RETENTION R ATES
Figure A.1: Retention Rates by Gender
80%
72%
67%
61%
60%
55%
54%
40%
50%
32%
31%
20%
9%
8%
0%
Female
2nd year retention
Male
3rd year retention
4th year retention
5th year retention
6th year retention
Figure A.2: Conditional Retention by Gender
100%
90%
83%
90%
85%
81%
79%
80%
61%
53%
60%
40%
20%
0%
Female
Male
3rd year conditional retention
4th year conditional retention
5th year conditional retention
6th year conditional retention
© 2013 Hanover Research | Academy Administration Practice
29
Hanover Research | April 2013
Figure A.3: Retention Rate by Ethnicity
80%
60%
71%
61%
60%
57%
55%
48%
40%
40%
41%
36%
33%
22%
21%
20%
10%
8%
9%
0%
Black
2nd year retention
Other or two or more races
3rd year retention
4th year retention
White
5th year retention
6th year retention
Figure A.4: Conditional Retention Rate by Ethnicity
100%
80%
65%
91%
89%
82%
77%
71%
83%
80%
85%
67%
59%
60%
56%
40%
20%
0%
Black
Other or two or more races
White
3rd year conditional retention
4th year conditional retention
5th year conditional retention
6th year conditional retention
© 2013 Hanover Research | Academy Administration Practice
30
Hanover Research | April 2013
Figure A.5: Retention Rate by High School Class Rank
100%
83%
80%
81%
74%
74%
72%
69%
68%
63%
58%
60%
36%
40%
63%
55%
51%
44%
41%
38%
34%
36%
29%
22%
20%
15%
9%
9%
5%
5%
0%
0% to 10%
2nd year retention
10% to 25%
3rd year retention
25% to 50%
4th year retention
50% to 75%
5th year retention
75% to 100%
6th year retention
Figure A.6: Conditional Retention Rate by High School Class Rank
100%
94%
91%
86%
80%
95%
87%
87%
91%
84%
85%
84%
82%
76%
74%
67%
86%
65%
75%
62%
66%
60%
40%
23%
20%
0%
0% to 10%
10% to 25%
25% to 50%
50% to 75%
75% to 100%
3rd year conditional retention
4th year conditional retention
5th year conditional retention
6th year conditional retention
© 2013 Hanover Research | Academy Administration Practice
31
Hanover Research | April 2013
APPENDIX B: GRADUATION RATES
Figure B.1: Graduation Rate by Gender
80%
62%
60%
40%
52%
37%
29%
20%
0%
Female
Male
Four year graduation rate
Five year graduation rate
Figure B.2: Graduation Rate by Ethnicity
80%
37%
40%
20%
59%
57%
60%
35%
33%
15%
0%
Black
Other or two or more
races
Four year graduation rate
© 2013 Hanover Research | Academy Administration Practice
White
Five year graduation rate
32
Hanover Research | April 2013
Figure B.3: Graduation Rate by High School Class Rank
100%
81%
75%
80%
63%
61%
60%
50%
45%
37%
40%
28%
22%
20%
12%
0%
0% to 10%
10% to 25%
25% to 50%
Four year graduation rate
© 2013 Hanover Research | Academy Administration Practice
50% to 75%
75% to 100%
Five year graduation rate
33
Hanover Research | April 2013
PROJECT EVALUATION FORM
Hanover Research is committed to providing a work product that meets or exceeds partner
expectations. In keeping with that goal, we would like to hear your opinions regarding our
reports. Feedback is critically important and serves as the strongest mechanism by which we
tailor our research to your organization. When you have had a chance to evaluate this
report, please take a moment to fill out the following questionnaire.
http://www.hanoverresearch.com/evaluation/index.php
CAVEAT
The publisher and authors have used their best efforts in preparing this brief. The publisher
and authors make no representations or warranties with respect to the accuracy or
completeness of the contents of this brief and specifically disclaim any implied warranties of
fitness for a particular purpose. There are no warranties which extend beyond the
descriptions contained in this paragraph. No warranty may be created or extended by
representatives of Hanover Research or its marketing materials. The accuracy and
completeness of the information provided herein and the opinions stated herein are not
guaranteed or warranted to produce any particular results, and the advice and strategies
contained herein may not be suitable for every partner. Neither the publisher nor the
authors shall be liable for any loss of profit or any other commercial damages, including but
not limited to special, incidental, consequential, or other damages. Moreover, Hanover
Research is not engaged in rendering legal, accounting, or other professional services.
Partners requiring such services are advised to consult an appropriate professional.
© 2013 Hanover Research | Academy Administration Practice
34
Hanover Research | April 2013
1750 H Street NW, 2nd Floor
Washington, DC 20006
P 202.756.2971 F 866.808.6585
www.hanoverresearch.com
© 2013 Hanover Research | Academy Administration Practice
35
Analysis
Prepared for Clarion University of Pennsylvania
April 2013
In the following report, Hanover Research analyzes the retention and graduation rates of
students entering Clarion University of Pennsylvania between 2006 and 2011. More
specifically, we investigate various academic, institutional, and demographic factors that
significantly influence retention and graduation rates.
© 2013 Hanover Research | District Administration Practice
1
Hanover Research | April 2013
TABLE OF CONTENTS
Executive Summary and Key Findings ............................................................................... 3
Introduction ...........................................................................................................................3
Discussion ..............................................................................................................................3
Structure of the Report..........................................................................................................4
Key Findings ...........................................................................................................................4
Factors Promoting Retention.............................................................................................4
Factors Promoting Retention for One Additional Year......................................................5
Factors Correlated with Graduation in Four Years ............................................................5
Factors Correlated with Graduation in Five Years .............................................................6
Section I: Methodology .................................................................................................... 7
Regression Analysis................................................................................................................7
Dependent (Outcome) Variable ........................................................................................8
Independent (Explanatory) Variables ..............................................................................12
Section II: Regression Analysis ........................................................................................ 19
Regression Results: Set 1 Models ........................................................................................19
Key Findings Across Set 1 Models....................................................................................20
Model 1 (2nd Year Retention) ...........................................................................................20
Model 2 and 3 (3rd and 4th Year Retention) .....................................................................20
Model 4 (5th Year Retention) ...........................................................................................21
Model 5 (6th Year Retention) ...........................................................................................21
Regression Results: Set 2 Models ........................................................................................23
Key Findings Across Set 2 Models....................................................................................23
Model 6 (3rd Year Conditional Retention)........................................................................23
Model 7 (4th Year Conditional Retention)........................................................................24
Model 8 (5th Year Conditional Retention)........................................................................24
Model 9 (6th Year Conditional Retention)........................................................................24
Regression Results: Set 3 Models ........................................................................................26
Model 10 (Graduation within Four Years) .......................................................................26
Model 11 (Graduation within Five Years) ........................................................................27
Appendix A: Retention Rates .......................................................................................... 29
Appendix B: Graduation Rates ........................................................................................ 32
© 2013 Hanover Research | District Administration Practice
2
Hanover Research | April 2013
EXECUTIVE S UMMARY AND KEY FINDINGS
INTRODUCTION
In the following report, Hanover Research investigates the factors contributing to bachelor’s
degree-seeking students’ retention and graduation rates at Clarion University of
Pennsylvania (Clarion). In particular, we use the data supplied by Clarion to assess which
demographic, institutional, and academic variables proved the most useful predictors of
four-year and five-year graduation and second, third, fourth, fifth, and sixth year retention
for students entering in fall 2006 through 2011.
In our analysis of Clarion’s data, we developed three sets of models. The first set analyzes
factors associated with higher retention, with a focus on factors Clarion could know before a
student begins their studies at the university. Next, the second set of models analyzes
factors that can help a student be retained for an additional year, given that they were
retained through the previous year. Lastly, the third set of models analyzes factors
associated with increased graduation rates within four years and five years of enrollment at
the university.
DISCUSSION
In terms of the most notable outcomes of our analysis, we find that a student’s high school
ranking is a strong predictor of retention and graduation. A higher high school rank leads
to higher retention, and ultimately a higher likelihood of graduation within four or five years
of enrollment. This finding suggests that as Clarion seeks to boost retention and graduation
rates, the university should strive to increase its admission rates of higher ranked students.
As Clarion explained to Hanover, the university has found high school GPA to be a helpful
predictor of retention and graduation in the past. In the datasets provided to Hanover, an
extremely large number of students had missing values for the high school GPA variable
(listed as “0” in the files) and therefore high school GPA was excluded from our analysis.
Nevertheless, the finding that high school ranking (a reasonable proxy for high school GPA)
is a strong predictor of retention aligns well with Clarion’s experience.
Unfortunately, we also find that minorities, particularly black students, have a lower
likelihood of retention and graduation within four or five years at Clarion. This finding
holds true even after controlling for high school ranking, age, gender, and other observable
variables. Based on this result, Hanover recommends a review of why black students are not
being retained at higher rates at the university, with the goal of uncovering strategies to
better support these students.
Other notable takeaways from our analysis include the finding that being a student athlete
increases a student’s likelihood of retention in their third and fourth year, as well as
© 2013 Hanover Research | District Administration Practice
3
Hanover Research | April 2013
improves their chances of graduating within five years at Clarion. Perhaps unsurprisingly,
having a higher first year GPA also boosts a student’s chances of retention and graduation.
With regard to coursework, we find that if a student attempts a higher number of credits
during the spring semester, they are more likely to be retained the following year.
Additionally, if a student fails particular courses – ENG 111, MATH 050, MATH 110, and
MATH 112 – they are less likely to be retained by the university in the next year. All of these
findings offer insight into concrete predictors of retention and graduation at the university
and should assist in identifying groups of students who may need additional support while
progressing toward graduation.
STRUCTURE OF THE REPORT
The report is organized as follows: In Section I, we provide a description of the methodology
and data used to evaluate the factors affecting graduation and retention. In Section II, we
discuss and present our findings from the regression models that quantify the effects of
individual academic, institutional, and demographic factors on students’ likelihood of
retention and graduation. Further, in Appendix A and B, we present cross-tabulations of
retention and graduation rates broken down by students’ gender, ethnicity, and high school
class rank.
Before proceeding to the body of the report, below we offer a more detailed breakdown of
the key findings of our analysis.
KEY FINDINGS
FACTORS PROMOTING RETENTION
The following factors were found to be key predictors of retention:
1) Ranking in the top quarter of a high school class – For example, a student ranked in
the top 10 percent of his or her class is 19 percent more likely to be retained in their
second year at Clarion compared to a student whose ranking is within the 50-75
percent range of their high school class.1
2) Achieving higher SAT scores – While higher SAT scores were linked to an increased
likelihood of being retained, the magnitude of this effect was fairly small. For
instance, for every 100-point increase in total SAT score (Math, Reading, and Writing
combined), a student’s probability of being retained in his or her second year is
expected to increase by less than 1 percent.
Additionally, minorities, particularly black students, are found to have lower retention
rates. Holding other factors equal, our models indicate that a black student would be 12 to
19 percent less likely to be retained than a white student.
1
Note that the high school class rank measure calculates a given student’s rank as a percentage of the number of
nd
students in his or her high school class. For example, a student who is ranked 32 out of a class of 100 would
have a value of 32 percent on this measure.
© 2013 Hanover Research | District Administration Practice
4
Hanover Research | April 2013
FACTORS PROMOTING RETENTION FOR ONE ADDITIONAL YEAR
Described as “conditional retention” in our report, the following factors are associated with
an increased likelihood of being retained for an additional year if a student has completed
their previous year of study at Clarion.
1) Achieving a higher first year GPA at Clarion – A one-point increase in first year GPA
(e.g., 2.0 to 3.0) is associated with a roughly 3-4 percent increase in the likelihood of
being retained for an additional year.
2) Being a student athlete in the previous year – The chances of an athlete being
retained in the third or fourth year at Clarion is approximately 3-6 percent higher
than that of a non-athlete.
3) Attempting a higher number of credits in the previous semester – For example, a
one-credit increase in the number of credits attempted in the previous spring
semester is associated with a 4 percent increase in the likelihood of being retained
in the third year. Further, a three-credit increase would be associated with a 12
percent boost in the likelihood of being retained.
Additionally, failing the following subjects is associated with a decreased likelihood of
retention for an additional year (if the student has completed the previous year). The
estimated magnitude of this decline in likelihood of retention is 8-18 percent depending on
the course/year of retention.
1) Failing MATH 110 adversely affects third year retention.
2) Failing ENG 111 or MATH 050 adversely affects third and fourth year retention
3) Failing MATH 112 adversely affects third and fifth year retention.
FACTORS CORRELATED WITH GRADUATION IN FOUR YEARS
The following factors are associated with a higher likelihood of graduating in four years.
1) Having a higher high school ranking – For example, a student ranked in the top 10
percent of his or her high school class is roughly 12 percent more likely to graduate
in four years than a student whose ranking falls between 25-50 percent of the class.
2) Having a higher first year GPA – For every one-point increase in first year GPA, a
student’s probability of graduating within four years increases by 15 percent.
Further, being black was found to decrease a student’s likelihood of graduation in four
years by 11 percent, as compared to being white.
© 2013 Hanover Research | District Administration Practice
5
Hanover Research | April 2013
FACTORS CORRELATED WITH GRADUATION IN FIVE YEARS
The following factors are associated with a higher likelihood of graduating in five years.
1) Being female – Increases a student’s likelihood of graduating in five years by 10
percent, as compared to being a male.
2) Ranking in the top half of a high school class – For example, a student whose rank
falls between 50 and 75 percent of the high school class is 17 percent less likely to
graduate within five years than a student in the top 10 percent.
3) Being an athlete – Student athletes are nearly 9 percent more likely to graduate
within five years, as compared to non-athletes.
Finally, similar to our previous findings on retention and four-year graduation, being black
decreases the probability of a student graduating within five years by 13 percent, as
compared to being white.
© 2013 Hanover Research | District Administration Practice
6
Hanover Research | April 2013
SECTION I: METHODOLOGY
In this section, we offer an overview of the data and methodology used in our analysis.
Clarion provided Hanover with seven different datasets (a separate dataset for each year),
containing demographic, academic, and institutional information of students between the
years 2006 and 2012. More specifically, the data included the following:
Demographic data – Students’ gender, age, and race
Institutional data – Students’ starting year, graduation year, and athlete status
Academic data
o Pre-university (high school) – Students’ high school class ranking and SAT
scores
o University – Students’ first GPA, performance in specific courses (ENG 110,
ENG 111, MATH 050, MATH 110, MATH 112, and MGMT 120), number of
credits earned in each fall semester, and number of credits attempted in
each spring semester.
Our analysis is based on a master dataset that was created by combining the seven datasets
provided by Clarion. From each of the datasets, we selected students who enrolled at
Clarion in that year (either freshmen or transfer students). This provided us with a list of
9,131 unique students, who enrolled at Clarion between 2006 and 2011. For each of these
students we checked whether they appeared in the subsequent year’s dataset. Students
who appeared in the subsequent year’s dataset were marked as retained while students
who did not graduate and did not appear in the subsequent year’s dataset were marked as
not retained. Note that 299 students were dropped from our combined dataset as these
students continued to appear in the Clarion datasets even though they had been identified
as graduates of the previous year. Our final dataset available for analysis therefore contains
8,832 unique students.
REGRESSION ANALYSIS
In order to examine the impact of the various demographic, academic, and institutional
factors on retention and graduation rates, we constructed a series of linear probability
models (LPMs). Overall, we estimated 11 models grouped as Sets 1, 2, and 3, based on the
dependent (i.e., outcome) variable used.2 LPMs represent a variation of the standard
ordinary least squares (OLS) regression model used to analyze dichotomous dependent
variables. Dichotomous variables assume one of two values; in the context of the present
analysis, we assign a value of 1 to our dependent variable in cases where students meet a
specific criterion (e.g., having been retained or having graduated) and a value of 0 otherwise
(having not been retained or having not graduated).
2
As described in greater detail below, the dependent variables examined in this analysis include retention
(corresponding to Set 1), “conditional” retention (Set 2), and graduation (Set 3).
© 2013 Hanover Research | District Administration Practice
7
Hanover Research | April 2013
For each of our regression models, we provide coefficient estimates, as well as an indication
of which coefficients proved statistically significant. The coefficients of our models reveal
how much we expect the dependent variable to change when the independent variable
increases by one unit, holding all of the other independent variables in the model constant.
In the context of LPMs, the coefficients capture the change in the likelihood of retention or
graduation, depending on the model estimated. More specifically, multiplying the
coefficients by 100 percent indicates the extent to which the likelihood changes due to a
one-unit increase in the associated independent variable. For example, as seen in Figure 2.2,
a one-unit increase in first GPA leads to a 4.21 percent rise in the probability of first year
retention in Model 6.
Lastly, when presenting our results, we also include the R-squared value, which reveals the
percentage of variation in the dependent variable accounted for by the model. Using Figure
2.2 and Model 6 as an example once again, the R-squared value of 0.3284 indicates that the
model can explain 32.84 percent of the variation in retention.
DEPENDENT (OUTCOME) VARIABLE
As discussed previously, we used three types of dependent, or outcome, variables in our
regression analysis. We describe the dependent variables as follows:
Set 1 (Models 1, 2, 3, 4, and 5): The dependent variable in this set of models
indicates whether or not a student (who did not yet graduate) was retained in a
particular year of study. For instance, the second year retention of a student who
initially enrolled at Clarion in 2006 indicates whether or not the student was
retained in 2007. Similarly, the third year retention of that same student indicates
whether he or she was retained in 2008. Please note that a particular year’s
retention, as defined by this outcome variable, is independent of the students’
retention in the immediate previous year.3 Figure 1.1 below helps understand what
a particular year’s retention means for students who enrolled at Clarion between
2006 and 2011. We estimate separate models for different years of retention.
STARTING
3
Figure 1.1: Retention Years
2 YEAR
3RD YEAR
4TH YEAR
5TH YEAR
ND
6TH YEAR
YEAR
RETENTION
RETENTION
RETENTION
RETENTION
RETENTION
2006
2007
2008
2009
2007
2008
2009
2010
2008
2009
2010
2011
2009
2010
2011
2012
2010
2011
2012
--
2011
2012
---
In other words, if a student initially enrolled in 2006 but did not re-enroll in 2007, they would be marked as retained
under “second year retention.” However, if this same student re-enrolled in 2008, they would then be marked as
retained under “third year retention.” As discussed in greater detail below, the Set 2 models focus on “conditional
retention,” a measure that takes into account whether a student had been enrolled in the previous year at Clarion
in addition to whether they were enrolled in the current year.
© 2013 Hanover Research | District Administration Practice
8
Hanover Research | April 2013
STARTING
2ND YEAR
3RD YEAR
4TH YEAR
5TH YEAR
6TH YEAR
YEAR
RETENTION
RETENTION
RETENTION
RETENTION
RETENTION
2010
2011
2011
2012
2012
--
---
---
---
In Figure 1.2, we provide descriptive statistics of the dependent variables used for each
model in Set 1. For each of the student cohorts (e.g., students beginning in fall 2006,
students beginning in fall 2007, etc.), we provide the percentage of students who were
retained and not retained. The table indicates that for all cohorts, the retention rate
declines over the years. Please note that we have also provided a breakdown of retention
rates by various subgroups in Appendix A.
Figure 1.2: Distribution of Dependent Variables in Set 1 – Descriptive Statistics
STARTING
TERM
Fall 2006
Fall 2007
Fall 2008
Fall 2009
Fall 2010
Fall 2011
STATUS
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Total
2ND YEAR
3RD YEAR
4TH YEAR
5TH YEAR
6TH YEAR
RETENTION
RETENTION
RETENTION
RETENTION
RETENTION
32%
68%
1,436
29%
71%
1,437
31%
69%
1,416
31%
69%
1,582
30%
70%
1,527
31%
69%
1,418
8,816
44%
56%
1,414
39%
61%
1,421
43%
57%
1,388
41%
59%
1,561
42%
58%
1,495
---7,279
50%
50%
1,346
44%
56%
1,365
49%
51%
1,326
48%
52%
1,499
------5,536
69%
31%
1,013
67%
33%
979
71%
29%
989
---------2,981
92%
8%
792
92%
8%
744
------------1,536
Set 2 (Models 6, 7, 8, and 9): The dependent variable for the models in Set 2
assumes a value of 1 whenever a student who was retained in a particular year is
enrolled in the following year and a value of 0 whenever a student who was retained
in a particular year is not enrolled in the following year. This is described as
“conditional” retention, as being retained in a given year is conditioned on whether
a student was retained in the previous year. For instance, in the case of a student
who started in 2006, fourth year conditional retention refers to that student being
© 2013 Hanover Research | District Administration Practice
9
Hanover Research | April 2013
retained in 2009 given that he/she was also retained in 2008. The table below
provides brief clarification of what a particular year’s conditional retention indicates.
Figure 1.3: Conditional Retention
STARTING
YEAR
2006
2007
2008
2009
2010
RD
3 YEAR CONDITIONAL
4TH YEAR CONDITIONAL
5TH YEAR CONDITIONAL
6TH YEAR CONDITIONAL
RETENTION
RETENTION
RETENTION
RETENTION
Students retained in
2007, did not graduate,
and were retained
again in 2008
Students retained in
2008, did not graduate,
and were retained
again in 2009
Students retained in
2009, did not graduate,
and were retained
again in 2010
Students retained in
2010, did not graduate,
and were retained
again in 2011
Students retained in
2011, did not graduate,
and were retained
again in 2012
Students retained in
2008, did not graduate,
and were retained
again in 2009
Students retained in
2009, did not graduate,
and were retained
again in 2010
Students retained in
2010, did not graduate,
and were retained
again in 2011
Students retained in
2011, did not graduate,
and were retained
again in 2012
Students retained in
2009, did not graduate,
and were retained
again in 2010
Students retained in
2010, did not graduate,
and were retained
again in 2011
Students retained in
2011, did not graduate,
and were retained
again in 2012
Students retained in
2010, did not graduate,
and were retained
again in 2011
Students retained in
2011, did not graduate,
and were retained
again in 2012
--
--
--
--
--
--
As the above table illustrates, conditional retention only looks at the group of students who
had been retained in the previous year. For example, if a student is marked as “not
retained” in the third year, they would not be included in the calculation of fourth year
retention (only students who had been retained in the third year would be included). This
differs from the dependent variable used in the Set 1 models, where the measure of
retention is not conditioned on whether the student was enrolled in the previous year.
Figure 1.4 on the following page provides descriptive statistics of the dependent variables
used in the Set 2 models. In general, between the third year and fifth year, the conditional
retention rate increases slightly. Note that we have also provided a breakdown of the
conditional retention rates by various sub-groups in Appendix A.
© 2013 Hanover Research | District Administration Practice
10
Hanover Research | April 2013
Figure 1.4: Distribution of Dependent Variables in Set 2 - Descriptive Statistics
STARTING
YEAR
Fall 2006
Fall 2007
Fall 2008
Fall 2009
Fall 2010
STATUS
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Not Retained
Retained
Count
Total
3RD YEAR
4TH YEAR
5TH YEAR
6TH YEAR
CONDITIONAL
CONDITIONAL
CONDITIONAL
CONDITIONAL
RETENTION
RETENTION
RETENTION
RETENTION
21%
79%
956
17%
83%
1,002
19%
81%
952
18%
82%
1,074
19%
81%
1,039
5,023
11%
89%
726
9%
91%
808
10%
90%
729
10%
90%
853
---3,116
15%
85%
347
18%
82%
381
17%
83%
336
------1,064
42%
58%
96
45%
55%
95
---------191
Set 3 (Models 10 and 11): The dependent variable in this set of models indicates
whether or not a student graduated either within four years of enrollment (Model
10) or five years of enrollment (Model 11). Figure 1.5 on the following page provides
descriptive statistics of this dependent variable. As expected, the graduation rate
within five years is higher than the graduation rate within four years, as students
have had more time to complete their studies. We have also provided a breakdown
of the graduation rates by subgroups in Appendix B.
Please note that we do not have complete four-year data for students who enrolled
in 2009, 2010, and 2011. Furthermore, we also do not have five-year data for
students who enrolled in 2008. For most of these students, we do not know
whether they graduated within these years. For students in these years, we take
into account the students who are listed as graduates only. Other students are
excluded from the analysis as they are missing full four- and five-year data.
© 2013 Hanover Research | District Administration Practice
11
Hanover Research | April 2013
Figure 1.5: Distribution of Dependent Variables in Set 3 – Descriptive Statistics
STARTING
YEAR
Fall 2006
Fall 2007
Fall 2008
Fall 2009
Fall 2010
Fall 2011
STATUS
GRADUATION
WITHIN 4 YEARS
GRADUATION
WITHIN 5 YEARS
Did not graduate
Graduated
Count
Did not graduate
Graduated
Count
Did not graduate
Graduated
Count
Graduated
Count
Graduated
Count
Graduated
Count
Total
70%
30%
1,438
68%
32%
1,437
70%
30%
1,418
100%
131
100%
57
100%
11
4,492
55%
45%
1,438
52%
48%
1,437
-100%
542
100%
131
100%
57
100%
11
3,616
INDEPENDENT (EXPLANATORY) VARIABLES
We included a series of independent, or explanatory, variables in the models in order to
control for factors affecting the dependent variables of interest (retention, conditional
retention, and graduation). We provide a complete list of the independent variables used in
Figure 1.6. The table defines each independent variable, indicates the model(s) in which the
variable appears, and the variable type. Note that in general, if a variable was not found to
have a statistically significant effect on the outcome variable of interest (retention or
graduation), it was excluded from the model. For example, the “Female” variable did not
have a significant relationship with retention/graduation in Models 1-10 but did have a
significant effect on likelihood of graduation within five years. Therefore, the “Female”
variable was only included in our final version of Model 11.
Figure 1.6: Summary of Independent Variables
VARIABLES
Female
SUMMARY
Gender of students: (0) Male (1) Female
Age
Age of students
Age squared
Ethnicity
The squared value of the age of the students. This
allows us to examine whether age has a non-linear
effect on the dependent variable.
Ethnicity of students, selected from three categories:
Black, White, and “other or two or more races.”
© 2013 Hanover Research | District Administration Practice
MODEL
Model 11
Models 3, 5, 7,
and 10
VARIABLE TYPE
Categorical
Models 3 and 10
Continuous
Models 1, 2, 3, 4,
6, 8, 10, and 11
Categorical
Continuous
12
Hanover Research | April 2013
VARIABLES
High School Rank
SAT score total
SAT Math,
Reading, Writing
First GPA
ENG 111*
MATH 050*
MATH 110*
MATH 112*
SUMMARY
Student’s high school rank as a percentage of class
size. In other words, if a student was ranked 29th out
of a high school class of 100, their value for this
variable would be 29 percent. A student with a lower
ranking in the same high school class, say 45th out of
100, would have a value of 45 percent. As such,
higher values for this variable represent lower class
ranks (e.g., 45th) while lower values represent higher
class rankings (e.g., 29th).
Student’s total SAT scores (includes math, reading
and writing). We divided SAT score by 100 so that
our models show the effect of a 100 point increase in
SAT rather than a single point increase.
In cases where total SAT score was not found to a
statistically significant effect on retention or
graduation, we entered individual Math, Reading,
and/or Writing scores into the models. These scores
remained in the final models if they were found to
have a statistically significant effect on the outcome
of interest.
Student's first GPA at Clarion.
Whether student passed, failed, or did not attempt
this course.
Whether student passed, failed, or did not attempt
this course.
Whether student passed, failed, or did not attempt
this course.
Whether student passed, failed, or did not attempt
this course.
Athlete
Whether students is an athlete: (0) No (1) Yes
Credits attempted
in previous Spring
Credits earned in
previous Fall
Number of credits attempted in the previous Spring
semester
Number of credits earned in the previous Fall
semester
MODEL
VARIABLE TYPE
Models 1, 2, 3, 4,
9, 10, and 11
Categorical/Continuous
Models 1, 2, 3, 4,
6, 7, 8 , 9, and 10
Continuous
Models 5, 10,
and 11
Continuous
Models 6, 7, 8,
10, and 11
Continuous
Models 6 and 7
Categorical
Model 6 and 7
Categorical
Models 6 and 7
Categorical
Model 6, 7, and
8
Model 6, 7, and
11
Models 6, 7, 8,
and 9
Model 6
Categorical
Categorical
Continuous
Continuous
*Note that ENG 110 and MGMT 120 were excluded from our models as an extremely high percentage of students in our dataset
(90 percent or more for most cohorts) did not attempt these courses. In general, when selecting variables for inclusion in our
models, Hanover sought to use variables for which a substantial number of students had non-zero values.
Figures 1.7-1.12 present descriptive statistics for each independent variable used in the
models. While Figure 1.7 shows the distribution of students’ gender, ethnicity, and high
school ranking, Figure 1.8 and 1.9 shows descriptive statistics for some measures of
academic performance.
In our models, we used two forms of the independent variable that indicates whether a
student is an athlete. For the models in Set 2, the athlete variable refers to a student being
© 2013 Hanover Research | District Administration Practice
13
Hanover Research | April 2013
an athlete in the year immediately prior to the year in which retention is measured. For
instance, Model 7, which analyzes fourth year conditional retention, indicates the effect of a
student being an athlete in his/her third year only. By contrast, the athlete variable used in
the Set 3 models (Model 11), indicates whether the student was an athlete at any point
within the four or five years taken into account. Descriptive statistics of these alternate
forms of the athlete variable are provided in Figures 1.10 and 1.11. Finally, Figure 1.12
provides the average number of credits earned (fall) and attempted (spring) for each year of
our dataset.
© 2013 Hanover Research | District Administration Practice
14
Hanover Research | April 2013
Figure 1.7: Gender, Ethnicity, and High School Ranking – Descriptive Statistics
GENDER
FALL 2006
FALL 2007
FALL 2008
FALL 2009
FALL 2010
FALL 2011
Female
57% (n=803)
55% (n=789)
56% (n=793)
58% (n=924)
62% (n=941)
60% (n=850)
Male
43% (n=604)
45% (n=648)
44% (n=625)
42% (n=661)
38% (n=589)
40% (n=574)
Black
Other or two or more races4
6% (n=85)
3% (n=37)
7% (n=98)
3% (n=40)
9% (n=125)
4% (n=63)
9% (n=135)
4% (n=55)
9% (n=130)
5% (n=70)
9% (n=122)
6% (n=84)
White
91% (n=1281)
90% (n=1287)
87% (n=1225)
88% (n=1354)
86% (n=1276)
85% (n=1198)
0% to 10%
8% (n=108)
9% (n=111)
9% (n=114)
8% (n=104)
9% (n=117)
8% (n=98)
10% to 25%
19% (n=243)
15% (n=189)
19% (n=229)
19% (n=254)
20% (n=260)
18% (n=214)
25% to 50%
35% (n=445)
35% (n=443)
36% (n=443)
35% (n=476)
34% (n=446)
35% (n=413)
50% to 75%
27% (n=349)
29% (n=371)
27% (n=333)
26% (n=349)
26% (n=335)
25% (n=292)
75% to 100%
10% (n=132)
12% (n=156)
9% (n=109)
12% (n=162)
11% (n=145)
13% (n=157)
ETHNICITY
HIGH SCHOOL RANKING
Figure 1.8: Age, High School Rank, SAT Score, and First GPA- Descriptive Statistics
VALUES
Average of Age
Average of Rank as Percentage of Class Size
Average of SAT Total*
Average of SAT Math*
Average of SAT Reading*
Average of SAT Writing*
Average of First GPA
FALL 2006
18.93
43%
1381
462
466
455
2.44
FALL 2007
19.13
45%
1419
483
477
462
2.49
FALL 2008
19.20
42%
1412
479
474
460
2.51
FALL 2009
19.60
44%
1407
477
471
458
2.64
FALL 2010
19.70
43%
1419
478
477
464
2.66
FALL 2011
19.76
44%
1402
478
475
459
2.63
TOTAL
19.39
43%
1408
477
474
460
2.56
*Note that all SAT scores listed as “0” were recoded as missing and are not reflected in these averages.
4
Includes students who are listed as Asian, Hispanic, Native American, Pacific Islander, and from two or more races.
© 2013 Hanover Research | District Administration Practice
15
Hanover Research | April 2013
Figure 1.9: University Course Performance – Descriptive Statistics
COURSE
ENG 110
ENG 111
MATH 050
MATH 110
MATH 112
MGMT 120
VALUES
Did not attempt
Did not pass
Passed
Did not attempt
Did not pass
Passed
Did not attempt
Did not pass
Passed
Did not attempt
Did not pass
Passed
Did not attempt
Did not pass
Passed
Did not attempt
Did not pass
Passed
FALL 2006
99% (n=1429)
0% (n=)
1% (n=9)
27% (n=393)
10% (n=137)
63% (n=908)
79% (n=1135)
5% (n=75)
16% (n=228)
83% (n=1200)
4% (n=64)
12% (n=174)
76% (n=1099)
4% (n=63)
19% (n=276)
92% (n=1327)
1% (n=15)
7% (n=96)
© 2013 Hanover Research | District Administration Practice
FALL 2007
99% (n=1425)
0% (n=3)
1% (n=9)
21% (n=304)
11% (n=161)
68% (n=972)
81% (n=1165)
4% (n=62)
15% (n=210)
81% (n=1168)
4% (n=64)
14% (n=205)
77% (n=1112)
4% (n=57)
19% (n=268)
89% (n=1277)
2% (n=27)
9% (n=133)
FALL 2008
91% (n=1288)
2% (n=22)
8% (n=108)
25% (n=351)
10% (n=148)
65% (n=919)
80% (n=1141)
5% (n=64)
15% (n=213)
83% (n=1176)
5% (n=71)
12% (n=171)
76% (n=1075)
5% (n=72)
19% (n=271)
88% (n=1251)
1% (n=18)
11% (n=149)
FALL 2009
95% (n=1499)
1% (n=15)
4% (n=71)
26% (n=407)
8% (n=129)
66% (n=1049)
80% (n=1263)
4% (n=60)
17% (n=262)
87% (n=1375)
4% (n=62)
9% (n=148)
73% (n=1159)
4% (n=71)
22% (n=355)
92% (n=1458)
1% (n=13)
7% (n=114)
FALL 2010
85% (n=1304)
2% (n=37)
12% (n=189)
31% (n=481)
8% (n=120)
61% (n=929)
78% (n=1193)
5% (n=78)
17% (n=259)
89% (n=1361)
4% (n=54)
8% (n=115)
72% (n=1105)
5% (n=75)
23% (n=350)
90% (n=1375)
2% (n=31)
8% (n=124)
16
FALL 2011
84% (n=1192)
1% (n=20)
15% (n=212)
32% (n=459)
7% (n=105)
60% (n=860)
84% (n=1191)
3% (n=47)
13% (n=186)
89% (n=1263)
4% (n=51)
8% (n=110)
76% (n=1081)
4% (n=63)
20% (n=280)
90% (n=1281)
2% (n=26)
8% (n=117)
TOTAL
3,495
675
4,662
8,137
97
598
2,395
800
5,637
7,543
366
923
6,631
401
1,800
7,969
130
733
Hanover Research | April 2013
Figure 1.10: Athlete in the Year Prior to Retention – Descriptive Statistics
2006
2007
2008
2009
2010
ATHLETE STATUS
Athlete prior to 2nd year
retention
Athlete prior to 3rd year
retention
Athlete prior to 4th year
retention
Athlete prior to 5th year
retention
NO
YES
TOTAL
NO
YES
TOTAL
NO
YES
TOTAL
NO
YES
TOTAL
NO
YES
TOTAL
92%
8%
978
92%
8%
1,018
91%
9%
980
92%
8%
1,095
92%
8%
1,071
94%
6%
792
93%
7%
864
91%
9%
788
93%
7%
914
93%
7%
861
96%
4%
675
94%
6%
766
91%
9%
670
93%
7%
780
--
--
--
98%
2%
311
97%
3%
322
96%
4%
289
--
--
--
--
--
--
© 2013 Hanover Research | District Administration Practice
17
Hanover Research | April 2013
Figure 1.11: Athlete Before Graduation – Descriptive Statistics
Fall 2006
Fall 2007
Fall 2008
Fall 2009
Fall 2010
Fall 2011
Total
GRADUATION WITHIN 4 YEARS
YES
NO
TOTAL
8%
92%
1,024
8%
92%
1,063
10%
90%
1,003
8%
92%
1,134
8%
92%
1,089
---8%
92%
5,313
GRADUATION WITHIN 5 YEARS
YES
NO
TOTAL
26%
74%
322
25%
75%
342
30%
70%
323
12%
88%
760
10%
90%
863
0%
100%
1,424
11%
89%
4,034
Figure 1.12: Average Credits Earned and Attempted
2007
2008
2009
2010
2011
2012
Fall Credits Earned
14.74
14.81
14.68
14.48
14.31
14.11
Spring Credits Attempted
13.22
13.42
13.34
13.18
12.89
12.32
© 2013 Hanover Research | District Administration Practice
18
Hanover Research | April 2013
SECTION II: R EGRESSION A NALYSIS
In this section, we present the results of our regression models that help us understand the
factors that affect student retention and graduation rates at Clarion. As noted previously,
the models included in Set 1 and Set 2 examine students’ retention rates between the
second and sixth years at the university, while the models in Set 3 examine students’ fouryear and five-year graduation rates.
The results of each set of regression models are presented in Figures 2.1, 2.2., and 2.3. Each
set of models focuses on a different combination of variables. More specifically,
The models included in Set 1 (Figure 2.1) were constructed to examine the effects of
a variety of factors Clarion would be aware of prior to a student’s initial enrollment
at the university (e.g., gender, age, ethnicity, high school rank, and SAT). The
outcome variable for these Set 1 models is retention.
The Set 2 models (Figure 2.2) analyze the effects of a range of demographic, preuniversity, and university characteristics (e.g., age, gender, high school rank, SAT,
first GPA at Clarion, passage/failure of specific courses, athlete status, credits
attempted and earned) on conditional retention. As mentioned previously,
conditional retention takes into account (or is “conditioned on”) whether a student
was retained in the previous year. In other words, it only measures the retention of
students in a given year if they were also retained in the previous year.
Finally, the Set 3 models (Figure 2.3) analyze the effects of demographic, preuniversity, and university characteristics on graduation within four years and
graduation within five years.
REGRESSION RESULTS: SET 1 MODELS
We recall that Set 1, which covers Models 1 to 5, included a dependent variable showing the
proportion of students who re-enrolled at Clarion in each of the five years. We developed
separate models for the different years of retention (e.g., second year retention, third year
retention, fourth year retention, etc.) with only demographic and pre-university data
included in the models. Further, in general, the models only include variables that have a
statistically significant effect on our dependent variables5 and in the points below, we only
highlight findings that are statistically significant.
5
In the figures, statistical significance is denoted with asterisks:
*** denotes statistical significance at 1 percent – meaning that there is a less than 1 percent likelihood that the
observed effect is due to chance.
** denotes statistical significance at 5 percent – meaning that there is a less than 5 percent likelihood that the
observed effect is due to chance.
* denotes statistical significance at 10 percent – meaning that there is a less than 10 percent likelihood that the
observed effect is due to chance.
© 2013 Hanover Research | Academy Administration Practice
19
Hanover Research | April 2013
KEY FINDINGS ACROSS SET 1 MODELS
The following subsections highlight findings relative to each individual model. In terms of
broader points that can be drawn across the Set 1 models, we find that in general, the
following are associated with a higher likelihood of being retained:
Being ranked in the top quarter of a high school class
Having higher SAT scores
By contrast, minorities, and particularly black students, are less likely to be retained than
white students.
MODEL 1 (2ND YEAR RETENTION)
Relative to white students, black students and students from other racial/ethnic
backgrounds are less likely to be retained in their second year at Clarion.
Students ranked below the top 25 percent of their high school classes are less likely
to be retained in their second year.6 In fact, outside of the top 25 percent, the lower
the student’s ranking, the less likely the student is to be retained (e.g., students
whose ranking is within 50-75 percent of their high school class are less likely to be
retained than students whose ranking is within 25-50 percent of their high school
class).
Second year retention has a positive relationship with SAT scores. With every 100point increase in SAT scores, a student’s chances of being retained increase slightly
(by 0.78 percent).
MODEL 2 AND 3 (3RD AND 4TH YEAR RETENTION)
6
Black students and students from other ethnic backgrounds are less likely to be
retained in their third and fourth years than white students.
Students ranked below the top 25 percent of their high school classes are less likely
to be retained in their third and fourth years than students ranked in the top 10
percent. Similar to Model 1, outside of the top 25 percent, the lower a students’
high school rank, the less likely they are to be retained.
Note that we make this conclusion based on a combination of results. First, high school rank was separated into a
series of dummy variables where the reference category was a high school rank within the top 10 percent of a
students’ high school class. For example, the negative coefficient in Model 1 for the variable listed as “High School
Rank (25%-50%)” indicates that students whose ranking was among 25-50 percent of their high school class were
significantly less likely to be retained than students in the top 10 percent. The same could be said for students in
the 50-75 percent range and the 75-100 percent range. However, the variable measuring “High School Rank (10%25%)” was not found to a have a statistically significant effect (i.e., students in the 10-25 percent range were not
significantly less likely to be retained than students in the top 10 percent). Taken together, these results suggest
that students ranked within the top 25 percent of their high school class are more likely to be retained than
students with lower rankings.
© 2013 Hanover Research | Academy Administration Practice
20
Hanover Research | April 2013
Students with higher SAT scores are more likely to be retained in their third year at
Clarion. Once again, a 100-point increase in SAT scores increases a student’s chances
of being retained by a relatively small margin (i.e., a boost of 0.84 percent in their
likelihood of being retained in their third year). This effect was not observed with
regard to fourth year retention.
Older students are less likely to be retained in their fourth year at Clarion than
younger students. This finding did not hold true with regard to third year retention.
MODEL 4 (5TH YEAR RETENTION)
Similar to the previous models, black students and students of other ethnicities are
less likely to be retained in their fifth year than their white classmates.
Students ranked in the lower half of their high school classes (50-100 percent) are
less likely to be retained compared to students in the top half of their high school
classes.
MODEL 5 (6TH YEAR RETENTION)
Similar to Model 3 (fourth year retention), older students are less likely to be
retained in their sixth year than younger students.
While total SAT scores were not found to have an effect on sixth year retention,
higher SAT Reading scores were associated with a higher likelihood of retention and
higher SAT Writing scores were associated with a lower likelihood of retention.
Note, however, these effects appear fairly small in terms of magnitude. A 100-point
increase in Reading scores corresponds to a 3.43 percent increase in the likelihood
of being retained in the sixth year. A 100-point increase in Writing scores
corresponds to a 4.42 percent decrease in the likelihood of being retained.
© 2013 Hanover Research | Academy Administration Practice
21
Hanover Research | April 2013
Figure 2.1: Regression Output Results - Set 1
MODEL 1
ND
2 YEAR
MODEL 2
RD
3 YEAR
MODEL 3
TH
4 YEAR
MODEL 4
TH
5 YEAR
MODEL 5
TH
6 YEAR
RETENTION
RETENTION
RETENTION
RETENTION
RETENTION
Age
Age squared
-0.0584***
-0.0281*
(0.0121)
(0.017)
0.0008***
(0.0002)
Black
7
Other or two or more races
7
High School Rank (10%-25%)
8
8
High School Rank (25%-50%)
8
High School Rank (50%-75%)
High School Rank (75%-100%)
SAT Total
8
-0.1362***
-0.1864***
-0.1961***
-0.1231***
(0.022092)
(0.0261)
(0.0268)
(0.0329)
-0.0681**
-0.0711*
-0.0766*
-0.1194**
(0.032796)
(0.0409)
(0.0459)
(0.0588)
-0.0124
-0.0218
-0.0250
0.0379
(0.023772)
(0.0284)
(0.0297)
(0.0465)
-0.0718***
-0.0950***
-0.1150***
-0.0096
(0.022793)
(0.0272)
(0.0272)
(0.0424)
-0.1857***
-0.2156***
-0.2487***
-0.0722*
(0.024141)
(0.0288)
(0.0279)
(0.0425)
-0.2757***
-0.3569***
-0.3317***
-0.1442***
(0.028049)
(0.0337)
(0.0323)
(0.0464)
0.0078**
0.0084**
(0.003156)
(0.0038)
SAT Read
0.0343**
(0.0158)
SAT Write
-0.0442***
(0.0165)
Constant
0.7294***
0.6387***
1.5181***
0.3791***
0.6320**
(0.055672)
(0.0664)
(0.1580)
(0.0396)
(0.3174)
Number of observations
5,949
4,907
4,786
2,558
986
R-squared
0.05398
0.0671
0.06993
0.0211
0.009382
Coefficients estimated using Ordinary Least Squares with a linear regression model, with standard errors in
parenthesis. Statistical significance indicator using asterisk next to the coefficients, with * = significant at 10%, ** =
significant at 5%, and *** = significant at 1%.
7
8
Reference category for ethnicity is white
Reference category for high school rank is 0% to 10%
© 2013 Hanover Research | Academy Administration Practice
22
Hanover Research | April 2013
REGRESSION RESULTS: SET 2 MODELS
Figure 2.2, which includes Models 6, 7, 8, and 9, illustrates the relationship of various
factors on the conditional retention of students in their third, fourth, fifth, and sixth year at
Clarion. Once again, recall that conditional retention only looks at the retention of students
in a given year if they were retained in the previous year.
KEY FINDINGS ACROSS SET 2 MODELS
In terms of broad findings across the Set 2 models, we note that the following factors are
associated with a higher likelihood of being retained for one additional year (if the student
had been retained in the previous year):
Having a higher first year GPA
Being a student athlete in the previous year
Having attempted a higher number of credits in the previous semester
Additionally, having failed specific courses is associated with a lower likelihood of retention
for one additional year. These include ENG 111 (third and fourth year retention), MATH 050
(third and fourth year retention), MATH 110 (third year retention), and MATH 112 (third and
fifth year retention).
MODEL 6 (3RD YEAR CONDITIONAL RETENTION)
Compared to white students, black students who were retained in their second year
at Clarion are less likely to be retained in their third year at Clarion.
SAT score was found to have a negative relationship with conditional retention.
However, similar to our other findings surrounding SATs, the effect appears small. A
100-point increase in SAT score corresponds to a 1.05 percent decrease in the
likelihood of third year retention.
Students with higher first year GPAs at Clarion are more likely to be retained in their
third year than students with lower GPAs.
Students who failed ENG 111, MATH 050, MATH 110, and MATH 112 in the previous
year are less likely to be retained than students who did not attempt these courses.
Students who were listed as athletes in their second year at Clarion are more likely
to be retained in their third year than non-athletes.
Attempting more credits during the previous spring semester increases a student’s
likelihood of being retained in the third year. A one-credit increase is associated with
a 4.07 percent boost in the likelihood of retention (so a three-credit increase would
translate to a 12 percent boost). By contrast, the number of credits earned in the
previous fall semester was found to have a negative impact on likelihood of
retention. However, this effect was small (a one-credit increase in the number of
credits earned in the fall was associated with a 0.61 percent decrease in the
© 2013 Hanover Research | Academy Administration Practice
23
Hanover Research | April 2013
likelihood of retention), and this finding did not hold for retention in the fourth,
fifth, and sixth years (Models 7-9).
MODEL 7 (4TH YEAR CONDITIONAL RETENTION)
Among students who were retained in their third year, older students are less likely
to be retained in their fourth year than younger students.
SAT score also has a negative effect on the likelihood of retention, though once
again, the magnitude of this effect appears small in practical terms. A 100-point
increase in SAT scores is associated with a 0.63 percent decrease in the likelihood a
student will be retained in their fourth year.
Students with higher first year GPAs are more likely to be retained in their fourth
year.
Compared to students who did not attempt ENG 111 and MATH 050, students who
failed these courses have a lower chance of being retained in their fourth year.
Students who passed MATH 110 and MATH 112 in the previous year are expected to
have a higher retention rate than students who did not attempt these courses.
Students who were listed as athletes in their third year are more likely to be
retained in their fourth year.
Students who attempted more credits in the previous spring semester are more
likely to be retained in their fourth year.
MODEL 8 (5TH YEAR CONDITIONAL RETENTION)
Black students are less likely to be retained in their fifth year than white students.
First year GPA has a positive impact on students’ fifth year conditional retention. As
a student’s first GPA increases, he or she is more likely to be retained in the fifth
year.
Students who failed MATH 112 are less likely to be retained in their fifth year than
students who did not attempt the course.
Students who attempted more credits in the previous spring semester are more
likely to be retained in their fifth year.
Similar to Model 6, higher SAT scores are associated with a lower likelihood of being
retained in the fifth year. A 100-point increase in SATs is associated with a 1.57
percent decrease in the likelihood of retention at this stage.
MODEL 9 (6TH YEAR CONDITIONAL RETENTION)
An increase in high school rank as a percentage of class size is associated with a
decrease in the likelihood that a student would be retained in the sixth year. In
other words, lower ranked students (e.g., a student ranked 75th out of 100) are less
© 2013 Hanover Research | Academy Administration Practice
24
Hanover Research | April 2013
likely to be retained in the sixth year than students with higher ranks (e.g., a student
ranked at 25th out of 100).
Number of credits attempted in the previous spring semester has a positive impact
on students’ sixth year retention. Students who attempted more credits in the
previous spring are more likely to be retained in their sixth year.
Figure 2.2: Regression Output Results - Set 2
MODEL 6
RD
3 YEAR
CONDITIONAL
MODEL 7
TH
4 YEAR
CONDITIONAL
MODEL 8
TH
5 YEAR
CONDITIONAL
MODEL 9
TH
6 YEAR
CONDITIONAL
RETENTION
RETENTION
RETENTION
RETENTION
Age
-0.0286***
(0.0091)
Black
9
Other of two or more races
9
SAT
-0.0595***
-0.1337***
(0.021)
(0.044)
-0.0140
-0.0624
(0.0301)
(0.073)
-0.0105***
-0.0063**
-0.0157**
(0.003)
(0.003)
(0.0061)
High School Rank as percentage of class size
-0.4786***
(0.1566)
First GPA
ENG 111 in the previous year: Failed
10
ENG 111 in the previous year: Passed
10
MATH 050 in the previous year: Failed
11
MATH 050 in the previous year: Passed
MATH 110 in the previous year: Failed
12
MATH 110 in the previous year: Passed
MATH 112 in the previous year: Failed
11
12
13
0.0421***
0.0281***
0.0461***
(0.0082)
(0.0077)
(0.0162)
-0.1564***
-0.1074***
(0.0289)
(0.0331)
-0.0184
0.0089
(0.0168)
(0.0197)
-0.1854***
-0.1779***
(0.0364)
(0.0397)
-0.0151
-0.0059
(0.0145)
(0.0136)
-0.1475***
-0.0367
(0.0283)
(0.0259)
-0.0186
0.0353***
(0.0155)
(0.0135)
-0.0837***
-0.0256
-0.1296**
9
Reference category for ethnicity is white
Reference category for ENG 111 is did not attempt
11
Reference category for MATH 050 is did not attempt
12
Reference category for MATH 110 is did not attempt
13
Reference category for MATH 112 is did not attempt
10
© 2013 Hanover Research | Academy Administration Practice
25
Hanover Research | April 2013
MODEL 6
RD
3 YEAR
CONDITIONAL
MODEL 7
TH
4 YEAR
CONDITIONAL
MODEL 8
TH
5 YEAR
CONDITIONAL
MODEL 9
TH
6 YEAR
CONDITIONAL
RETENTION
RETENTION
RETENTION
RETENTION
(0.0302)
(0.0282)
(0.0549)
0.0173
0.0452***
0.036
(0.0132)
(0.0114)
(0.0266)
0.0641***
0.0355*
(0.0182)
(0.0182)
0.0407***
0.0303***
0.0294***
0.0301***
(0.0012)
(0.0011)
(0.0026)
(0.0057)
0.4165***
1.0009***
0.5613***
0.5099***
(0.0588)
(0.1801)
(0.0917)
(0.1067)
Number of Observations
3,904
2,538
841
173
R-squared
0.3284
0.278
0.1793
0.1809
MATH 112 in the previous year: Passed
Athlete in the previous year
13
14
Credits attempted in the last spring semester
Credits earned in the last fall semester
-0.0061**
(0.0028)
Constant
Coefficients estimated using Ordinary Least Squares with a linear regression model, with standard errors in
parenthesis. Statistical significance indicator using asterisk next to the coefficients, with * = significant at 10%, ** = significant
at 5%, and *** = significant at 1%.
REGRESSION RESULTS: SET 3 MODELS
Finally, we present the results to the regression models that estimate the variations in
students’ four-year and five-year graduation rates.
MODEL 10 (GRADUATION WITHIN FOUR YEARS)
14
In contrast to the relationship of age with retention, older students are more likely
to graduate within four years than younger students.
Black students are less likely to graduate within four years than their white
classmates.
Students in the top 25 percent of their high school class are more likely to graduate
within four years than other students. Once again, outside of the top 25 percent, the
lower a student ranks, the less likely they are to graduate within four years.
First year GPA is positively associated with the likelihood of graduating. In fact, a
one-point increase in GPA (e.g., 2.0 to 3.0) is associated with a 15.21 percent
increase in the likelihood of graduating within four years.
Total SAT score did not have a significant effect on graduation. However, individual
tests did have such an effect. More specifically, SAT Math and SAT Writing scores
Reference category is not an athlete
© 2013 Hanover Research | Academy Administration Practice
26
Hanover Research | April 2013
had a positive relationship (a 100-point increase in Math is associated with a 2.34
percent increase in likelihood of graduation, while a 100-point increase in Writing is
expected to yield a 3.96 percent increase in the likelihood of graduation). By
contrast, a 100-point increase in SAT Reading scores is associated with a 6.25
percent decrease in the probability of graduating in four years.
MODEL 11 (GRADUATION WITHIN FIVE YEARS)
Female students are 10.13 percent more likely to graduate within five years than
male students. Notably, this was the only model in which gender was found to have
a significant effect on graduation (or retention).
Black students are less likely to graduate within five years than white students.
First year GPA of students has a positive impact on the probability of five-year
graduation. A one-point increase in first GPA is associated with an 8.27 percent
increase in the likelihood of graduating within five years.
SAT Reading scores have a negative relationship with the likelihood of graduating
within five years. A 100-point increase in SAT Reading scores is associated with a
4.71 percent decrease in the probability of graduation.
Athletes are 8.6 percent more likely to graduate within five years than non-athletes.
Students in the top half of their high school class are more likely to graduate within
five years compared to students in the bottom 50 percent of their class. Outside of
the top 50 percent, the lower a student’s high school rank, the lower their likelihood
of graduating within five years.
Figure 2.3: Regression Output Results - Set 3
MODEL 10
MODEL 11
GRADUATION WITHIN 4 YEARS
GRADUATION WITHIN 5 YEARS
Female
0.1013***
(0.0332)
Age
-1.4003***
(0.377)
Age squared
0.0388***
(0.0101)
Black
15
Other or two or more races
15
High School Rank (10%-25%)
15
16
16
-0.1153***
-0.1331*
(0.0316)
(0.0699)
-0.0445
-0.0582
(0.0491)
(0.1141)
-0.0507
-0.0794
Reference category for ethnicity is white
Reference category for high school rank is 0% to 10%
© 2013 Hanover Research | Academy Administration Practice
27
Hanover Research | April 2013
High School Rank (25%-50%)
16
High School Rank (50%-75%)
16
High School Rank (75%-100%)
SAT Math
16
MODEL 10
MODEL 11
GRADUATION WITHIN 4 YEARS
GRADUATION WITHIN 5 YEARS
(0.0328)
(0.0659)
-0.1218***
-0.0945
(0.0317)
(0.064)
-0.1872***
-0.168**
(0.0347)
(0.0708)
-0.2362***
-0.2406***
(0.0414)
(0.0864)
0.0234*
(0.0125)
SAT Read
SAT Write
-0.0625***
-0.0471**
(0.0154)
(0.0219)
0.0396**
(0.0153)
First GPA
0.1521***
0.0827***
(0.009)
(0.0258)
Athlete
0.086**
(0.0371)
Constant
12.7073***
0.853***
(3.5365)
(0.15)
Number of Observations
2,839
604
R-squared
0.2112
0.1118
Coefficients estimated using Ordinary Least Squares with a linear regression model, with standard errors in
parenthesis. Statistical significance indicator using asterisk next to the coefficients, with * = significant at 10%, ** =
significant at 5%, and *** = significant at 1%.
© 2013 Hanover Research | Academy Administration Practice
28
Hanover Research | April 2013
APPENDIX A: RETENTION R ATES
Figure A.1: Retention Rates by Gender
80%
72%
67%
61%
60%
55%
54%
40%
50%
32%
31%
20%
9%
8%
0%
Female
2nd year retention
Male
3rd year retention
4th year retention
5th year retention
6th year retention
Figure A.2: Conditional Retention by Gender
100%
90%
83%
90%
85%
81%
79%
80%
61%
53%
60%
40%
20%
0%
Female
Male
3rd year conditional retention
4th year conditional retention
5th year conditional retention
6th year conditional retention
© 2013 Hanover Research | Academy Administration Practice
29
Hanover Research | April 2013
Figure A.3: Retention Rate by Ethnicity
80%
60%
71%
61%
60%
57%
55%
48%
40%
40%
41%
36%
33%
22%
21%
20%
10%
8%
9%
0%
Black
2nd year retention
Other or two or more races
3rd year retention
4th year retention
White
5th year retention
6th year retention
Figure A.4: Conditional Retention Rate by Ethnicity
100%
80%
65%
91%
89%
82%
77%
71%
83%
80%
85%
67%
59%
60%
56%
40%
20%
0%
Black
Other or two or more races
White
3rd year conditional retention
4th year conditional retention
5th year conditional retention
6th year conditional retention
© 2013 Hanover Research | Academy Administration Practice
30
Hanover Research | April 2013
Figure A.5: Retention Rate by High School Class Rank
100%
83%
80%
81%
74%
74%
72%
69%
68%
63%
58%
60%
36%
40%
63%
55%
51%
44%
41%
38%
34%
36%
29%
22%
20%
15%
9%
9%
5%
5%
0%
0% to 10%
2nd year retention
10% to 25%
3rd year retention
25% to 50%
4th year retention
50% to 75%
5th year retention
75% to 100%
6th year retention
Figure A.6: Conditional Retention Rate by High School Class Rank
100%
94%
91%
86%
80%
95%
87%
87%
91%
84%
85%
84%
82%
76%
74%
67%
86%
65%
75%
62%
66%
60%
40%
23%
20%
0%
0% to 10%
10% to 25%
25% to 50%
50% to 75%
75% to 100%
3rd year conditional retention
4th year conditional retention
5th year conditional retention
6th year conditional retention
© 2013 Hanover Research | Academy Administration Practice
31
Hanover Research | April 2013
APPENDIX B: GRADUATION RATES
Figure B.1: Graduation Rate by Gender
80%
62%
60%
40%
52%
37%
29%
20%
0%
Female
Male
Four year graduation rate
Five year graduation rate
Figure B.2: Graduation Rate by Ethnicity
80%
37%
40%
20%
59%
57%
60%
35%
33%
15%
0%
Black
Other or two or more
races
Four year graduation rate
© 2013 Hanover Research | Academy Administration Practice
White
Five year graduation rate
32
Hanover Research | April 2013
Figure B.3: Graduation Rate by High School Class Rank
100%
81%
75%
80%
63%
61%
60%
50%
45%
37%
40%
28%
22%
20%
12%
0%
0% to 10%
10% to 25%
25% to 50%
Four year graduation rate
© 2013 Hanover Research | Academy Administration Practice
50% to 75%
75% to 100%
Five year graduation rate
33
Hanover Research | April 2013
PROJECT EVALUATION FORM
Hanover Research is committed to providing a work product that meets or exceeds partner
expectations. In keeping with that goal, we would like to hear your opinions regarding our
reports. Feedback is critically important and serves as the strongest mechanism by which we
tailor our research to your organization. When you have had a chance to evaluate this
report, please take a moment to fill out the following questionnaire.
http://www.hanoverresearch.com/evaluation/index.php
CAVEAT
The publisher and authors have used their best efforts in preparing this brief. The publisher
and authors make no representations or warranties with respect to the accuracy or
completeness of the contents of this brief and specifically disclaim any implied warranties of
fitness for a particular purpose. There are no warranties which extend beyond the
descriptions contained in this paragraph. No warranty may be created or extended by
representatives of Hanover Research or its marketing materials. The accuracy and
completeness of the information provided herein and the opinions stated herein are not
guaranteed or warranted to produce any particular results, and the advice and strategies
contained herein may not be suitable for every partner. Neither the publisher nor the
authors shall be liable for any loss of profit or any other commercial damages, including but
not limited to special, incidental, consequential, or other damages. Moreover, Hanover
Research is not engaged in rendering legal, accounting, or other professional services.
Partners requiring such services are advised to consult an appropriate professional.
© 2013 Hanover Research | Academy Administration Practice
34
Hanover Research | April 2013
1750 H Street NW, 2nd Floor
Washington, DC 20006
P 202.756.2971 F 866.808.6585
www.hanoverresearch.com
© 2013 Hanover Research | Academy Administration Practice
35