The College Scorecard, put out by the US Department of Education, is the definitive source for statistics on US colleges and universities. As a math professor I often heard that more and more students were majoring in math and computer science. And yet, personally I often felt that the most important and useful majors were related to environmental science and conservation.
The question for this week is: Between the 2000-2001 and 2014-2015 academic years, how did enrollment in math-related majors change versus environment-related majors?
The GitHub repo for this report has the College Scorecard data for the 2000-2001 and 2014-2015 academic years. In 2000 there were 6478 schools measured, and in 2014 there were 7703. The repo also has a data dictionary that explains all the 1743 variables that were measured.
For each school that has data for 2000-2001 and 2014-2015, we want to look at those years’ percentages of degrees awarded in various areas. For this purpose, we have the following variables from the Scorecard.
Variable | Percentage of degrees awarded in… |
---|---|
PCIP27 | Mathematics and Statistics |
PCIP11 | Computer and Information Sciences and Support Services |
PCIP03 | Natural Resources and Conservation |
PCIP31 | Parks, Recreation, Leisure, and Fitness Studies |
In the following sections, I’ll look at each of these academic areas and how their stats have changed from 2000 to 2014.
Of the 5062 schools for which we have 2000 and 2014 data, there are 3537 with a zero percentage of math and statistics degrees awarded in 2000. So there is probably no math or statistics program at these schools. For the 2014 academic year, there are 3456 schools with zero percentage. We might infer that 81 schools started math/statistics degree programs between 2000 and 2014.
For those 1525 schools in 2000-2001 that had a non-zero percentage of degrees awarded in mathematics and statistics, the summary statistics for this percentage are the following.
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0.060 0.520 0.940 1.238 1.602 9.600 265
For those 1606 schools in 2014-2015 with a non-zero percentage of math and statistics degrees, we have the following.
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0.060 0.590 1.030 1.367 1.720 11.990 213
We can create a variable PCIP27_CHANGE that measures the change in the percentage of math and statistics degrees between 2000 and 2014 academic years for each school. Here are the summary stats and histogram for that variable, removing the schools with no program at all.
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## -7.6100 -0.2300 0.2600 0.3129 0.7600 6.3000 216
These all indicate that more students are getting degrees in math and statistics. Here is a list of the schools with the highest increase, as well as their 2000 percentage.
## NAME PCIP27_2000 PCIP27_CHANGE
## 1 Ohlone College 0.00 6.30
## 2 Bryn Mawr College 5.81 6.18
## 3 Bryant University 0.00 5.59
## 4 The National Hispanic University 0.00 5.56
## 5 Swarthmore College 1.05 4.97
## 6 Allen University 0.00 4.94
## 7 Reed College 1.42 4.81
## 8 Wabash College 2.79 4.77
## 9 California Institute of Technology 5.94 4.72
## 10 Philander Smith College 0.00 4.63
## 11 Agnes Scott College 1.93 4.61
## 12 Virginia Military Institute 0.38 4.46
## 13 Ottawa University-Kansas City 0.00 4.40
## 14 Brown University 1.87 4.34
## 15 Sweet Briar College 2.59 4.31
Some of the increase is driven by the schools that started math/stats programs between 2000 and 2014, but since there are only 81 such schools, the increase is pervasive.
We can do the same analysis for the percentage of degrees awarded in “Computer and Information Sciences and Support Services”.
Of the 5062 schools with data from both years, 2580 of them had CS programs in 2000-2001, and 2759 had CS programs in 2014-2015 (as judged by a non-zero PCIP11 value).
The summary statistics for the 2000 and 2014 values, among schools with CS programs, are the following.
summary(d$PCIP11_2000[d$PCIP11_2000>0])
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0.080 1.880 3.780 7.738 7.740 100.000 265
summary(d$PCIP11_2014[d$PCIP11_2014>0])
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0.030 1.440 2.695 5.249 5.230 100.000 213
This is surprising, because my hunch would be that more students are getting computer science-related degrees. The Scorecard doesn’t have any other category that would better fit technology-related degrees than PCIP11, so I don’t know why these percentages went down. This trend is confirmed if we look at each school’s change from 2000 to 2014. The variable PCIP11_CHANGE measures each schools change in percentage of CS-related degrees awarded. The summary stats and histogram for this variable are the following.
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## -88.4400 -2.7900 -0.4100 -0.6799 1.5600 100.0000 223
It is interesting that some schools report significant decreases, of 30-50%, all the way to 88%. These may be tech-centered schools that tanked after the dot com bubble burst in the late 1990s.
Of the 5062 schools with data from both years, 795 of them had “Natural Resources and Conservation” programs in 2000-2001, and 1069 had such programs in 2014-2015 (as judged by a non-zero PCIP03 value).
The summary statistics for the 2000 and 2014 values, among schools with these programs, are the following.
summary(d$PCIP03_2000[d$PCIP03_2000>0])
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0.0200 0.5925 1.4850 3.1770 3.1800 100.0000 265
summary(d$PCIP03_2014[d$PCIP03_2014>0])
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0.01 0.48 1.12 2.12 2.26 61.81 213
The variable PCIP03_CHANGE measures each schools change in percentage of conservation-related degrees awarded. The summary stats and histogram for this variable are the following.
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## -13.3300 -0.0800 0.3900 0.6075 1.2620 25.1400 216
The positive increase is driven by the 274 new programs in 2014. But as a percentage of all degrees awarded, we see above that there are relatively fewer in 2014 than in 2000.
Of the 5062 schools with data from both years, 976 of them had “Parks, Recreation, Leisure, and Fitness Studies” programs in 2000-2001, and 1400 had such programs in 2014-2015 (as judged by a non-zero PCIP31 value).
The summary statistics for the 2000 and 2014 values, among schools with these programs, are the following.
summary(d$PCIP31_2000[d$PCIP31_2000>0])
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0.020 1.010 2.390 3.099 4.215 26.230 265
summary(d$PCIP31_2014[d$PCIP31_2014>0])
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0.020 1.170 3.280 4.654 5.870 100.000 213
The variable PCIP31_CHANGE measures each schools change in percentage of outdoor recreation-related degrees awarded. The summary stats and histogram for this variable are the following.
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## -20.670 0.300 1.470 2.446 3.728 64.200 218
All in all, this seems to be a popular and growing academic area. Now I’m not so sure this is a good thing.