PRIISM Seminar
April 3rd, 2024
Magdalena Bennett
The University of Texas at Austin
Christopher Neilson
Yale University
Nicolás Rojas
Columbia University
Multiple issues with the use of standardized tests:
Teaching to the test/explicit cheating.
Correlation with SES.
Non-representative patterns of attendance.
Prior literature related to student exclusion:
Prior literature related to student exclusion:
Some studies analyzing the effect of attendance manipulation in Chile
Prior literature related to student exclusion:
Some studies analyzing the effect of attendance manipulation in Chile
Schools have incentives to game the system
Attendance Patterns
Event study approach:
How do these exclusions patterns look like? Are these the same for every (type of) school and every grade?
Focus beyond bottom performers
Robustness checks for alternative mechanisms
Attendance Patterns
Event study approach:
How do these exclusions patterns look like? Are these the same for every (type of) school and every grade?
Focus beyond bottom performers
Robustness checks for alternative mechanisms
What can be done?
Machine learning prediction:
Identification of schools that are most likely gaming the system
Consequences of blanket policies in imputation of scores
The Chilean Educational Context
Chile has a universal voucher system (school choice)
Universal standardized testing since 1980's (SIMCE)
Chile has a universal voucher system (school choice)
Universal standardized testing since 1980's (SIMCE)
SIMCE as a high-stake test:
Results widely available in a universal voucher system
Tied to teachers' bonuses
Tied to budget restrictions and school closures
Use of pre-filled communication for parents to be sent out by schools
Use of pre-filled communication for parents to be sent out by schools
No real consequences for low attendance:
Between 2005-2007, non-representative results where marked with symbols
No imputation strategy so far
Use of pre-filled communication for parents to be sent out by schools
No real consequences for low attendance:
Between 2005-2007, non-representative results where marked with symbols
No imputation strategy so far
Improvement of regulation for justifying students exclusion
Attendance Patterns for the Day of the Test
Standardized tests 2011-2018 (SIMCE)
Scores at student and school level for different subjects (Math, Language, History, and Science)
Student's socioeconomic characterization (parental questionnaire)
Standardized tests 2011-2018 (SIMCE)
Scores at student and school level for different subjects (Math, Language, History, and Science)
Student's socioeconomic characterization (parental questionnaire)
Daily attendance data 2011-2018 (SIGE)
Standardized tests 2011-2018 (SIMCE)
Scores at student and school level for different subjects (Math, Language, History, and Science)
Student's socioeconomic characterization (parental questionnaire)
Daily attendance data 2011-2018 (SIGE)
GPA Performance 2011-2018 (Rendimiento)
Grade | Years tested | Num Schools | Num Students |
---|---|---|---|
2 | 2013, 2014, 2015 | 5,266 | 628,073 |
4 | 2011, 2013-2018 | 5,673 | 1,461,289 |
6 | 2013-2016, 2018 | 5,516 | 1,056,243 |
8 | 2011, 2013-2015, 2017 | 5,545 | 1,078,140 |
10 | 2013-2018 | 2,623 | 1,213,067 |
Yipsgt=5∑P=15∑T=−4τPTDPTipsgt+γpt+αi+ϵipsgt
Where
Yipsgt: Binary attendance for student i, from GPA group p, in school s and grade g, for day t.
DPTipsgt: Indicator variables (lags and leads) for students that belong to a tested grade.
GPA Decile | Told | Notification | Preparation | Grades |
---|---|---|---|---|
D1 | -0.06*** | -0.11*** | -0.08*** | 0.14*** |
(0.00) | (0.00) | (0.00) | (0.00) | |
D10 | 0.06*** | 0.05*** | 0.05*** | -0.2*** |
(0.00) | (0.00) | (0.00) | (0.00) | |
Baseline | 0.89*** | 0.87*** | 0.89*** | 0.39*** |
(0.00) | (0.00) | (0.00) | (0.00) |
GPA Decile | Told | Notification | Preparation | Grades |
---|---|---|---|---|
D1 | -0.06*** | -0.11*** | -0.08*** | 0.14*** |
(0.00) | (0.00) | (0.00) | (0.00) | |
D10 | 0.06*** | 0.05*** | 0.05*** | -0.2*** |
(0.00) | (0.00) | (0.00) | (0.00) | |
Baseline | 0.89*** | 0.87*** | 0.89*** | 0.39*** |
(0.00) | (0.00) | (0.00) | (0.00) |
GPA Decile | Told | Notification | Preparation | Grades |
---|---|---|---|---|
D1 | -0.06*** | -0.11*** | -0.08*** | 0.14*** |
(0.00) | (0.00) | (0.00) | (0.00) | |
D10 | 0.06*** | 0.05*** | 0.05*** | -0.2*** |
(0.00) | (0.00) | (0.00) | (0.00) | |
Baseline | 0.89*** | 0.87*** | 0.89*** | 0.39*** |
(0.00) | (0.00) | (0.00) | (0.00) |
GPA Decile | Told | Notification | Preparation | Grades |
---|---|---|---|---|
D1 | -0.02*** | -0.01*** | -0.02*** | 0.05*** |
(0.00) | (0.00) | (0.00) | (0.00) | |
D10 | 0.01*** | 0.00 | 0.00 | -0.03*** |
(0.00) | (0.00) | (0.00) | (0.00) | |
Baseline | 0.95*** | 0.78*** | 0.82*** | 0.33*** |
(0.00) | (0.00) | (0.00) | (0.00) |
Students experience a disutility from testing?
Grade - Year | D1 | D2 | D3D8 | D9 | D10 |
---|---|---|---|---|---|
2nd 2011 | -0.01 | 0.01 | 0.01* | 0.02 | 0.00 |
(0.01) | (0.01) | (0.01) | (0.01) | (0.01) | |
5th 2012 | 0.00 | -0.01 | 0.00 | 0.01 | 0.01 |
(0.01) | (0.01) | (0.01) | (0.01) | (0.01) | |
6th 2011 | 0.02* | 0.01 | 0.01** | 0.01 | 0.00 |
(0.01) | (0.01) | (0.00) | (0.01) | (0.01) | |
6th 2017 | 0.00 | 0.03 | 0.01 | 0.01 | 0.00 |
(0.02) | (0.02) | (0.01) | (0.02) | (0.01) | |
11th 2012 | 0.00 | 0.00 | 0.00 | -0.02** | 0.00 |
(0.01) | (0.01) | (0.00) | (0.01) | (0.01) |
Predicting the Counterfactual
Can we use this existing rich panel data to predict attendance on the day of the test as if it was a regular day?
Use Extreme Gradient Boosting (XGBoost) with panel data for attendance prediction
Can we use this existing rich panel data to predict attendance on the day of the test as if it was a regular day?
Use Extreme Gradient Boosting (XGBoost) with panel data for attendance prediction
Use data between 1st and 5th grade (2017) in the Metropolitan region - 4th grade is treated:
School1
Math: 258
Language: 260
School2
Math: 259
Language: 256
School1
Math: 258
Language: 260
School2
Math: 259
Language: 256
School1
Math: 258
Language: 260
School2
Math: 259
Language: 256
Cluster 1 Increase Att (N=1094) |
Cluster 2 Lower Att (bottom) (N=346) |
|||||
---|---|---|---|---|---|---|
Mean | SD | Mean | SD | Diff. in Means | P-value | |
Avg. SIMCE Lang | 258.84 | 22.38 | 252.62 | 23.84 | -6.22 | 0.00 |
Avg. SIMCE Math | 254.42 | 25.70 | 247.80 | 25.15 | -6.62 | 0.00 |
Public | 0.35 | 0.48 | 0.42 | 0.49 | 0.07 | 0.03 |
SEP status | 0.84 | 0.37 | 0.88 | 0.33 | 0.03 | 0.11 |
% Priority Students | 0.48 | 0.19 | 0.52 | 0.19 | 0.04 | 0.00 |
Diff D1 GPA | 0.02 | 0.15 | -0.22 | 0.27 | -0.24 | 0.00 |
Diff D2 GPA | 0.05 | 0.11 | -0.17 | 0.21 | -0.22 | 0.00 |
Diff D9 GPA | 0.04 | 0.06 | -0.03 | 0.15 | -0.07 | 0.00 |
Diff D10 GPA | 0.03 | 0.07 | -0.01 | 0.12 | -0.04 | 0.00 |
Cluster 1 Increase Att (N=1094) |
Cluster 2 Lower Att (bottom) (N=346) |
|||||
---|---|---|---|---|---|---|
Mean | SD | Mean | SD | Diff. in Means | P-value | |
Avg. SIMCE Lang | 258.84 | 22.38 | 252.62 | 23.84 | -6.22 | 0.00 |
Avg. SIMCE Math | 254.42 | 25.70 | 247.80 | 25.15 | -6.62 | 0.00 |
Public | 0.35 | 0.48 | 0.42 | 0.49 | 0.07 | 0.03 |
SEP status | 0.84 | 0.37 | 0.88 | 0.33 | 0.03 | 0.11 |
% Priority Students | 0.48 | 0.19 | 0.52 | 0.19 | 0.04 | 0.00 |
Diff D1 GPA | 0.02 | 0.15 | -0.22 | 0.27 | -0.24 | 0.00 |
Diff D2 GPA | 0.05 | 0.11 | -0.17 | 0.21 | -0.22 | 0.00 |
Diff D9 GPA | 0.04 | 0.06 | -0.03 | 0.15 | -0.07 | 0.00 |
Diff D10 GPA | 0.03 | 0.07 | -0.01 | 0.12 | -0.04 | 0.00 |
How to handle this absenteeism problem?
How to handle this absenteeism problem?
Proposals to impute lowest scores for absent students to disincentivize arbitrary exclusion
How can we impute missing scores?
Scenario 1: Not impute at all. Show observed distributions.
Scenario 2: Impute by decile only for the difference between predicted and observed attendance.
Scenario 3: Impute every missing student.
How can we impute missing scores?
Scenario 1: Not impute at all. Show observed distributions.
Scenario 2: Impute by decile only for the difference between predicted and observed attendance.
Scenario 3: Impute every missing student.
Some caveats:
Difference between predicted and obs. captures total incentives/disincentives in attendance.
Imputed score might be too optimistic (e.g. real score would be lower than observed distribution)
Let's Wrap Up...
Non-representative patterns of absenteeism beyond exclusion of low-performers
Non-representative patterns of absenteeism beyond exclusion of low-performers
Communication strategies play important role for lower-performing students
Non-representative patterns of absenteeism beyond exclusion of low-performers
Communication strategies play important role for lower-performing students
Impact of imputation policies?
PRIISM Seminar
April 3rd, 2024
Magdalena Bennett
The University of Texas at Austin
Christopher Neilson
Yale University
Nicolás Rojas
Columbia University
Keyboard shortcuts
↑, ←, Pg Up, k | Go to previous slide |
↓, →, Pg Dn, Space, j | Go to next slide |
Home | Go to first slide |
End | Go to last slide |
Number + Return | Go to specific slide |
b / m / f | Toggle blackout / mirrored / fullscreen mode |
c | Clone slideshow |
p | Toggle presenter mode |
t | Restart the presentation timer |
?, h | Toggle this help |
Esc | Back to slideshow |