+ - 0:00:00
Notes for current slide
Notes for next slide

How far is too far?
Generalization of a regression discontinuity design away from the cutoff

Magdalena Bennett

International Methods Colloquium
Mar 12, 2021

Regression discontinuity design

Strong internal validity

... But limited external validity

Missing data and no overlap

Can we find a generalization interval?

This paper

Identification of generalization interval and estimation of ATT for population within such interval:

This paper

Identification of generalization interval and estimation of ATT for population within such interval:

  • Pre-intervention period informs the generalization bandwidth
    (Wing & Cook, 2013; Keele, Small, Hsu, & Fogarty, 2020)

  • Leverage the use of predictive covariates for breaking link between running variable and outcome
    (Angrist & Rokkanen, 2015; Rokkanen, 2015; Keele, Titiunik, & Zubizarreta, 2015)

  • Based on idea of local randomization near the cutoff
    (Lee, 2008; Cattaneo, Frandsen, & Titiunik, 2015)

This paper

Main advantages:

This paper

Main advantages:

  • Gradual approach
    • No need for "All or Nothing"
    • Interval informed by the data (Cattaneo et al., 2015)

This paper

Main advantages:

  • Gradual approach
    • No need for "All or Nothing"
    • Interval informed by the data (Cattaneo et al., 2015)
  • No extrapolation of population characteristics
    • Compare like-to-like (Rosenbaum, 1987)
    • Makes overlap region explicit

This paper

Main advantages:

  • Gradual approach
    • No need for "All or Nothing"
    • Interval informed by the data (Cattaneo et al., 2015)
  • No extrapolation of population characteristics
    • Compare like-to-like (Rosenbaum, 1987)
    • Makes overlap region explicit
  • Generalization to population of interest
    • Use of representative template matching (Silber et al., 2014; Bennett, Vielma, & Zubizarreta, 2020)

This paper

Main advantages:

  • Gradual approach
    • No need for "All or Nothing"
    • Interval informed by the data (Cattaneo et al., 2015)
  • No extrapolation of population characteristics
    • Compare like-to-like (Rosenbaum, 1987)
    • Makes overlap region explicit
  • Generalization to population of interest
    • Use of representative template matching (Silber et al., 2014; Bennett, Vielma, & Zubizarreta, 2020)
  • Sensitivity analysis to hidden bias (Rosenbaum, 2010; Keele et al., 2020)

Outline

  1. Motivation

  2. Generalized Regression Discontinuity Design (GRD)

    2.1 Framework

    2.2 GRD in practice

  3. Application: Free Higher Education in Chile

  4. Conclusions

Generalized Regression Discontinuity Design

Generalized Regression Discontinuity Design (GRD)

Generalized Regression Discontinuity Design (GRD)

Two-part problem with pre- and post-intervention periods:

Generalized Regression Discontinuity Design (GRD)

Two-part problem with pre- and post-intervention periods:

1) Identification of generalization interval H*

(using pre-intervention period)

Generalized Regression Discontinuity Design (GRD)

Two-part problem with pre- and post-intervention periods:

1) Identification of generalization interval H*

(using pre-intervention period)
2) Estimation of ATT for population within H*

(using post-intervention period)

The setup

  • Two periods: pre- and post-intervention, t=0 and t=1.

  • Running variable R determines assingment Z in t=1. E.g.: Zit=I(Rit<c)

  • Potential outcomes under treatmet z=0,1: Yit(z)=gz(Xit,uit,rit)+zitτ(Xit,uit,rit)Treat. Effect+αtPeriod FE

    • Xit: Predictive covariates

    • uit: Unobserved confounders

    • τ(): Causal effect

Two periods for GRD

A gradual approach

  • Conditional expectations of potential outcomes, Yt(z)(R): Y0(0)(R)=E[Yi0(0)|R]=μ0(R) Y0(1)(R)=E[Yi0(1)|R]=μ0(R)Avg. Outcome by R+τ0(R)Treat. Effect by R

A gradual approach

  • Conditional expectations of potential outcomes, Yt(z)(R): Y0(0)(R)=E[Yi0(0)|R]=μ0(R) Y0(1)(R)=E[Yi0(1)|R]=μ0(R)Avg. Outcome by R+τ0(R)Treat. Effect by R

  • Identify generalization interval H=[H,H+] for t=0: Ri=h(Xi)+ηi   RiH

A gradual approach

  • Conditional expectations of potential outcomes, Yt(z)(R): Y0(0)(R)=E[Yi0(0)|R]=μ0(R) Y0(1)(R)=E[Yi0(1)|R]=μ0(R)Avg. Outcome by R+τ0(R)Treat. Effect by R

  • Identify generalization interval H=[H,H+] for t=0: Ri=h(Xi)+ηi   RiH

  • If H=max{|H|} exists, then for a set of covariates X=XT: Y0(0)(R)|XT=Y0(0)(R)|XT      for any R,RH

Conditional outcome within the generalization interval

Main assumption for generalization to t=1



Assumption: Conditional time-invariance under control Y0(0)(R|X)=Y1(0)(R|X)+α,   RH

Main assumption for generalization to t=1



Assumption: Conditional time-invariance under control Y0(0)(R|X)=Y1(0)(R|X)+α,   RH

  • No changes in unobserved confounders between t=0 and t=1 for units within H

  • Partially testable for Z=0 in t=1

Estimating an effect away from the cutoff

GRD in practice

Context: Representative template matching

Diagram for GRD

Step 0: Identification of narrow interval

Step 1: Template selection

Step 2: Matching units in pre-intervention period

Step 2: Matching units in pre-intervention period

Step 2: Matching units in pre-intervention period

Step 2: Matching units in pre-intervention period

Step 2: Matching units in pre-intervention period

Step 2: Matching units in pre-intervention period

Step 3: Fit a local polynomial

Step 4: Identify new generalization interval

Step 5: If generalization interval > template...

Step 5a: ... expand template and do it again.

Step 5b: ... until template is the same as the interval

Step 5b: ... until template is the same as the interval

This can break in two ways:

1) No overlap of covariates
2) Predictive covariates don't explain correlation between R and Y

−400−2000200400
Running variableRunning variableRunning variableRunning variableOutcomeOutcomeOutcomeOutcomeCutoffCutoffCutoffPlay

Step 6: ATT estimation

Step 6: ATT estimation

Straightforward estimation given the matched sample:

  • E.g. paired t-test:

τ^ATT=k=1NYk(1)1Yk(0)1(Yk(1)0Yk(0)0)N=k=1NdkN

Yk(z)t: Outcome within matched group k with treatment z for period t.

Application: Free Higher Education

Free Higher Education (FHE) in Chile

Context of higher education in Chile:

  • Centralized admission system (deferred admission mechanism)

  • Admission score: PSU score + GPA score + ranking score

  • Before 2016: Scholarships + government-backed loans

FHE policy:

  • Introduced in December 2015 (unanticipated)

  • Eligibility: Lower 50% income distribution + admitted to eligible program

Research question


What was the effect of being eligible for FHE on application and enrollment to university?

Research question


What was the effect of being eligible for FHE on application and enrollment to university?

  • Treatment: SE eligibility for FHE

  • Two outcomes: Application to university and enrollment

    • Lower-income students financial constraints
    • Saliency of the policy

Research question


What was the effect of being eligible for FHE on application and enrollment to university?

  • Treatment: SE eligibility for FHE

  • Two outcomes: Application to university and enrollment

    • Lower-income students financial constraints
    • Saliency of the policy
  • Larger effects for students away from the cutoff?

    • Compare RD to GRD results

What data do I have?



  • 3 cohorts: 2014, 2015, and 2016 (~ 200,000 students)
  • Rich baseline data: Demographic and socioeconomic data at student level, 10th (8th) grade standardized scores, school characteristics.
  • Application data: Scores by subject, application, and enrollment.

How does the RD look like?

GRD for Free Higher Education

Steps for GRD:

  • Select template size: N=1,000

  • 20 bin for grid

  • MIP matching:

    • Restricted mean balance (0.05 SD) : Academic performance, school characteristics, demographic/socioeconomic variables.

    • Fine balance: Gender, mother's and father's education (8 cat.), PSU language score (deciles), PSU math score (deciles), HS GPA (quintiles).

GRD for Free Higher Education

Steps for GRD:

  • Select template size: N=1,000

  • 20 bin for grid

  • MIP matching:

    • Restricted mean balance (0.05 SD) : Academic performance, school characteristics, demographic/socioeconomic variables.

    • Fine balance: Gender, mother's and father's education (8 cat.), PSU language score (deciles), PSU math score (deciles), HS GPA (quintiles).

Generalization interval: [-M$500.3, M$300.9]

For what population are we generalizing for?

For what population are we generalizing for?

Testing time invariance on control side for t=1

Balance for the entire sample

Balance within generalization interval (before matching)

Balance within generalization interval (after matching)

Effects of introduction of FHE: RD and GRD

Robust RD results
GRD results
Application
Enrollment
Application
Enrollment
Effect
0.035
0.069**
0.052**
0.077***
[-0.007, 0.077]
[0.026, 0.112]
[0.008, 0.096]
[0.029, 0.125]
Effective N Obs
6,588
6,458
2,000
2,000
Control Mean
0.606
0.515
0.568
0.472

Note: Generalization interval [-M$500, M$301]. 95% CI in brackets.

Effects of introduction of FHE: Application

Robust RD results
GRD results
Application
Enrollment
Application
Enrollment
Effect
0.035
0.069**
0.052**
0.077***
[-0.007, 0.077]
[0.026, 0.112]
[0.008, 0.096]
[0.029, 0.125]
Effective N Obs
6,588
6,458
2,000
2,000
Control Mean
0.606
0.515
0.568
0.472

Note: Generalization interval [-M$500, M$301]. 95% CI in brackets.

Effects of introduction of FHE: Enrollment

Robust RD results
GRD results
Application
Enrollment
Application
Enrollment
Effect
0.035
0.069**
0.052**
0.077***
[-0.007, 0.077]
[0.026, 0.112]
[0.008, 0.096]
[0.029, 0.125]
Effective N Obs
6,588
6,458
2,000
2,000
Control Mean
0.606
0.515
0.568
0.472

Note: Generalization interval [-M$500, M$301]. 95% CI in brackets.

How does the effect change with interval width?

Conclusions

  • GRD as a gradual approach (not all or nothing)

Conclusions

  • GRD as a gradual approach (not all or nothing)

  • Use data to inform interval for generalization

Conclusions

  • GRD as a gradual approach (not all or nothing)

  • Use data to inform interval for generalization

  • Use of matching to avoid extrapolation

Conclusions

  • GRD as a gradual approach (not all or nothing)

  • Use data to inform interval for generalization

  • Use of matching to avoid extrapolation

  • Limitations:
    • More data: two periods
    • Conditional time invariance assumption for t=1

Conclusions

  • GRD as a gradual approach (not all or nothing)

  • Use data to inform interval for generalization

  • Use of matching to avoid extrapolation

  • Limitations:
    • More data: two periods
    • Conditional time invariance assumption for t=1

  • Multiple applications for DD-RD: e.g. geographic RDs.

How Far is too Far? Generalization of a Regression Discontinuity Design Away from the Cutoff



New version of the paper is coming.

www.magdalenabennett.com

Regression discontinuity design

Paused

Help

Keyboard shortcuts

, , Pg Up, k Go to previous slide
, , Pg Dn, Space, j Go to next slide
Home Go to first slide
End Go to last slide
Number + Return Go to specific slide
b / m / f Toggle blackout / mirrored / fullscreen mode
c Clone slideshow
p Toggle presenter mode
t Restart the presentation timer
?, h Toggle this help
Esc Back to slideshow