EventStudy

R package and guide for performing event studies with heterogeneous dynamic effects.

Generate Convert Improve

Install / Use

/learn @setzler/EventStudy

About this skill

Quality Score

0/100

README

Warning: This package is now deprecated.

This package is now deprecated and we do not plan to maintain it going forward. We suggest one of the following alternatives:

The did package in R (here).
The DiDforBigData package in R (here).

eventStudy: Perform an Event Study in R

Created by David Novgorodsky and Bradley Setzler at the University of Chicago

The methods are explained in our companion Practical Guide to Event Studies.

Disclaimer: By using this software, you accept the terms of the MIT license. This is a work in progress. It is updated frequently. Please let us know if you find any bugs or have questions that are not addressed in the documentation.

Overview

Purpose

eventStudy is an R package for performing event studies. It has many capabilites:

It maximizes sample size by using a "stacked" approach to match all possible control observations to any given treatment observation at any event time;
It allows the estimated treatment effects to vary over time and by treatment cohort. It can also impose a common treatment effect across cohorts (extending results by Abraham & Sun, 2018);
It has built-in tools to correct for anticipation (extending results by Fadlon & Nielsen, 2019) and deviations from parallel trends (either using covariate balancing in the spirit of Abadie, 2005, or modeling parametric deviations from parallel trends).

Installation

Preferred method: The latest version of the package can be installed with the command devtools::install_github("setzler/eventStudy/eventStudy").

Installation without internet access: Clone this repo to your personal machine, use the command line approach of R CMD BUILD eventStudy/ to build the file eventStudy_0.1.0.tar.gz, move this .tar.gz file to the location without internet access, then install on that location using the command install.package("eventStudy_0.1.0.tar.gz",repo=NULL) within an R session. In case you cannot clone and build the repo, we provide a (possibly out of date) version of the .tar.gz file above.

The `ES` command

The event study is performed by the ES command. The examples below show how to use it.

Required Arguments of the `ES` command:

These required arguments simply supply ES your data and tell ES the names of your variables:

long_data: This is the data in long format (e.g., each row corresponds to a household in a given calendar year). It must be a data.table.
outcomevar: This is the outcome of interest. It must be a character.
unit_var: This is the individual. It must be a character.
cal_time_var: This is the calendar time variable (e.g., year). It must be a character.
onset_time_var: This is the time at which the observation is treated. It must be a character.
cluster_vars: These are the variables used to cluster standard errors (e.g., setting it to the same variable as unit_var is a common choice). It must be a character vector.

Optional Arguments of the `ES` command:

These optional arguments allow you to customize ES to deal with issues like anticipation and non-parallel trends:

omitted_event_time: This lets you decide the pre-treatment event time to use as the reference year. It must be an integer that is less than or equal to -1. The default is -2.
min_control_gap and max_control_gap: min_control_gap is the minimum number of time periods ahead that a cohort must receive treatment in order to be included in the control group (e.g., Fadlon & Nielsen, 2019, use min_control_gap=5). It must be an integer that is at least 1. The default is 1. max_control_gap is similar but for the maximum number of time periods ahead (e.g., Fadlon & Nielsen, 2019, also use max_control_gap=5). It must be an integer that is at least as large as min_control_gap. The default is Inf.
anticipation: This drops observations from the control group this many time periods ahead of the control groups receiving the treatment. For example, consider a treatment group that receives treatment in 2001. If min_control_gap=5 and anticipation=0, then another cohort treated in 2006 can be used as a control group up to 2005, but if anticipation=1, then they can be used as a control group up to 2004.
homogeneous_ATT: This will only estimate the pooled (homogeneous) treatment effects if set to TRUE, and skip estimating the cohort-specific effects, which may be slow for large data. Default is FALSE.
never_treat_action: This determines how to handle never-treated groups. Those who do not receive treatment should be coded as onset_time_var=NA. The default option is never_treat_action="none", in which case, it is expected that all observations are treated, and an error is thrown if any observations have onset_time_var=NA. Other options are: never_treat_action="only", in which case only never-winners are used as control groups; never_treat_action="exclude", in which case never-winners are never used as controls; and never_treat_action="keep", in which case both never-winners and not-yet-winners are used as controls.
control_subset_var and control_subset_event_time: These variables allow one to require that the control group satisfies a logical condition at a given event time. control_subset_var must be the name of a logical variable in the data (all values of this variable are TRUE or FALSE) and control_subset_event_time is an integer which specifies the event time at which the control group must have a value of TRUE on control_subset_var. The default is control_subset_var=NA, which bypasses this option.
fill_zeros: fill_zeros is a logical indicator (TRUE or FALSE) which results in the data being filled and the outcome set to zero for all observations on a unit that are missing within the time frame included in the data. The default is FALSE.
linearize_pretrends: linearize_pretrends is a logical indicator that, when set to TRUE, results in linear pre-trends being fit during the pre-event years, and then residualized out of both the pre-event and post-event outcomes. The default is FALSE.
residualize_covariates: Like linearize_pretrends, but it residualizes on provided covariates rather than time. Either discrete_covars (discrete covariates) or cont_covars (continuous covariates) or both must be specified as character vectors of variable names. Discrete covariates will be residualized using fixed effects, while continuous covariates will be residualized linearly. The default is FALSE.

Plotting the results from `ES`

You can make your own plots with the results. As a convenience, we have prepared these functions to automatically make your plots:

ES_plot_levels will plot the levels across event times for the treated and control cohorts.
ES_plot_ATTs will plot the treatment effects across event times. By default, it will plot heterogeneous effects for each cohort; set homogeneous_ATT = TRUE to only plot the pooled effect under the homogeneity assumption.

Examples

Set-up

devtools::install_github("setzler/eventStudy/eventStudy")

library(eventStudy)
library(data.table)
library(ggplot2)

Let's simulate some data using our built-in convenience function called ES_simulate_data:

# simulate the data with 1000 individuals (use only the first element)
sim_data <- ES_simulate_data(units = 1000)[["observed"]]
# view the simulated data
sim_data[]

##       individual year treatment_year   outcome
##    1:          1 1999           2003 0.8141879
##    2:          1 2000           2003 0.8703408
##    3:          1 2001           2003 0.7651807
##    4:          1 2002           2003 0.9349418
##    5:          1 2003           2003 0.4439295
##   ---                                         
## 6996:       1000 2001           2003 1.0030966
## 6997:       1000 2002           2003 0.7616094
## 6998:       1000 2003           2003 0.8115637
## 6999:       1000 2004           2003 0.5377100
## 7000:       1000 2005           2003 0.4799415

In this data, the treatment is received in the year given by the treatment_year variable. The other variables are individual, year, and outcome. We wish to perform an event study to understand the effect of this treatment on this outcome.

To get a sense of what is in the simulated data, here is a plot of the mean outcomes by year at which treatment is received:

# calculate the means
cohort_means <- sim_data[,list(outcome = mean(outcome)), list(treatment_year,year)]
# plot the means
ggplot(data=cohort_means,aes(x=year,y=outcome,colour=factor(treatment_year))) + geom_line() + 
  labs(x = "Year", y = "Outcome", color = "Treatment") + theme_bw(base_size=16)

(Note: even though the data is drawn from the model with perfect control groups, the control groups do not look great visually!)

Example 1. Basic Estimation

Given this data, we perform the basic event study as follows:

# run the event study
results <- ES(long_data=sim_data, outcomevar="outcome", 
              unit_var="individual", cal_time_var="year", 
              onset_time_var="treatment_year", cluster_vars="individual")

## INFO [2019-02-11 14:27:24] Beginning ES.
## INFO [2019-02-11 14:27:24] Beginning data stacking.
## INFO [2019-02-11 14:27:43] Successfully produced a stacked dataset with 27,992 rows.
## INFO [2019-02-11 14:27:44] Estimated heterogeneous case with OLS.
## INFO [2019-02-11 14:27:44] Estimated homogeneous case with OLS.
## INFO [2019-02-11 14:27:44] ES is finished.

Example 2.

Related Skills

node-connect

346.8k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

107.6k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

346.8k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

346.8k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。

setzler

View profile

View on GitHub

GitHub Stars74

CategoryDevelopment

Updated21d ago

Forks17

setzler/eventStudy

Languages

Security Score

95/100

Audited on Mar 12, 2026

No findings

EventStudy

Install / Use

README

Warning: This package is now deprecated.

eventStudy: Perform an Event Study in R

Overview

Purpose

Installation

The ES command

Required Arguments of the ES command:

Optional Arguments of the ES command:

Plotting the results from ES

Examples

Set-up

Example 1. Basic Estimation

Example 2.

Related Skills

The `ES` command

Required Arguments of the `ES` command:

Optional Arguments of the `ES` command:

Plotting the results from `ES`