Introduction
In this assignment, you will demonstrate your ability to understand and conduct a difference-in-differences analysis by replicating part of the main analysis from the article “When Migrants Mobilize Against Labor Exploitation” (Gemma Dipoppa, American Political Science Review, 2025).

Formalities
Your assignment should be a maximum of 14,400 characters.*
* excluding your code, tables, table notes, figure notes, and bibliography
All of the requirements below are standard in basically all working papers in political science and for submission to journals, so please stick to the following conventions:
- 12-point font
- 1 inch margins
- Double-spaced
- No Table of Content
- 14,400 characters maximum
- Excluding your code, tables, table notes, figure notes, bibliography
If you write your assignment in LaTeX (e.g. Overleaf) it’s therefore:
\documentclass[12pt, A4]{article}\usepackage[margin = 1in]{geometry}\doublespacing after your \begin{document}
If you want to use the Overleaf template that I use for my own academic manuscripts, you can find it here.
Expectations
The technical side of your assignment should, of course, be well-executed. But make sure to focus a lot of your energy on the structure and clarity of your writing. The reason that students often do not receive the grade that they might hope for is because the written part of their assignment is unclear. A few points:
- When discussing a method or test used in the article, provide the intution behind it in plain language (non-technical, non-abstract langauge)
- i.e. explain a method or test in plain non-technical language before turning to any abstract or technical language
- When discussing a statistical test, provide the theoretical or substantive reason why it is being used (again, in plain, non-technical, non-abstract language)
- Even if your audience understands the technical side of things, you always want to give a reader some intuition behind what you’re doing
Instructions
Data: Assignment_2B_Data.zip
Note: The author uses time (year) and city (panelvar) fixed effects in all of her models. She clusters her standard errors at the city level (panelvar). The numerical results in your assignment should almost exactly match the numerical results in the paper that you are replicating.
Codebook:
year: Year
panelvar: City
treated: Whether city i in year y is treated
treated_psm: Whether city i in year y is treated (for a propensity score matched sample)
newsd: Any rackeeting news
news_pc: Rackeeting news per capita
anyseiz: Any goods seized from the mafia
imm_seized: Goods seized from the mafia
Introduction
- Briefly introduce the article that you are replicating:
- Introduce the topic
- Introduce the research question
- Introduce the hypothesis or hypotheses
Research design
- Explain the author’s research design:
- The empirical strategy and data
- The identifying assumptions- i.e. how does their research design allow them to estimate a causal effect and what are the potential threats to causal inference?
- Note: It should be clear from this section that you know how a difference-in-differences research design works.
- Note: The author suggests that the intervention needs to be as-if random. Note, however, that as-if randomness is not the identifying assumption of a difference-in-differences design. Her suggestion seems to be that it’s more likely that the parallel trends assumption will be violated if the treated and untreated units are very different from each other in a general way, which is a fair concern though.
Analysis
- Replicate Figure 2 from the article.
- Use theNewsdataset to create the summary data that go into this figure.
- Explain what the figure is showing and why the author is showing it.
- It might look something like this, but make it look however you want:

- Replicate the Two-Way Fixed Effects results from Figure 3, but do so as a table.
- Models 1-4 use theNewsdataset; Model 5-8 use theSeizeddataset.treatedandtreated_psmimplicitly define which dataset is being used:treated_psmis NA for any observation that isn’t part of the matched dataset. i.e. you don’t need to subset here, just usedtreatedin the odd-numbered Models (1, 3, 5, 7) andtreated_psmin the even-numbered models (2, 4, 6, 8).- Important: the author uses only data from 2016 or earlier, so be sure to only use data that are
year <= 2016.- Explain the table’s purpose and interpret each of the results.
- Your table might look something like this, but make it look however you want:

- Create an event study figure using a Two-Way Fixed Effects model for each of the four outcomes from Figure 3 (use
treatedrather thantreated_psmfor each of them).
- You can show four separate figures or combine them into 4 panels in a single figure.
- Again, the author uses only data from 2016 or earlier, so be sure to only use data that areyear <= 2016.
- Explain the purpose of an event study graph.
- Explain the results and what they say about the authors main findings (i.e. those from Figure 3).
- Your figure might look something like this, but, again, make it look however you want:

- For each of the four outcomes from Figure 3, create an event study figure using a model designed to correct for the issues with staggered difference-in-differences data (use
treatedrather thantreated_psmfor each of them).- You can show four separate figures or combine them into 4 panels in a single figure.
- Again, the author uses only data from 2016 or earlier, so be sure to only use data that are
year <= 2016. - Explain the results of this figure and explain the reason why using a staggered difference-in-differences model is important.
- Calculate the average treatment effect on the treated (ATT), and state how the results differ from those of the Two-Way Fixed Effects estimates that you calculated for the table in Step 2.

Discussion
- Explain the overall findings of the article. One paragraph should suffice.
- Discuss the benefits and drawbacks of the research design as it is applied in the article.
Submission instructions
-
Formatting requirements!
- 12-point font
- 1 inch margins
- Double-spaced
- No Table of Contents
- 14,400 characters maximum
- Excluding your code, tables, table notes, figure notes, bibliography
-
Your code should be commented so that it is clear to me that you know what each piece of code does. You don’t need to be excessive, but comment your code in a way that you think is reasonable.
-
If you are in a group, only one of you needs to submit the assignment. Just ensure that all of your student IDs are on the title page.
-
Submit your assignment and R code separately. Please send your assignment as a PDF and use an equivalent file name across the files:
- Assignment_2B.pdf
- Assignment_2B.R
-
Submit through Absalon under “Assignment 2B”.