Impacts of COVID-19 on Public Universities in Brazil: A Machine Learning Counterfactual Analysis
Volume 22, Issue 4 (2024), pp. 621–630
Pub. online: 5 February 2024
Type: Data Science In Action
Open Access
Received
29 September 2023
29 September 2023
Accepted
17 January 2024
17 January 2024
Published
5 February 2024
5 February 2024
Abstract
This study delves into the impact of the COVID-19 pandemic on the enrollment rates of on-site undergraduate programs within Brazilian public universities. Employing the Machine Learning Control Method, a counterfactual scenario was constructed in which the pandemic did not occur. By contrasting this hypothetical scenario with real-world data on new entrants, a variable was defined to characterize the impact of the COVID-19 pandemic on on-site undergraduate programs at Brazilian public universities. This variable reveals that the impact factor varies significantly when considering the geographical locations of the institutions offering these courses. Courses offered by institutions located in smaller population cities experienced a more pronounced impact compared to those situated in larger urban centers.
Supplementary material
Supplementary MaterialThe following files are included in the supplementary material: (1) Study code file; (2) URL to INEP census data; (3) IBGE population of each municipality data.
References
Athey S, Bayati M, Doudchenko N, Imbens GW, Khosravi K (2021). Matrix completion methods for causal panel data models. Journal of the American Statistical Association, 116(536): 1716–1730. https://doi.org/10.1080/01621459.2021.1891924
Athey S, Imbens GW (2016). Recursive partitioning for heterogeneous causal effects. Proceedings of the National Academy of Sciences of the United States of America, 113(27): 7353–7360. https://doi.org/10.1073/pnas.1510489113
Belloni A, Chernozhukov V, Fernandez-Val I, Hansen C (2017). Program evaluation and causal inference with high-dimensional data. Econometrica, 85(1): 233–298. https://doi.org/10.3982/ECTA12723
Burlig F, Knittel C, Rapson D, Reguant M, Wolfram C (2020). Machine learning from schools about energy efficiency. Journal of the Association of Environmental and Resource Economists, 7(6): 1181–1217. https://doi.org/10.1086/710606
Cerqua A, Di Stefano R LM, Miccoli S (2021). Local mortality estimates during the COVID-19 pandemic in Italy. Journal of Population Economics, 34: 1189–1217. https://doi.org/10.1007/s00148-021-00857-y
Cerqua A, Letta M (2022). Local inequalities of the COVID-19 crisis. Regional Science and Urban Economics, 92: 103752. https://doi.org/10.1016/j.regsciurbeco.2021.103752
Hofman J, Watts D, Athey S, Garip F, Griffiths T, Kleinberg J (2021). Integrating explanation and prediction in computational social science. Nature, 595(7866): 181–188. https://doi.org/10.1038/s41586-021-03659-0
Souza M (2019). Predictive counterfactuals for treatment effect heterogeneity in event studies with staggered adoption. Social Science Research Network Electronic Journal, https://doi.org/10.2139/ssrn.3484635.
Svábová L, Kramárová K, Durica M (2021). Evaluation of the effects of the graduate practice in Slovakia: comparison of results of counterfactual methods. Central European Business Review, 10(4): 1. https://doi.org/10.18267/j.cebr.266
Varian HR (2014). Big data: new tricks for econometrics. The Journal of Economic Perspectives, 28(2): 3–28. https://doi.org/10.1257/jep.28.2.3
Varian HR (2016). Causal inference in economics and marketing. Proceedings of the National Academy of Sciences of the United States of America, 113(27): 7310–7315. https://doi.org/10.1073/pnas.1510479113
Wager S, Athey S (2018). Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association, 113(523): 1228–1242. https://doi.org/10.1080/01621459.2017.1319839