<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1118</article-id>
<article-id pub-id-type="doi">10.6339/24-JDS1118</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Data Science in Action</subject></subj-group></article-categories>
<title-group>
<article-title>Impacts of COVID-19 on Public Universities in Brazil: A Machine Learning Counterfactual Analysis</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Rossi</surname><given-names>R.</given-names><suffix>Jr.</suffix></name><email xlink:href="mailto:romeu.rossi@ufv.br">romeu.rossi@ufv.br</email><email xlink:href="mailto:romeu.rossi@ufv.br">romeu.rossi@ufv.br</email><xref ref-type="aff" rid="j_jds1118_aff_001">1</xref>
</contrib>
<aff id="j_jds1118_aff_001"><label>1</label><institution>Universidade Federal de Viçosa - Instituto de Ciências Exatas e Tecnológicas - CAF, LMG818 Km6</institution>, Minas Gerais, Florestal 35690-000, <country>Brazil</country></aff>
</contrib-group>
<pub-date pub-type="ppub"><year>2024</year></pub-date><pub-date pub-type="epub"><day>5</day><month>2</month><year>2024</year></pub-date><volume>22</volume><issue>4</issue><fpage>621</fpage><lpage>630</lpage><supplementary-material id="S1" content-type="archive" xlink:href="jds1118_s001.zip" mimetype="application" mime-subtype="x-zip-compressed">
<caption>
<title>Supplementary Material</title>
<p>The following files are included in the supplementary material: (1) Study code file; (2) URL to INEP census data; (3) IBGE population of each municipality data.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>29</day><month>9</month><year>2023</year></date><date date-type="accepted"><day>17</day><month>1</month><year>2024</year></date></history>
<permissions><copyright-statement>2024 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2024</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>This study delves into the impact of the COVID-19 pandemic on the enrollment rates of on-site undergraduate programs within Brazilian public universities. Employing the Machine Learning Control Method, a counterfactual scenario was constructed in which the pandemic did not occur. By contrasting this hypothetical scenario with real-world data on new entrants, a variable was defined to characterize the impact of the COVID-19 pandemic on on-site undergraduate programs at Brazilian public universities. This variable reveals that the impact factor varies significantly when considering the geographical locations of the institutions offering these courses. Courses offered by institutions located in smaller population cities experienced a more pronounced impact compared to those situated in larger urban centers.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>counterfactual approach</kwd>
<kwd>educational data-mining</kwd>
</kwd-group>
</article-meta>
</front>
<back>
<ref-list id="j_jds1118_reflist_001">
<title>References</title>
<ref id="j_jds1118_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Athey</surname> <given-names>S</given-names></string-name>, <string-name><surname>Bayati</surname> <given-names>M</given-names></string-name>, <string-name><surname>Doudchenko</surname> <given-names>N</given-names></string-name>, <string-name><surname>Imbens</surname> <given-names>GW</given-names></string-name>, <string-name><surname>Khosravi</surname> <given-names>K</given-names></string-name> (<year>2021</year>). <article-title>Matrix completion methods for causal panel data models</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>116</volume>(<issue>536</issue>): <fpage>1716</fpage>–<lpage>1730</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.2021.1891924" xlink:type="simple">https://doi.org/10.1080/01621459.2021.1891924</ext-link></mixed-citation>
</ref>
<ref id="j_jds1118_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Athey</surname> <given-names>S</given-names></string-name>, <string-name><surname>Bayati</surname> <given-names>M</given-names></string-name>, <string-name><surname>Imbens</surname> <given-names>GW</given-names></string-name>, <string-name><surname>Qu</surname> <given-names>Z</given-names></string-name> (<year>2019</year>). <article-title>Ensemble methods for causal effects in panel data settings</article-title>. <source><italic>American Economic Association Papers and Proceedings</italic></source>, <volume>109</volume>: <fpage>65</fpage>–<lpage>70</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1118_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Athey</surname> <given-names>S</given-names></string-name>, <string-name><surname>Imbens</surname> <given-names>GW</given-names></string-name> (<year>2016</year>). <article-title>Recursive partitioning for heterogeneous causal effects</article-title>. <source><italic>Proceedings of the National Academy of Sciences of the United States of America</italic></source>, <volume>113</volume>(<issue>27</issue>): <fpage>7353</fpage>–<lpage>7360</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1073/pnas.1510489113" xlink:type="simple">https://doi.org/10.1073/pnas.1510489113</ext-link></mixed-citation>
</ref>
<ref id="j_jds1118_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>Belloni</surname> <given-names>A</given-names></string-name>, <string-name><surname>Chernozhukov</surname> <given-names>V</given-names></string-name>, <string-name><surname>Fernandez-Val</surname> <given-names>I</given-names></string-name>, <string-name><surname>Hansen</surname> <given-names>C</given-names></string-name> (<year>2017</year>). <article-title>Program evaluation and causal inference with high-dimensional data</article-title>. <source><italic>Econometrica</italic></source>, <volume>85</volume>(<issue>1</issue>): <fpage>233</fpage>–<lpage>298</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.3982/ECTA12723" xlink:type="simple">https://doi.org/10.3982/ECTA12723</ext-link></mixed-citation>
</ref>
<ref id="j_jds1118_ref_005">
<mixed-citation publication-type="other"> <string-name><surname>Benatia</surname> <given-names>D</given-names></string-name> (<year>2020</year>). Reaching new lows? The pandemic’s consequences for electricity markets. <italic>United States Association for Energy Economics Working</italic>.</mixed-citation>
</ref>
<ref id="j_jds1118_ref_006">
<mixed-citation publication-type="other"> <string-name><surname>Benatia</surname> <given-names>D</given-names></string-name>, <string-name><surname>de Villemeur</surname> <given-names>E</given-names></string-name> (<year>2019</year>). Strategic reneging in sequential imperfect markets. <italic>Center for Research in Economics and Statistics Working Papers</italic>, 19.</mixed-citation>
</ref>
<ref id="j_jds1118_ref_007">
<mixed-citation publication-type="other"> <string-name><surname>Bijnens</surname> <given-names>G</given-names></string-name>, <string-name><surname>Karimov</surname> <given-names>S</given-names></string-name>, <string-name><surname>Konings</surname> <given-names>J</given-names></string-name> (<year>2019</year>). Wage indexation and jobs. a machine learning approach. VIVES Discussion Paper (82).</mixed-citation>
</ref>
<ref id="j_jds1118_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>Burlig</surname> <given-names>F</given-names></string-name>, <string-name><surname>Knittel</surname> <given-names>C</given-names></string-name>, <string-name><surname>Rapson</surname> <given-names>D</given-names></string-name>, <string-name><surname>Reguant</surname> <given-names>M</given-names></string-name>, <string-name><surname>Wolfram</surname> <given-names>C</given-names></string-name> (<year>2020</year>). <article-title>Machine learning from schools about energy efficiency</article-title>. <source><italic>Journal of the Association of Environmental and Resource Economists</italic></source>, <volume>7</volume>(<issue>6</issue>): <fpage>1181</fpage>–<lpage>1217</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1086/710606" xlink:type="simple">https://doi.org/10.1086/710606</ext-link></mixed-citation>
</ref>
<ref id="j_jds1118_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>Cerqua</surname> <given-names>A</given-names></string-name>, <string-name><surname>Di Stefano R</surname> <given-names>LM</given-names></string-name>, <string-name><surname>Miccoli</surname> <given-names>S</given-names></string-name> (<year>2021</year>). <article-title>Local mortality estimates during the COVID-19 pandemic in Italy</article-title>. <source><italic>Journal of Population Economics</italic></source>, <volume>34</volume>: <fpage>1189</fpage>–<lpage>1217</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1007/s00148-021-00857-y" xlink:type="simple">https://doi.org/10.1007/s00148-021-00857-y</ext-link></mixed-citation>
</ref>
<ref id="j_jds1118_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Cerqua</surname> <given-names>A</given-names></string-name>, <string-name><surname>Letta</surname> <given-names>M</given-names></string-name> (<year>2022</year>). <article-title>Local inequalities of the COVID-19 crisis</article-title>. <source><italic>Regional Science and Urban Economics</italic></source>, <volume>92</volume>: <fpage>103752</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1016/j.regsciurbeco.2021.103752" xlink:type="simple">https://doi.org/10.1016/j.regsciurbeco.2021.103752</ext-link></mixed-citation>
</ref>
<ref id="j_jds1118_ref_011">
<mixed-citation publication-type="journal"> <string-name><surname>Hofman</surname> <given-names>J</given-names></string-name>, <string-name><surname>Watts</surname> <given-names>D</given-names></string-name>, <string-name><surname>Athey</surname> <given-names>S</given-names></string-name>, <string-name><surname>Garip</surname> <given-names>F</given-names></string-name>, <string-name><surname>Griffiths</surname> <given-names>T</given-names></string-name>, <string-name><surname>Kleinberg</surname> <given-names>J</given-names></string-name> (<year>2021</year>). <article-title>Integrating explanation and prediction in computational social science</article-title>. <source><italic>Nature</italic></source>, <volume>595</volume>(<issue>7866</issue>): <fpage>181</fpage>–<lpage>188</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1038/s41586-021-03659-0" xlink:type="simple">https://doi.org/10.1038/s41586-021-03659-0</ext-link></mixed-citation>
</ref>
<ref id="j_jds1118_ref_012">
<mixed-citation publication-type="other"> <string-name><surname>IBGE</surname></string-name> (<year>2022</year>). Prévia da população dos municípios com base nos dados do censo demográfico de 2022 coletados até o dia 25/12/2022. Rio de Janeiro: IBGE.</mixed-citation>
</ref>
<ref id="j_jds1118_ref_013">
<mixed-citation publication-type="other"> <string-name><surname>INEP</surname></string-name> (<year>2022</year>). Censo da educação superior 2022. Institute of Educational Studies and Research Anísio Teixeira.</mixed-citation>
</ref>
<ref id="j_jds1118_ref_014">
<mixed-citation publication-type="chapter"> <string-name><surname>Kotosz</surname> <given-names>B</given-names></string-name> (<year>2016</year>). <chapter-title>University impact evaluation: Counterfactual methods</chapter-title>. In: <source><italic>56th Congress of the European Regional Science Association</italic></source>. <publisher-name>European Regional Science Association</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1118_ref_015">
<mixed-citation publication-type="other"> <string-name><surname>Souza</surname> <given-names>M</given-names></string-name> (<year>2019</year>). Predictive counterfactuals for treatment effect heterogeneity in event studies with staggered adoption. <italic>Social Science Research Network Electronic Journal</italic>, <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.2139/ssrn.3484635" xlink:type="simple">https://doi.org/10.2139/ssrn.3484635</ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1118_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Svábová</surname> <given-names>L</given-names></string-name>, <string-name><surname>Kramárová</surname> <given-names>K</given-names></string-name>, <string-name><surname>Durica</surname> <given-names>M</given-names></string-name> (<year>2021</year>). <article-title>Evaluation of the effects of the graduate practice in Slovakia: comparison of results of counterfactual methods</article-title>. <source><italic>Central European Business Review</italic></source>, <volume>10</volume>(<issue>4</issue>): <fpage>1</fpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.18267/j.cebr.266" xlink:type="simple">https://doi.org/10.18267/j.cebr.266</ext-link></mixed-citation>
</ref>
<ref id="j_jds1118_ref_017">
<mixed-citation publication-type="journal"> <string-name><surname>Varian</surname> <given-names>HR</given-names></string-name> (<year>2014</year>). <article-title>Big data: new tricks for econometrics</article-title>. <source><italic>The Journal of Economic Perspectives</italic></source>, <volume>28</volume>(<issue>2</issue>): <fpage>3</fpage>–<lpage>28</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1257/jep.28.2.3" xlink:type="simple">https://doi.org/10.1257/jep.28.2.3</ext-link></mixed-citation>
</ref>
<ref id="j_jds1118_ref_018">
<mixed-citation publication-type="journal"> <string-name><surname>Varian</surname> <given-names>HR</given-names></string-name> (<year>2016</year>). <article-title>Causal inference in economics and marketing</article-title>. <source><italic>Proceedings of the National Academy of Sciences of the United States of America</italic></source>, <volume>113</volume>(<issue>27</issue>): <fpage>7310</fpage>–<lpage>7315</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1073/pnas.1510479113" xlink:type="simple">https://doi.org/10.1073/pnas.1510479113</ext-link></mixed-citation>
</ref>
<ref id="j_jds1118_ref_019">
<mixed-citation publication-type="journal"> <string-name><surname>Wager</surname> <given-names>S</given-names></string-name>, <string-name><surname>Athey</surname> <given-names>S</given-names></string-name> (<year>2018</year>). <article-title>Estimation and inference of heterogeneous treatment effects using random forests</article-title>. <source><italic>Journal of the American Statistical Association</italic></source>, <volume>113</volume>(<issue>523</issue>): <fpage>1228</fpage>–<lpage>1242</lpage>. <ext-link ext-link-type="doi" xlink:href="https://doi.org/10.1080/01621459.2017.1319839" xlink:type="simple">https://doi.org/10.1080/01621459.2017.1319839</ext-link></mixed-citation>
</ref>
</ref-list>
</back>
</article>
