<?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.0 20120330//EN" "JATS-journalpublishing1.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">JDS</journal-id>
<journal-title-group><journal-title>Journal of Data Science</journal-title></journal-title-group>
<issn pub-type="epub">1683-8602</issn><issn pub-type="ppub">1680-743X</issn><issn-l>1680-743X</issn-l>
<publisher>
<publisher-name>School of Statistics, Renmin University of China</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">JDS1024</article-id>
<article-id pub-id-type="doi">10.6339/21-JDS1024</article-id>
<article-categories><subj-group subj-group-type="heading">
<subject>Statistical Data Science</subject></subj-group></article-categories>
<title-group>
<article-title>Hypothesis Testing for Hierarchical Structures in Cognitive Diagnosis Models</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Ma</surname><given-names>Chenchen</given-names></name><xref ref-type="aff" rid="j_jds1024_aff_001">1</xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Xu</surname><given-names>Gongjun</given-names></name><email xlink:href="mailto:gongjun@umich.edu">gongjun@umich.edu</email><xref ref-type="aff" rid="j_jds1024_aff_001">1</xref><xref ref-type="corresp" rid="cor1">∗</xref>
</contrib>
<aff id="j_jds1024_aff_001"><label>1</label>Department of Statistics, <institution>University of Michigan</institution>, West Hall, 1085 S University Ave, Ann Arbor, MI 48109, <country>USA</country></aff>
</contrib-group>
<author-notes>
<corresp id="cor1"><label>∗</label>Corresponding author. Email: <ext-link ext-link-type="uri" xlink:href="mailto:gongjun@umich.edu">gongjun@umich.edu</ext-link>.</corresp>
</author-notes>
<pub-date pub-type="ppub"><year>2022</year></pub-date><pub-date pub-type="epub"><day>14</day><month>10</month><year>2021</year></pub-date><volume>20</volume><issue>3</issue><fpage>279</fpage><lpage>302</lpage><supplementary-material id="S1" content-type="document" xlink:href="jds1024_s001.pdf" mimetype="application" mime-subtype="pdf">
<caption>
<title>Supplementary Material</title>
<p>More comprehensive simulation results are presented in the supplementary material. Specifically, bootstrap results for DINA and GDINA models under both null hypothesis and alternative hypothesis with different sample sizes and noise levels are plotted there. We also include the codes for simulations and real data analysis.</p>
</caption>
</supplementary-material><history><date date-type="received"><day>2</day><month>6</month><year>2021</year></date><date date-type="accepted"><day>11</day><month>9</month><year>2021</year></date></history>
<permissions><copyright-statement>2022 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.</copyright-statement><copyright-year>2022</copyright-year>
<license license-type="open-access" xlink:href="https://creativecommons.org/licenses/by/4.0/">
<license-p>Open access article under the <ext-link ext-link-type="uri" xlink:href="https://creativecommons.org/licenses/by/4.0/">CC BY</ext-link> license.</license-p></license></permissions>
<abstract>
<p>Cognitive Diagnosis Models (CDMs) are a special family of discrete latent variable models widely used in educational, psychological and social sciences. In many applications of CDMs, certain hierarchical structures among the latent attributes are assumed by researchers to characterize their dependence structure. Specifically, a directed acyclic graph is used to specify hierarchical constraints on the allowable configurations of the discrete latent attributes. In this paper, we consider the important yet unaddressed problem of testing the existence of latent hierarchical structures in CDMs. We first introduce the concept of testability of hierarchical structures in CDMs and present sufficient conditions. Then we study the asymptotic behaviors of the likelihood ratio test (LRT) statistic, which is widely used for testing nested models. Due to the irregularity of the problem, the asymptotic distribution of LRT becomes nonstandard and tends to provide unsatisfactory finite sample performance under practical conditions. We provide statistical insights on such failures, and propose to use parametric bootstrap to perform the testing. We also demonstrate the effectiveness and superiority of parametric bootstrap for testing the latent hierarchies over non-parametric bootstrap and the naïve Chi-squared test through comprehensive simulations and an educational assessment dataset.</p>
</abstract>
<kwd-group>
<label>Keywords</label>
<kwd>bootstrapping</kwd>
<kwd>latent hierarchical structure</kwd>
<kwd>likelihood ratio test</kwd>
</kwd-group>
<funding-group><award-group><funding-source xlink:href="https://doi.org/10.13039/100000001">National Science Foundation</funding-source><award-id>CAREER SES-1846747</award-id></award-group><award-group><funding-source xlink:href="https://doi.org/10.13039/100005246">Institute of Education Sciences</funding-source><award-id>R305D200015</award-id></award-group><funding-statement>This research is partially supported by National Science Foundation CAREER SES-1846747 and Institute of Education Sciences R305D200015. </funding-statement></funding-group>
</article-meta>
</front>
<body/>
<back>
<ref-list id="j_jds1024_reflist_001">
<title>References</title>
<ref id="j_jds1024_ref_001">
<mixed-citation publication-type="journal"> <string-name><surname>Chen</surname> <given-names>J</given-names></string-name> (<year>2017</year>). <article-title>On finite mixture models</article-title>. <source>Statistical Theory and Related Fields</source>, <volume>1</volume>(<issue>1</issue>): <fpage>15</fpage>–<lpage>27</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_002">
<mixed-citation publication-type="journal"> <string-name><surname>Chen</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Moustaki</surname> <given-names>I</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>H</given-names></string-name> (<year>2020</year>). <article-title>A note on likelihood ratio tests for models with latent variables</article-title>. <source>Psychometrika</source>, <volume>85</volume>: <fpage>1</fpage>–<lpage>17</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_003">
<mixed-citation publication-type="journal"> <string-name><surname>Dahlgren</surname> <given-names>MA</given-names></string-name>, <string-name><surname>Hult</surname> <given-names>H</given-names></string-name>, <string-name><surname>Dahlgren</surname> <given-names>LO</given-names></string-name>, <string-name><surname>af Segerstad</surname> <given-names>HH</given-names></string-name>, <string-name><surname>Johansson</surname> <given-names>K</given-names></string-name> (<year>2006</year>). <article-title>From senior student to novice worker: Learning trajectories in political science, psychology and mechanical engineering</article-title>. <source>Studies in Higher Education</source>, <volume>31</volume>(<issue>5</issue>): <fpage>569</fpage>–<lpage>586</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_004">
<mixed-citation publication-type="journal"> <string-name><surname>de la Torre</surname> <given-names>J</given-names></string-name> (<year>2011</year>). <article-title>The generalized DINA model framework</article-title>. <source>Psychometrika</source>, <volume>76</volume>(<issue>2</issue>): <fpage>179</fpage>–<lpage>199</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_005">
<mixed-citation publication-type="journal"> <string-name><surname>de la Torre</surname> <given-names>J</given-names></string-name>, <string-name><surname>van der Ark</surname> <given-names>LA</given-names></string-name>, <string-name><surname>Rossi</surname> <given-names>G</given-names></string-name> (<year>2018</year>). <article-title>Analysis of clinical data from a cognitive diagnosis modeling framework</article-title>. <source>Measurement and Evaluation in Counseling and Development</source>, <volume>51</volume>(<issue>4</issue>): <fpage>281</fpage>–<lpage>296</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_006">
<mixed-citation publication-type="chapter"> <string-name><surname>DiBello</surname> <given-names>LV</given-names></string-name>, <string-name><surname>Stout</surname> <given-names>WF</given-names></string-name>, <string-name><surname>Roussos</surname> <given-names>LA</given-names></string-name> (<year>1995</year>). <chapter-title>Unified cognitive/psychometric diagnostic assessment likelihood-based classification techniques</chapter-title>. In: Edited by <string-name><given-names>Paul D.</given-names> <surname>Nichols</surname></string-name>, <string-name><given-names>Susan F.</given-names> <surname>Chipman</surname></string-name>, <string-name><given-names>Robert L.</given-names> <surname>Brennan</surname></string-name>, <source>Cognitively Diagnostic Assessment</source>, <fpage>361</fpage>–<lpage>389</lpage>. <publisher-name>Routledge</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_007">
<mixed-citation publication-type="journal"> <string-name><surname>Efron</surname> <given-names>B</given-names></string-name> (<year>1979</year>). <article-title>Bootstrap methods: Another look at the Jackknife</article-title>. <source>The Annals of Statistics</source>, <volume>7</volume>(<issue>1</issue>): <fpage>1</fpage>–<lpage>26</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_008">
<mixed-citation publication-type="journal"> <string-name><surname>George</surname> <given-names>AC</given-names></string-name>, <string-name><surname>Robitzsch</surname> <given-names>A</given-names></string-name> (<year>2015</year>). <article-title>Cognitive diagnosis models in R: A didactic</article-title>. <source>The Quantitative Methods for Psychology</source>, <volume>11</volume>(<issue>3</issue>): <fpage>189</fpage>–<lpage>205</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_009">
<mixed-citation publication-type="journal"> <string-name><surname>Gu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>G</given-names></string-name> (<year>2019</year>). <article-title>Learning attribute patterns in high-dimensional structured latent attribute models</article-title>. <source>Journal of Machine Learning Research</source>, <volume>20</volume>(<issue>2019</issue>): <fpage>1</fpage>–<lpage>58</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_010">
<mixed-citation publication-type="journal"> <string-name><surname>Gu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>G</given-names></string-name> (<year>2020</year>). <article-title>Partial identifiability of restricted latent class models</article-title>. <source>Annals of Statistics</source>, <volume>48</volume>(<issue>4</issue>): <fpage>2082</fpage>–<lpage>2107</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_011">
<mixed-citation publication-type="other"> <string-name><surname>Gu</surname> <given-names>Y</given-names></string-name>, <string-name><surname>Xu</surname> <given-names>G</given-names></string-name> (2021). Identifiability of hierarchical latent attribute models. <italic>arXiv preprint</italic> <ext-link ext-link-type="uri" xlink:href="http://arxiv.org/abs/"><italic>arXiv:1906.07869</italic></ext-link>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_012">
<mixed-citation publication-type="journal"> <string-name><surname>Haertel</surname> <given-names>EH</given-names></string-name> (<year>1989</year>). <article-title>Using restricted latent class models to map the skill structure of achievement items</article-title>. <source>Journal of Educational Measurement</source>, <volume>26</volume>(<issue>4</issue>): <fpage>301</fpage>–<lpage>321</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_013">
<mixed-citation publication-type="journal"> <string-name><surname>Henson</surname> <given-names>RA</given-names></string-name>, <string-name><surname>Templin</surname> <given-names>JL</given-names></string-name>, <string-name><surname>Willse</surname> <given-names>JT</given-names></string-name> (<year>2009</year>). <article-title>Defining a family of cognitive diagnosis models using log-linear models with latent variables</article-title>. <source>Psychometrika</source>, <volume>74</volume>(<issue>2</issue>): <fpage>191</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_014">
<mixed-citation publication-type="journal"> <string-name><surname>Jimoyiannis</surname> <given-names>A</given-names></string-name>, <string-name><surname>Komis</surname> <given-names>V</given-names></string-name> (<year>2001</year>). <article-title>Computer simulations in physics teaching and learning: A case study on students’ understanding of trajectory motion</article-title>. <source>Computers &amp; Education</source>, <volume>36</volume>(<issue>2</issue>): <fpage>183</fpage>–<lpage>204</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_015">
<mixed-citation publication-type="journal"> <string-name><surname>Junker</surname> <given-names>BW</given-names></string-name>, <string-name><surname>Sijtsma</surname> <given-names>K</given-names></string-name> (<year>2001</year>). <article-title>Cognitive assessment models with few assumptions, and connections with nonparametric item response theory</article-title>. <source>Applied Psychological Measurement</source>, <volume>25</volume>(<issue>3</issue>): <fpage>258</fpage>–<lpage>272</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_016">
<mixed-citation publication-type="journal"> <string-name><surname>Leighton</surname> <given-names>JP</given-names></string-name>, <string-name><surname>Gierl</surname> <given-names>MJ</given-names></string-name>, <string-name><surname>Hunka</surname> <given-names>SM</given-names></string-name> (<year>2004</year>). <article-title>The attribute hierarchy method for cognitive assessment: A variation on Tatsuoka’s rule-space approach</article-title>. <source>Journal of Educational Measurement</source>, <volume>41</volume>(<issue>3</issue>): <fpage>205</fpage>–<lpage>237</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_017">
<mixed-citation publication-type="journal"> <string-name><surname>Nylund</surname> <given-names>KL</given-names></string-name>, <string-name><surname>Asparouhov</surname> <given-names>T</given-names></string-name>, <string-name><surname>Muthén</surname> <given-names>BO</given-names></string-name> (<year>2007</year>). <article-title>Deciding on the number of classes in latent class analysis and growth mixture modeling: A Monte Carlo simulation study</article-title>. <source>Structural Equation Modeling: A Multidisciplinary Journal</source>, <volume>14</volume>(<issue>4</issue>): <fpage>535</fpage>–<lpage>569</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_018">
<mixed-citation publication-type="journal"> <string-name><surname>O’Brien</surname> <given-names>KL</given-names></string-name>, <string-name><surname>Baggett</surname> <given-names>HC</given-names></string-name>, <string-name><surname>Brooks</surname> <given-names>WA</given-names></string-name>, <string-name><surname>Feikin</surname> <given-names>DR</given-names></string-name>, <string-name><surname>Hammitt</surname> <given-names>LL</given-names></string-name>, <string-name><surname>Higdon</surname> <given-names>MM</given-names></string-name>, <etal>et al.</etal> (<year>2019</year>). <article-title>Causes of severe pneumonia requiring hospital admission in children without HIV infection from Africa and Asia: The PERCH multi-country case-control study</article-title>. <source>The Lancet</source>, <volume>394</volume>(<issue>10200</issue>): <fpage>757</fpage>–<lpage>779</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_019">
<mixed-citation publication-type="chapter"> <string-name><surname>Reckase</surname> <given-names>MD</given-names></string-name> (<year>2009</year>). <chapter-title>Multidimensional item response theory models</chapter-title>. In: <source>Multidimensional Item Response Theory</source>, <fpage>79</fpage>–<lpage>112</lpage>. <publisher-name>Springer</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_020">
<mixed-citation publication-type="journal"> <string-name><surname>Self</surname> <given-names>SG</given-names></string-name>, <string-name><surname>Liang</surname> <given-names>KY</given-names></string-name> (<year>1987</year>). <article-title>Asymptotic properties of maximum likelihood estimators and likelihood ratio tests under nonstandard conditions</article-title>. <source>Journal of the American Statistical Association</source>, <volume>82</volume>(<issue>398</issue>): <fpage>605</fpage>–<lpage>610</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_021">
<mixed-citation publication-type="journal"> <string-name><surname>Simon</surname> <given-names>MA</given-names></string-name>, <string-name><surname>Tzur</surname> <given-names>R</given-names></string-name> (<year>2004</year>). <article-title>Explicating the role of mathematical tasks in conceptual learning: An elaboration of the hypothetical learning trajectory</article-title>. <source>Mathematical Thinking and Learning</source>, <volume>6</volume>(<issue>2</issue>): <fpage>91</fpage>–<lpage>104</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_022">
<mixed-citation publication-type="journal"> <string-name><surname>Tatsuoka</surname> <given-names>KK</given-names></string-name> (<year>1983</year>). <article-title>Rule space: An approach for dealing with misconceptions based on item response theory</article-title>. <source>Journal of Educational Measurement</source>, <volume>20</volume>: <fpage>345</fpage>–<lpage>354</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_023">
<mixed-citation publication-type="chapter"> <string-name><surname>Tatsuoka</surname> <given-names>KK</given-names></string-name> (<year>1990</year>). <chapter-title>Toward an integration of item-response theory and cognitive error diagnosis</chapter-title>. In: Edited by <string-name><given-names>Norman</given-names> <surname>Frederiksen</surname></string-name>, <string-name><given-names>Robert</given-names> <surname>Glaser</surname></string-name>, <string-name><given-names>Alan</given-names> <surname>Lesgold</surname></string-name>, and <string-name><given-names>Michael G.</given-names> <surname>Shafto</surname></string-name>, <source>Diagnostic Monitoring of Skill and Knowledge Acquisition</source>, <fpage>453</fpage>–<lpage>488</lpage>. <publisher-name>Routledge</publisher-name>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_024">
<mixed-citation publication-type="journal"> <string-name><surname>Templin</surname> <given-names>J</given-names></string-name>, <string-name><surname>Bradshaw</surname> <given-names>L</given-names></string-name> (<year>2014</year>). <article-title>Hierarchical diagnostic classification models: A family of models for estimating and testing attribute hierarchies</article-title>. <source>Psychometrika</source>, <volume>79</volume>(<issue>2</issue>): <fpage>317</fpage>–<lpage>339</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_025">
<mixed-citation publication-type="journal"> <string-name><surname>Templin</surname> <given-names>JL</given-names></string-name>, <string-name><surname>Henson</surname> <given-names>RA</given-names></string-name> (<year>2006</year>). <article-title>Measurement of psychological disorders using cognitive diagnosis models</article-title>. <source>Psychological Methods</source>, <volume>11</volume>(<issue>3</issue>): <fpage>287</fpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_026">
<mixed-citation publication-type="journal"> <string-name><surname>von Davier</surname> <given-names>M</given-names></string-name> (<year>2005</year>). <article-title>A general diagnostic model applied to language testing data</article-title>. <source>ETS Research Report Series</source>, <volume>2005</volume>(<issue>2</issue>): <fpage>1</fpage>–<lpage>35</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_027">
<mixed-citation publication-type="journal"> <string-name><surname>Wang</surname> <given-names>C</given-names></string-name>, <string-name><surname>Gierl</surname> <given-names>MJ</given-names></string-name> (<year>2011</year>). <article-title>Using the attribute hierarchy method to make diagnostic inferences about examinees’ cognitive skills in critical reading</article-title>. <source>Journal of Educational Measurement</source>, <volume>48</volume>(<issue>2</issue>): <fpage>165</fpage>–<lpage>187</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_028">
<mixed-citation publication-type="journal"> <string-name><surname>Wu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Deloria-Knoll</surname> <given-names>M</given-names></string-name>, <string-name><surname>Hammitt</surname> <given-names>LL</given-names></string-name>, <string-name><surname>Zeger</surname> <given-names>SL</given-names></string-name> (<collab>for Child Health Core Team PER</collab>) (<year>2016</year>a). <article-title>Partially latent class models for case–control studies of childhood pneumonia etiology</article-title>. <source>Journal of the Royal Statistical Society: Series C (Applied Statistics)</source>, <volume>65</volume>(<issue>1</issue>): <fpage>97</fpage>–<lpage>114</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_029">
<mixed-citation publication-type="journal"> <string-name><surname>Wu</surname> <given-names>Z</given-names></string-name>, <string-name><surname>Deloria-Knoll</surname> <given-names>M</given-names></string-name>, <string-name><surname>Zeger</surname> <given-names>SL</given-names></string-name> (<year>2016</year>b). <article-title>Nested partially latent class models for dependent binary data; estimating disease etiology</article-title>. <source>Biostatistics</source>, <volume>18</volume>(<issue>2</issue>): <fpage>200</fpage>–<lpage>213</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_030">
<mixed-citation publication-type="journal"> <string-name><surname>Xu</surname> <given-names>G</given-names></string-name> (<year>2017</year>). <article-title>Identifiability of restricted latent class models with binary responses</article-title>. <source>The Annals of Statistics</source>, <volume>45</volume>(<issue>2</issue>): <fpage>675</fpage>–<lpage>707</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_031">
<mixed-citation publication-type="journal"> <string-name><surname>Xu</surname> <given-names>G</given-names></string-name>, <string-name><surname>Shang</surname> <given-names>Z</given-names></string-name> (<year>2018</year>). <article-title>Identifying latent structures in restricted latent class models</article-title>. <source>Journal of the American Statistical Association</source>, <volume>113</volume>(<issue>523</issue>): <fpage>1284</fpage>–<lpage>1295</lpage>.</mixed-citation>
</ref>
<ref id="j_jds1024_ref_032">
<mixed-citation publication-type="journal"> <string-name><surname>Xu</surname> <given-names>G</given-names></string-name>, <string-name><surname>Zhang</surname> <given-names>S</given-names></string-name> (<year>2016</year>). <article-title>Identifiability of diagnostic classification models</article-title>. <source>Psychometrika</source>, <volume>81</volume>(<issue>3</issue>): <fpage>625</fpage>–<lpage>649</lpage>.</mixed-citation>
</ref>
</ref-list>
</back>
</article>
