Hypothesis Testing for Hierarchical Structures in Cognitive Diagnosis Models
Volume 20, Issue 3 (2022): Special Issue: Data Science Meets Social Sciences, pp. 279–302
Pub. online: 14 October 2021
Type: Statistical Data Science
Open Access
Received
2 June 2021
2 June 2021
Accepted
11 September 2021
11 September 2021
Published
14 October 2021
14 October 2021
Abstract
Cognitive Diagnosis Models (CDMs) are a special family of discrete latent variable models widely used in educational, psychological and social sciences. In many applications of CDMs, certain hierarchical structures among the latent attributes are assumed by researchers to characterize their dependence structure. Specifically, a directed acyclic graph is used to specify hierarchical constraints on the allowable configurations of the discrete latent attributes. In this paper, we consider the important yet unaddressed problem of testing the existence of latent hierarchical structures in CDMs. We first introduce the concept of testability of hierarchical structures in CDMs and present sufficient conditions. Then we study the asymptotic behaviors of the likelihood ratio test (LRT) statistic, which is widely used for testing nested models. Due to the irregularity of the problem, the asymptotic distribution of LRT becomes nonstandard and tends to provide unsatisfactory finite sample performance under practical conditions. We provide statistical insights on such failures, and propose to use parametric bootstrap to perform the testing. We also demonstrate the effectiveness and superiority of parametric bootstrap for testing the latent hierarchies over non-parametric bootstrap and the naïve Chi-squared test through comprehensive simulations and an educational assessment dataset.
Supplementary material
Supplementary MaterialMore comprehensive simulation results are presented in the supplementary material. Specifically, bootstrap results for DINA and GDINA models under both null hypothesis and alternative hypothesis with different sample sizes and noise levels are plotted there. We also include the codes for simulations and real data analysis.
References
Gu Y, Xu G (2021). Identifiability of hierarchical latent attribute models. arXiv preprint arXiv:1906.07869.