Journal of Data Science logo


Login Register

  1. Home
  2. Issues
  3. Volume 10, Issue 3 (2012)
  4. Direct and Unbiased Multiple Imputation ...

Journal of Data Science

Submit your article Information
  • Article info
  • More
    Article info

Direct and Unbiased Multiple Imputation Methods for Missing Values of Categorical Variables
Volume 10, Issue 3 (2012), pp. 465–481
Yuanhui Xiao   Ruiguang Song   Mi Chen  

Authors

 
Placeholder
https://doi.org/10.6339/JDS.201207_10(3).0007
Pub. online: 4 August 2022      Type: Research Article      Open accessOpen Access

Published
4 August 2022

Abstract

Abstract: Missing data is a common problem in statistical analyses. To make use of information in data with incomplete observation, missing values can be imputed so that standard statistical methods can be used to analyze the data. Variables with missing values are often categorical and the miss ing pattern may not be monotone. Currently, commonly used imputation methods for data with a non-monotone missing pattern do not allow di rect inclusion of categorical variables. Categorical variables are converted to numerical variables before imputation. For many applications, the imputed numerical values for those categorical variables must then be converted back to categorical values. However, this conversion introduces bias which can seriously affect subsequent analyses. In this paper, we propose two direct imputation methods for categorical variables with a non-monotone missing pattern: the direct imputation approach incorporated with the expectation maximization algorithm and the direct imputation approach incorporated with a new algorithm: the imputation-maximization algorithm. Simulation studies show that both methods perform better than the method using vari able conversion. An application to real data is provided to compare the direct imputation method and the method using variable conversion.

PDF XML
PDF XML

Copyright
No copyright data available.

Keywords
bias categorical variable HIV

Metrics
since February 2021
652

Article info
views

435

PDF
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

Journal of data science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X

About

  • About journal

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer

Contact us

  • JDS@ruc.edu.cn
  • No. 59 Zhongguancun Street, Haidian District Beijing, 100872, P.R. China
Powered by PubliMill  •  Privacy policy