Journal of Data Science

Effect Size Estimation and Misclassification Rate Based Variable Selection in Linear Discriminant Analysis
Volume 11, Issue 3 (2013), pp. 537–558
Bernd Klaus  

https://doi.org/10.6339/JDS.2013.11(3).1185
Pub. online: 4 August 2022      Type: Research Article      Open Access

Abstract

Supervised classification of biological samples based on genetic information (e.g., gene expression profiles) is an important problem in biostatistics. To find classification rules that are both accurate and interpretable, variable selection is indispensable. This article explores how an assessment of the individual importance of variables (effect size estimation) can be used to perform variable selection. I review recent effect size estimation approaches in the context of linear discriminant analysis (LDA) and propose a new, conceptually simple effect size estimation method that is also computationally efficient. I then show how to use effect sizes to perform variable selection based on the misclassification rate, which is the data-independent expectation of the prediction error. Simulation studies and real data analyses illustrate that the proposed effect size estimation and variable selection methods are competitive. In particular, they lead to feature sets that are both compact and interpretable. Program files implementing the variable selection approaches presented in this article, for use with the statistical software R, are available from my homepage: http://b-klaus.de.

Keywords
Correlation-adjusted t-score, effect size estimation, misclassification rate

Journal of Data Science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X
