Support Vector Machines Classification on Class Imbalanced Data: A Case Study with Real Medical Data

Drosou, Krystallenia; Georgiou, Stelios; Koukouvinos, Christos; Stylianou, Stella

doi:10.6339/JDS.201410_12(4).0009

Journal of Data Science

Volume 12, Issue 4 (2014), pp. 727–754

Pub. online: 4 August 2022

Type: Research Article

Open Access

Abstract

Support vector machines (SVMs) are among the most popular and powerful classification methods. However, their performance can be limited on highly imbalanced datasets. A classifier trained on an imbalanced dataset can be biased towards the majority class and yield a high misclassification rate for the minority class. In many applications, especially medical diagnosis, it is important to distinguish accurately between false negative and false positive results. The purpose of this study is to evaluate the performance of a classifier while keeping an appropriate balance between sensitivity and specificity, in support of trauma outcome prediction. We compare the standard (or classic) SVM (C SVM) with resampling methods and a cost-sensitive method called Two Cost SVM (TC SVM), which are widely accepted strategies for imbalanced datasets, and we discuss the results in terms of sensitivity analysis and receiver operating characteristic (ROC) curves.
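
To illustrate why the abstract stresses the balance between sensitivity and specificity rather than overall accuracy, here is a minimal sketch in plain Python. The function name `sensitivity_specificity`, the label convention (1 = minority class) and the toy data are assumptions made for illustration; they are not taken from the study's trauma data or code.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(y_true: Sequence[int],
                            y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels.

    Assumed convention: 1 = positive (minority) class, 0 = negative class.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Hypothetical toy data: 95 majority-class cases and 5 minority-class cases.
y_true = [0] * 95 + [1] * 5
# A degenerate classifier that always predicts the majority class.
y_pred = [0] * 100

accuracy = sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)
sens, spec = sensitivity_specificity(y_true, y_pred)
print(f"accuracy={accuracy:.2f} sensitivity={sens:.2f} specificity={spec:.2f}")
```

On this toy data the always-majority classifier looks accurate overall while detecting none of the minority cases, which is the kind of bias that resampling and cost-sensitive training aim to reduce.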

Keywords

class imbalance; support vector machines; cost-sensitive learning
