2016
DOI: 10.1371/journal.pone.0155119
|View full text |Cite
|
Sign up to set email alerts
|

Multilevel Weighted Support Vector Machine for Classification on Healthcare Data with Missing Values

Abstract: This work is motivated by the needs of predictive analytics on healthcare data as represented by Electronic Medical Records. Such data is invariably problematic: noisy, with missing entries, with imbalance in classes of interests, leading to serious bias in predictive modeling. Since standard data mining methods often produce poor performance measures, we argue for development of specialized techniques of data-preprocessing and classification. In this paper, we propose a new method to simultaneously classify l… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
30
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
5
4
1

Relationship

1
9

Authors

Journals

citations
Cited by 66 publications
(34 citation statements)
references
References 45 publications
1
30
0
Order By: Relevance
“…SVM has been used across diverse biomedical classification problems. This includes a patient financial risk model using health claims and clinical encounter data, and a patient response to flu awareness campaign model, both using weighted SVM [10]. A project comparing various machine learning techniques with Logistic Regression for prediction of heart disease also shows no significant difference between Logistic Regression and SVM, with the Linear kernel performing best [11].…”
Section: A Svm Application To Disease Classificationmentioning
confidence: 99%
“…SVM has been used across diverse biomedical classification problems. This includes a patient financial risk model using health claims and clinical encounter data, and a patient response to flu awareness campaign model, both using weighted SVM [10]. A project comparing various machine learning techniques with Logistic Regression for prediction of heart disease also shows no significant difference between Logistic Regression and SVM, with the Linear kernel performing best [11].…”
Section: A Svm Application To Disease Classificationmentioning
confidence: 99%
“…Our work lies in the medical pattern recognition framework, which is known to be highly imbalanced, i.e., the instances of interest in the dataset are relatively rare. We discuss the hardness of classification problems with healthcare imbalanced data with missing values in [44]. Examples are Intense Care Unit (ICU) infection detection events [46], medical diagnoses [34], adverse drug events [51,48], bleeding detection in endoscopic video [17], and so on.…”
Section: Introductionmentioning
confidence: 99%
“…One simple form this method can take is to store all experienced items in a representational space with a similarity gradient around each for use in future prediction or classification. This is exemplified in recent kernel methods, which use a variety of similarity metrics and often remove redundant stored items to provide a more efficient representation [1]; for example, support vector machines use kernel functions to draw decision boundaries between categories [2], often resulting in highly accurate classification performance [3][4][5].…”
Section: Spatial Methodsmentioning
confidence: 99%