IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222)
DOI: 10.1109/ijcnn.2001.938744
|View full text |Cite
|
Sign up to set email alerts
|

Classification of unbalanced data with transparent kernels

Abstract: Two important issues regarding data driven classification are addressed here: Model interpretation and imbalanced data. The aim is to build data driven classifiers that provide good predictive performance for a set of imbalanced data and enhance the understanding of a model by enabling input/output dependencies that exist to be visualised. Here, the classification method is demonstrated on an imbalanced data set that describes fatigue crack initiation in automotive camshafts. To generate interpretable models, … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
13
0
1

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 13 publications
(14 citation statements)
references
References 8 publications
0
13
0
1
Order By: Relevance
“…The first two real data sets were taken from [50], while the third real data set was from [51]. These three real data sets were chosen in the order of increasing imbalance.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The first two real data sets were taken from [50], while the third real data set was from [51]. These three real data sets were chosen in the order of increasing imbalance.…”
Section: Resultsmentioning
confidence: 99%
“…ADI data set: The austempered ductile iron (ADI) material data set was obtained from a study on fatigue cracks from the graphite nodules within the microstructure in an automotive camshaft application [51]. This two-class data set contained 2923 instances in the feature space of dimension m = 9, with 2807 negative instances and 116 positive instances.…”
Section: Resultsmentioning
confidence: 99%
“…The austempered ductile iron (ADI) material data set for automotive camshaft application [43] is used to study why fatigue cracks are initiated from the graphite nodules within the microstructure. There are nine features and two class labels ("crack" and "no crack").…”
Section: Examplementioning
confidence: 99%
“…The data set is very imbalanced with a total of 2923 samples in which 116 samples are "crack class," and 2807 samples are "no crack class." A cost-sensitive support vector machine (CS-SVM) and a cost-sensitive SUPANOVA model [43] were applied to investigate the data set [43]. The cost-sensitive SUPANOVA model used one-norm regularization to derive a reduced model set trading model interpretability with classification performance.…”
Section: Examplementioning
confidence: 99%
See 1 more Smart Citation