2004
DOI: 10.1007/978-3-540-27868-9_76
|View full text |Cite
|
Sign up to set email alerts
|

Clustering Based Under-Sampling for Improving Speaker Verification Decisions Using AdaBoost

Abstract: The class imbalance problem naturally occurs in some classification problems where the amount of training samples available for one class may be much less than that of another. In order to deal with this problem, random sampling based methods are generally used. This paper proposes a clustering based sampling technique to select a subset from the majority class involving much larger amount of training data. The proposed approach is verified in designing a post-classifier using AdaBoost to improve the speaker v… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
15
0

Year Published

2008
2008
2020
2020

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 19 publications
(15 citation statements)
references
References 7 publications
0
15
0
Order By: Relevance
“…Besides, Liu et al [27] also presented a weighted rough set method for this problem. However, all of these techniques have some disadvantages [2]. For instance, the computational load is increased and overtraining may occur due to replicated samples in the case of over-sampling.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Besides, Liu et al [27] also presented a weighted rough set method for this problem. However, all of these techniques have some disadvantages [2]. For instance, the computational load is increased and overtraining may occur due to replicated samples in the case of over-sampling.…”
Section: Introductionmentioning
confidence: 99%
“…The first group involves five approaches: (1) under-sampling, a method in which the minority population is kept intact, while the majority population is under-sampled; (2) over-sampling, methods in which the minority examples are over-sampled so that the desired class distribution is obtained in the training set [6,11,19]; (3) cluster based sampling, methods in which the representative examples are randomly sampled from clusters [2]; (4) moving the decision threshold, methods in which the researcher tries to adapt the decision thresholds to impose bias on the minority class [11,21,24] and (5) adjust costs matrices, a method in which the prediction accuracy is improved by adjusting the cost (weight) for each class [15]. Besides, Liu et al [27] also presented a weighted rough set method for this problem.…”
Section: Introductionmentioning
confidence: 99%
“…Altincay and Ergun [2] used this idea for improving speaker verification decisions by using AdaBoost and by using the k-means approach as a clustering algorithm. They claimed that their approach displayed better performance than the random under-sampling method.…”
Section: Related Workmentioning
confidence: 99%
“…Altmcay et al [18] proposed cluster based synthetic sample creation techniques to under-sample the majority class. They divide the majority class samples into N number of clusters, where N is the number of minority class samples in the dataset.…”
Section: Related Workmentioning
confidence: 99%