2000
DOI: 10.1016/s0004-3702(99)00094-6
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised stratification of cross-validation for accuracy estimation

Abstract: The rapid development of new learning algorithms increases the need for improved accuracy estimation methods. Moreover, methods allowing the comparison of several different learning algorithms are important for the performance evaluation of new ones. In this paper we propose new accuracy estimation methods which are extensions of the k-fold cross-validation method. The methods proposed construct cross-validation folds deterministically instead of using the random sampling approach. The deterministic constructi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
69
0

Year Published

2009
2009
2023
2023

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 149 publications
(78 citation statements)
references
References 16 publications
0
69
0
Order By: Relevance
“…For most of the listed techniques, this procedure occurs several times on each of these training-validation set pairs. A common approach when partitioning the data into training and validation set is the use of stratification [246]. In stratified validation, the sets have the same fraction of labels as the data of origin.…”
Section: Machine Learning Performance Evaluationmentioning
confidence: 99%
“…For most of the listed techniques, this procedure occurs several times on each of these training-validation set pairs. A common approach when partitioning the data into training and validation set is the use of stratification [246]. In stratified validation, the sets have the same fraction of labels as the data of origin.…”
Section: Machine Learning Performance Evaluationmentioning
confidence: 99%
“…However, results of this procedure can be conceived as indicators of a relative performance or otherwise as an optimistic estimate of the hydrological members' selection process (Diamantidis et al, 2000). Figure 2 shows the generalization or test methodology of the hydrological members' selection at two levels: the local focuses on the extrapolation of results to different FTH within the same catchment and another named regional, while the regional level tests the temporal and spatial performance in nearby catchments, or under a broader perspective on the integration of regional results.…”
Section: Generalization Test Methodologymentioning
confidence: 99%
“…The cross-validation process is then repeated k times; each one of the k sub-samples is used exactly once as the validation data. The average of the k results from the k-folds gives the KCV test accuracy of the algorithm [30]- [31] . Our k-fold crossvalidation is a 10-fold cross-validation.…”
Section: Performance Metricsmentioning
confidence: 99%