2020
DOI: 10.1111/ejss.12893
|View full text |Cite
|
Sign up to set email alerts
|

Synthetic resampling strategies and machine learning for digital soil mapping in Iran

Abstract: Most common machine learning (ML) algorithms usually work well on balanced training sets, that is, datasets in which all classes are approximately represented equally. Otherwise, the accuracy estimates may be unreliable and classes with only a few values are often misclassified or neglected. This is known as a class imbalance problem in machine learning and datasets that do not meet this criterion are referred to as imbalanced data. Most datasets of soil classes are, therefore, imbalanced data. One of our main… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
28
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 47 publications
(29 citation statements)
references
References 48 publications
1
28
0
Order By: Relevance
“…Therefore, the system cannot capture enough information on the transitions between health states to improve forecasting horizon while ensuring good prediction accuracy. Most common machine learning algorithms usually work well on balanced training sets, that is, datasets in which all classes are approximately represented equally [50]. Because these algorithms treat all misclassifications equally, they bias classes with many instances, resulting in false accuracy estimates.…”
Section: F Classification Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…Therefore, the system cannot capture enough information on the transitions between health states to improve forecasting horizon while ensuring good prediction accuracy. Most common machine learning algorithms usually work well on balanced training sets, that is, datasets in which all classes are approximately represented equally [50]. Because these algorithms treat all misclassifications equally, they bias classes with many instances, resulting in false accuracy estimates.…”
Section: F Classification Resultsmentioning
confidence: 99%
“…Because these algorithms treat all misclassifications equally, they bias classes with many instances, resulting in false accuracy estimates. Therefore, the accuracy estimates may be unreliable and classes with only a few values are often misclassified or neglected [50]. This issue is known as a class imbalance problem in machine learning.…”
Section: F Classification Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…About 80% of the case studies used at least one tree-based algorithm such as regression tree (e.g. Taghizadeh-Mehrjardi et al, 2019b;Heung et al, 2016), random forest (e.g. Häring et al, 2012) 2017; Ramcharan et al (2018).…”
Section: Machine Learning Modelsmentioning
confidence: 99%