2019
DOI: 10.1007/s11004-019-09813-9
|View full text |Cite
|
Sign up to set email alerts
|

The Effect of Splitting of Raw Data into Training and Test Subsets on the Accuracy of Predicting Spatial Distribution by a Multilayer Perceptron

Abstract: The paper discusses the influence of various methods for splitting raw data into test and training subsets on the accuracy of the prediction of the spatial distribution of the variable for the model based on a multilayer perceptron. A comparison of models was performed taking into account both spatial heterogeneity and the spread of values of the modeled variable with a completely random splitting option. The study was based on the results obtained during soil screening of urbanized territories in the Russian … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 12 publications
(2 citation statements)
references
References 34 publications
0
2
0
Order By: Relevance
“…Motivated by the necessity to leverage non-i.i.d. samples in practical applications, and evidence that model's performance is affected by spatial correlation [16,17], the statistical community devised new error estimation methods using the spatial coordinates of the samples: h-block leave-one-out (1995). Developed for time-series data (i.e., data showing temporal dependency), the h-block leave-oneout method is based on the principle that stationary processes achieve a correlation length (the "h") after which the samples are not correlated.…”
Section: Introductionmentioning
confidence: 99%
“…Motivated by the necessity to leverage non-i.i.d. samples in practical applications, and evidence that model's performance is affected by spatial correlation [16,17], the statistical community devised new error estimation methods using the spatial coordinates of the samples: h-block leave-one-out (1995). Developed for time-series data (i.e., data showing temporal dependency), the h-block leave-oneout method is based on the principle that stationary processes achieve a correlation length (the "h") after which the samples are not correlated.…”
Section: Introductionmentioning
confidence: 99%
“…Motivated by the necessity to leverage non-i.i.d. samples in practical applications, and evidence that model's performance is affected by spatial correlation [16,17], the statistical community devised new error estimation methods using the spatial coordinates of the samples: h-block leave-one-out (1995). Developed for time-series data (i.e.…”
Section: Introductionmentioning
confidence: 99%