2011
DOI: 10.1007/978-3-642-20520-0_13
|View full text |Cite
|
Sign up to set email alerts
|

A Preliminary Investigation of Overfitting in Evolutionary Driven Model Induction: Implications for Financial Modelling

Abstract: Abstract. This paper investigates the effects of early stopping as a method to counteract overfitting in evolutionary data modelling using Genetic Programming. Early stopping has been proposed as a method to avoid model overtraining, which has been shown to lead to a significant degradation of out-of-sample performance. If we assume some sort of performance metric maximisation, the most widely used early training stopping criterion is the moment within the learning process that an unbiased estimate of the perf… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

1
7
0

Year Published

2011
2011
2018
2018

Publication Types

Select...
6
1

Relationship

3
4

Authors

Journals

citations
Cited by 14 publications
(8 citation statements)
references
References 9 publications
(6 reference statements)
1
7
0
Order By: Relevance
“…It can be seen that the model does not overfit the data. Comparison with runs using all the available data for training achieved similar results, suggesting that the use of a 2-set methodology neither hinders nor improves the performance of the obtained models, confirming previous results reported in GP [5] and GE [23]. The best (D1 + N2) model is shown in Eq.…”
Section: Results and Analysissupporting
confidence: 83%
“…It can be seen that the model does not overfit the data. Comparison with runs using all the available data for training achieved similar results, suggesting that the use of a 2-set methodology neither hinders nor improves the performance of the obtained models, confirming previous results reported in GP [5] and GE [23]. The best (D1 + N2) model is shown in Eq.…”
Section: Results and Analysissupporting
confidence: 83%
“…In Fig. 8.3(b) we can see an explosion in the test error towards the end of the run [23], which contrasts with the low value of the training error.…”
Section: Resultsmentioning
confidence: 84%
“…Table 8.1 shows results of interest with respect to the fitness as evaluated on the validation and test dataset, for 9 runs. It shows that stopping evolution before the specified number of generations had elapsed, in the majority of cases would have led to the model extrapolating better beyond the range in which it was trained [23]. Early stopping has been described in Section 8.2.2.…”
Section: Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…The question then becomes, when should early stopping take place? Previous work [9] indicates that early stopping should not necessarily take place the first time validation set error disimproves during a symbolic regression run using Grammarbased GP. With the aim of developing techniques to counteract overfitting in Grammar-based GP, the classes of stopping criteria in [8] were implemented here on symbolic regression problems.…”
Section: Overfitting and Early Stoppingmentioning
confidence: 99%