A Preliminary Investigation of Overfitting in Evolutionary Driven Model Induction: Implications for Financial Modelling

Tuite, Clíodhna; Agapitos, Alexandros; O’Neill, Michael; Brabazon, Anthony

doi:10.1007/978-3-642-20520-0_13

Cited by 14 publications

(8 citation statements)

References 9 publications

(6 reference statements)

“…It can be seen that the model does not overfit the data. Comparison with runs using all the available data for training achieved similar results, suggesting that the use of a 2-set methodology neither hinders nor improves the performance of the obtained models, confirming previous results reported in GP [5] and GE [23]. The best (D1 + N2) model is shown in Eq.…”

Section: Results and Analysis (supporting)

confidence: 83%

Evolving Interpolating Models of Net Ecosystem CO2 Exchange Using Grammatical Evolution

Nicolau, Saunders, O’Neill, et al. 2012

Lecture Notes in Computer Science

Self Cite

Abstract. Accurate measurements of Net Ecosystem Exchange of CO2 between atmosphere and biosphere are required in order to estimate annual carbon budgets. These are typically obtained with Eddy Covariance techniques. Unfortunately, these techniques are often both noisy and incomplete, due to data loss through equipment failure and routine maintenance, and require gap-filling techniques in order to provide accurate annual budgets. In this study, a grammar-based version of Genetic Programming is employed to generate interpolating models for flux data. The evolved models are robust, and their symbolic nature provides further understanding of the environmental variables involved.

“…In Fig. 8.3(b) we can see an explosion in the test error towards the end of the run [23], which contrasts with the low value of the training error.…”

Section: Results (mentioning)

confidence: 84%

“…Table 8.1 shows results of interest with respect to the fitness as evaluated on the validation and test dataset, for 9 runs. It shows that stopping evolution before the specified number of generations had elapsed, in the majority of cases would have led to the model extrapolating better beyond the range in which it was trained [23]. Early stopping has been described in Section 8.2.2.…”

Section: Results (mentioning)

confidence: 99%

“…We examine the eight runs where the model that we would have evolved using traditional early stopping had better test fitness than the model that would have been evolved at the end of the run. In five out of these eight runs, the optimal generation at which to stop (as measured by test fitness) came later than the generation of the result of training using traditional early stopping [23].…”

Section: Results (mentioning)

confidence: 99%

Tackling Overfitting in Evolutionary-Driven Financial Model Induction

Tuite, Agapitos, O’Neill, et al. 2011

Natural Computing in Computational Finance

Self Cite

“…The question then becomes, when should early stopping take place? Previous work [9] indicates that early stopping should not necessarily take place the first time validation set error disimproves during a symbolic regression run using Grammar-based GP. With the aim of developing techniques to counteract overfitting in Grammar-based GP, the classes of stopping criteria in [8] were implemented here on symbolic regression problems.…”

Section: Overfitting and Early Stopping (mentioning)

confidence: 99%
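
The statements above contrast stopping the first time validation error worsens with stopping later in the run. The following is a minimal sketch of that contrast, not the criteria used in the cited works: the function names, the `patience` parameter, and the validation-error values are all hypothetical and chosen only for illustration.

```python
from typing import Optional, Sequence


def first_disimprovement(val_errors: Sequence[float]) -> Optional[int]:
    """Return the generation just before validation error first rises.

    Returns None if validation error never rises during the run.
    """
    for gen in range(1, len(val_errors)):
        if val_errors[gen] > val_errors[gen - 1]:
            return gen - 1
    return None


def patience_stop(val_errors: Sequence[float], patience: int) -> int:
    """Return the generation with the best validation error seen before
    `patience` generations pass without a new best (hypothetical rule)."""
    best_gen = 0
    for gen in range(1, len(val_errors)):
        if val_errors[gen] < val_errors[best_gen]:
            best_gen = gen
        elif gen - best_gen >= patience:
            break
    return best_gen


# Made-up validation errors per generation (illustrative only, not real data).
val_errors = [0.90, 0.70, 0.72, 0.60, 0.50, 0.55, 0.58, 0.61, 0.65]

naive_gen = first_disimprovement(val_errors)
patient_gen = patience_stop(val_errors, patience=3)
```

On this made-up series the first-disimprovement rule halts at a temporary bump in validation error, while the patience-based rule tolerates the bump and selects a later generation with lower validation error. Whether a later stopping point also gives better test fitness is an empirical question for each run.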