2009
DOI: 10.1142/s0218001409007132

An Incremental Framework Based on Cross-Validation for Estimating the Architecture of a Multilayer Perceptron

Abstract: We define the problem of optimizing the architecture of a multilayer perceptron (MLP) as a state space search and propose the MOST (Multiple Operators using Statistical Tests) framework that incrementally modifies the structure and checks for improvement using cross-validation. We consider five variants that implement forward/backward search, using single/multiple operators and searching depth-first/breadth-first. On 44 classification and 30 regression datasets, we exhaustively search for the optimal and evalu…
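The search procedure described in the abstract can be illustrated in a few lines. The following is a minimal sketch only, not the authors' implementation: it assumes scikit-learn's MLPClassifier and SciPy, uses a single forward operator (add one hidden unit) rather than MOST's full operator set, and applies a paired t-test on the fold scores as the statistical check; forward_search is a hypothetical helper name.

```python
# Sketch of forward architecture search with a cross-validation check,
# in the spirit of the MOST framework (not the authors' code).
# Single operator: "add one hidden unit"; test: paired t-test on folds.
from scipy.stats import ttest_rel
from sklearn.model_selection import cross_val_score
from sklearn.neural_network import MLPClassifier

def forward_search(X, y, max_units=32, cv=5, alpha=0.05):
    best_h = 1
    best_scores = cross_val_score(
        MLPClassifier(hidden_layer_sizes=(best_h,), max_iter=500), X, y, cv=cv)
    for h in range(2, max_units + 1):
        scores = cross_val_score(
            MLPClassifier(hidden_layer_sizes=(h,), max_iter=500), X, y, cv=cv)
        # Accept the larger network only if the improvement over the
        # current best is statistically significant across the same folds.
        t, p = ttest_rel(scores, best_scores)
        if scores.mean() > best_scores.mean() and p < alpha:
            best_h, best_scores = h, scores
        else:
            break  # depth-first stop: no significant improvement
    return best_h, best_scores.mean()
```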

Cited by 18 publications (8 citation statements); references 57 publications.
“…Two model selection approaches based on a K-fold cross-validation (K-CV) model evaluation methodology, which is commonly used in the literature [14, 24-26], are also applied for comparison purposes. The main differences with respect to the MoSe, MoSe-R and MoSe-D model selection methodologies lie in the generation of the training and validation sets, the RBFNN optimization process, and the iterative procedure, typical of a cross-validation approach, used to evaluate a given model/structure network (Fig.…”
Section: Strategies Based On a K-fold Cross-validation Methodology
confidence: 99%
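For context, the K-fold evaluation loop the citing authors compare against looks roughly like the sketch below. Assumptions: scikit-learn's KFold and MLPRegressor (standing in for the RBFNN, which scikit-learn does not provide); kfold_evaluate is a hypothetical helper name.

```python
# Sketch of K-fold cross-validation to score one fixed network
# structure, as in the quoted comparison. Each fold generates its own
# training and validation sets; the fold scores are then aggregated.
import numpy as np
from sklearn.model_selection import KFold
from sklearn.neural_network import MLPRegressor

def kfold_evaluate(X, y, hidden_units, k=5):
    kf = KFold(n_splits=k, shuffle=True, random_state=0)
    scores = []
    for train_idx, val_idx in kf.split(X):
        model = MLPRegressor(hidden_layer_sizes=(hidden_units,), max_iter=500)
        model.fit(X[train_idx], y[train_idx])
        scores.append(model.score(X[val_idx], y[val_idx]))  # R^2 per fold
    return np.mean(scores), np.std(scores)
```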
“…A network smaller than the optimal architecture underfits and fails to learn the data well (bias is high and variance is low), while an overly large network overfits, resulting in poor generalization (bias is low and variance is high). Thus, the optimal architecture is the one with low bias and low variance, so that the network learns the function underlying the data and not the existing noise [14].…”
Section: Introduction
confidence: 99%
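The bias/variance behaviour described in this statement can be observed directly by sweeping the hidden-layer size and comparing training against validation accuracy. A hypothetical illustration assuming scikit-learn; size_sweep is an invented helper.

```python
# Illustrative sketch of the under/overfitting trade-off: small networks
# score poorly on both sets (high bias); oversized ones score well on
# training but worse on validation (high variance).
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

def size_sweep(X, y, sizes=(1, 2, 4, 8, 16, 32, 64)):
    X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.3,
                                                random_state=0)
    for h in sizes:
        net = MLPClassifier(hidden_layer_sizes=(h,), max_iter=500,
                            random_state=0).fit(X_tr, y_tr)
        print(f"h={h:3d}  train={net.score(X_tr, y_tr):.3f}  "
              f"val={net.score(X_val, y_val):.3f}")
```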
“…The other common approaches for optimizing neural network architecture are growing, pruning, and a combination of the two strategies [13]. The first, also called constructive methods, starts with a minimal network and adds new hidden units during the training process [14-16].…”
Section: Neural Network Architecture
confidence: 99%
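A constructive method of the kind this statement describes can be sketched as a greedy loop: start with one hidden unit and keep adding units while a held-out score improves. This is only an illustration under scikit-learn (real constructive algorithms such as cascade-correlation grow the network during training rather than retraining from scratch); grow_network is a hypothetical helper.

```python
# Sketch of a constructive ("growing") strategy: begin minimal and add
# hidden units while the validation score keeps improving by at least tol.
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

def grow_network(X, y, max_units=64, tol=1e-3):
    X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.3,
                                                random_state=0)
    best_h, best_score = 1, -1.0
    for h in range(1, max_units + 1):
        score = MLPClassifier(hidden_layer_sizes=(h,), max_iter=500,
                              random_state=0).fit(X_tr, y_tr).score(X_val, y_val)
        if score > best_score + tol:
            best_h, best_score = h, score
        else:
            break  # adding more units stopped helping
    return best_h, best_score
```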