1996
DOI: 10.1162/neco.1996.8.7.1521

Learning and Generalization in Cascade Network Architectures

Abstract: Incrementally constructed cascade architectures are a promising alternative to networks of predefined size. This paper compares the direct cascade architecture (DCA) proposed in Littmann and Ritter (1992) to the cascade-correlation approach of Fahlman and Lebiere (1990) and to related approaches, and discusses their properties on the basis of various benchmark results. One important virtue of DCA is that it allows the cascading of entire subnetworks, even if these admit no error-backpropagation. Exploiting this f…
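The cascading scheme the abstract describes lends itself to a short sketch: each newly added module is trained in isolation on the original inputs augmented with the outputs of all earlier modules, so earlier modules never need to propagate error gradients. Below is a minimal Python sketch under our own assumptions; the class name DirectCascade and the fit/predict module interface are illustrative, not from the paper.

```python
# Minimal sketch of stage-wise direct cascading (illustrative, not the
# paper's implementation): each stage sees the original inputs plus the
# outputs of all previously trained stages.
import numpy as np

class DirectCascade:
    def __init__(self):
        self.stages = []

    def _inputs_for(self, n_stages, X):
        # Augment X with the outputs of the first n_stages modules.
        Z = X
        for stage in self.stages[:n_stages]:
            Z = np.hstack([Z, stage.predict(Z).reshape(len(X), -1)])
        return Z

    def add_stage(self, module, X, y):
        Z = self._inputs_for(len(self.stages), X)
        module.fit(Z, y)  # trained in isolation: no backprop into earlier stages
        self.stages.append(module)

    def predict(self, X):
        Z = self._inputs_for(len(self.stages) - 1, X)
        return self.stages[-1].predict(Z)
```

Because each stage needs only a fit/predict interface, whole subnetworks that admit no error-backpropagation can be cascaded, which is the virtue the abstract highlights.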

Cited by 29 publications (14 citation statements)
References 6 publications
Citing publications range from 1997 to 2013.
“…Previous research has not reached a conclusive verdict on the generalization capabilities of the standard CCN and the flat CCN. Littmann and Ritter suggested that the standard CCN generalizes better than the flat CCN, whereas Sjogaard suggested that the flat CCN is the better choice [39,10]. Prechelt found empirically that the flat variant was superior to the cascade variant for some problems [17].…”
Section: Discussion
confidence: 99%
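The distinction at issue can be made concrete: in the standard (cascade) CCN each new hidden unit receives the network inputs plus the outputs of all previously installed hidden units, while in the flat variant every hidden unit sees only the inputs. A sketch of the two hidden-layer computations follows; the weight shapes and tanh activations are our assumptions.

```python
# Contrast of the two CCN topologies discussed above (illustrative sketch).
import numpy as np

def cascade_hidden(x, weights):
    # Standard CCN: unit k sees the inputs plus units 0..k-1,
    # so weights[k] must have length len(x) + k.
    acts = []
    for w in weights:
        z = np.concatenate([x, acts])
        acts.append(np.tanh(w @ z))
    return np.array(acts)

def flat_hidden(x, weights):
    # Flat CCN: every unit sees only the original inputs,
    # so each weights[k] has length len(x).
    return np.array([np.tanh(w @ x) for w in weights])
```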
“…Our indirect-mapping EKM resembles the EKM models of Littmann and Ritter (1996) and Walter and Schulten (1993), which utilize locally linear mappings. In their models, each neuron stores both the motor control vector and the matrix of motor control parameters as output weights (Eq.…”
Section: Related Work
confidence: 99%
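The locally linear mapping referred to here can be sketched as a lookup plus a first-order correction: the best-matching map node supplies a stored output vector, and its stored matrix is applied to the input's offset from the node's prototype. The field names and shapes below are assumptions, not the cited models' notation.

```python
# Sketch of a locally linear output stage in an extended Kohonen map (EKM).
import numpy as np

def ekm_output(x, prototypes, out_vectors, out_matrices):
    # prototypes:   (N, d_in)        input-space codebook vectors
    # out_vectors:  (N, d_out)       stored control vectors
    # out_matrices: (N, d_out, d_in) stored locally linear terms
    k = np.argmin(np.linalg.norm(prototypes - x, axis=1))  # best-matching node
    return out_vectors[k] + out_matrices[k] @ (x - prototypes[k])
```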
“…Furthermore, we have shown that training the control parameters with recursive least squares enables faster convergence and better performance compared to gradient descent. Their EKM models (Littmann and Ritter, 1996; Walter and Schulten, 1993) have only used gradient descent to learn the control parameters.…”
Section: Related Work
confidence: 99%
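For reference, the recursive least squares alternative mentioned here follows the textbook RLS recursion; the sketch below is not the cited authors' exact formulation, and the forgetting factor lam is an assumption.

```python
# Generic RLS update for one node's linear output weights (sketch).
import numpy as np

def rls_update(W, P, z, y_target, lam=0.99):
    # W: (d_out, d_in) weights; P: (d_in, d_in) inverse input covariance;
    # z: (d_in,) input features; y_target: (d_out,) desired output.
    Pz = P @ z
    g = Pz / (lam + z @ Pz)              # gain vector
    err = y_target - W @ z               # a-priori prediction error
    W_new = W + np.outer(err, g)         # rank-one weight correction
    P_new = (P - np.outer(g, Pz)) / lam  # covariance update
    return W_new, P_new
```

Unlike gradient descent, this recursion needs no learning-rate schedule; each update costs O(d_in²), which is consistent with the faster convergence the statement reports.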
“…Ash (1989) and Wang et al. (1994) propose the addition of units to the hidden layers of standard MLPs during normal backpropagation training, but this approach severely disturbs the training process because of the interaction of hidden units. Not even constructive unit splitting with reasonable initializations for the new units works well (Hanson, 1989; Wynne-Jones, 1991). Littmann and Ritter (1993) propose direct cascading, where local linear maps or different neural modules are cascaded and produce the output from the union of the original network inputs and the outputs of previous modules.…”
Section: Related Work
confidence: 99%
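The direct cascading described in this statement is the same input-union scheme sketched after the abstract. Continuing that DirectCascade sketch, a module that admits no backpropagation (here a k-nearest-neighbours regressor, an illustrative choice) can be cascaded with a linear map, since each stage is trained in isolation.

```python
# Continues the DirectCascade sketch above; module choices and the toy
# data are illustrative assumptions.
import numpy as np
from sklearn.neighbors import KNeighborsRegressor
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
y = np.sin(X[:, 0]) + 0.1 * rng.normal(size=200)

dca = DirectCascade()
dca.add_stage(KNeighborsRegressor(n_neighbors=5), X, y)  # no error-backpropagation possible
dca.add_stage(Ridge(alpha=1.0), X, y)                    # sees inputs + k-NN outputs
pred = dca.predict(X)
```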