Abstract: Deep Echo State Networks (DeepESNs) recently extended the applicability of Reservoir Computing (RC) methods towards the field of deep learning. In this paper we study the impact of constrained reservoir topologies in the architectural design of deep reservoirs, through numerical experiments on several RC benchmarks. The major outcome of our investigation is to show the remarkable effect, in terms of predictive performance gain, achieved by the synergy between a deep reservoir construction and a structured orga…
“…The presented results differ by multiple orders of magnitude from e.g. Gallicchio and Micheli [9], who reached NARMA10 ≈ 10⁻⁴ ± 10⁻⁵, MG17 ≈ 10⁻⁹ ± 10⁻¹⁰, and MG30 ≈ 10⁻⁸ ± 10⁻⁹. The parameters in the aforementioned work were tuned manually and the sparse topology ended up as the worst of the four.…”
Section: Results (contrasting)
confidence: 92%
“…Some authors (e.g., [25]) use the same constant for all the nonzero connection weights in the ring, chain, and permutation topologies instead of generating the values from N(μ_res, σ²_res) as is the case for the sparse topology (e.g., [9]). In other words, the reservoir matrix W can be expressed as λW_b, where W_b is a binary matrix and λ is the desired constant.…”
Section: Topologies (mentioning)
confidence: 99%
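As an illustration of the construction in the quoted passage, the following numpy sketch builds a ring reservoir as W = λW_b and, for contrast, a sparse reservoir with weights drawn from N(μ_res, σ²_res). The function names, the density argument, and all numeric defaults are illustrative rather than taken from the cited papers.

```python
import numpy as np

def ring_reservoir(n, lam):
    """Ring topology: W = lam * W_b, where W_b is the binary adjacency
    matrix of one directed cycle through all n neurons."""
    # W_b[i, i-1] = 1 for every i, with W_b[0, n-1] closing the ring.
    W_b = np.roll(np.eye(n), 1, axis=0)
    return lam * W_b

def sparse_reservoir(n, mu, sigma, density, rng=None):
    """Sparse topology: each connection exists with probability `density`
    and its weight is drawn from N(mu, sigma^2); the remaining entries are zero."""
    rng = np.random.default_rng() if rng is None else rng
    mask = rng.random((n, n)) < density
    return rng.normal(mu, sigma, size=(n, n)) * mask
```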
“…It is worth noting that the sparse topology has O(n²) parameters where n is the number of neurons, whereas the ring, chain, and permutation topologies have only O(n) parameters. Analogously to other papers (e.g., [9], [31]), we compare topologies with the same number of neurons, not the same number of parameters.…”
Section: Topologies (mentioning)
confidence: 99%
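A short self-contained snippet (with illustrative values for λ, μ_res, σ_res, and the connection density) that makes the O(n) versus O(n²) parameter counts concrete for n = 500 neurons:

```python
import numpy as np

n = 500
rng = np.random.default_rng(0)

# Ring: one nonzero weight per neuron -> O(n) free parameters.
W_ring = 0.9 * np.roll(np.eye(n), 1, axis=0)              # lambda = 0.9 is illustrative

# Sparse: up to n^2 weights (here 10% density) -> O(n^2) free parameters.
W_sparse = rng.normal(0.0, 0.1, (n, n)) * (rng.random((n, n)) < 0.1)

print(np.count_nonzero(W_ring))    # 500
print(np.count_nonzero(W_sparse))  # roughly 25,000
```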
“…Unfortunately, researchers have not yet converged to a single and easily comparable performance measure, and even though there exist widely used benchmark tasks, many authors have developed their own modifications or parametrizations. Unless specified otherwise, we will use the measures from Gallicchio and Micheli [9].…”
Section: Benchmarks (mentioning)
confidence: 99%
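For readers unfamiliar with the benchmark, the sketch below generates a NARMA10 sequence using the parametrization most common in the ESN literature; as the quote notes, individual papers vary the details, so the constants here should be read as one typical choice rather than the exact setup of [9].

```python
import numpy as np

def narma10(T, rng=None):
    """Generate T steps of the NARMA10 task in its common parametrization:
    y[t+1] = 0.3*y[t] + 0.05*y[t]*sum(y[t-9..t]) + 1.5*u[t-9]*u[t] + 0.1,
    with inputs u[t] drawn uniformly from [0, 0.5]."""
    rng = np.random.default_rng() if rng is None else rng
    u = rng.uniform(0.0, 0.5, size=T)
    y = np.zeros(T)
    for t in range(9, T - 1):
        y[t + 1] = (0.3 * y[t]
                    + 0.05 * y[t] * np.sum(y[t - 9:t + 1])   # last 10 outputs
                    + 1.5 * u[t - 9] * u[t]
                    + 0.1)
    return u, y
```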
“…The hyperparameters of each reservoir topology are optimized so that the instantiated network maximizes its performance on one of the benchmark tasks. Similarly to [9], the experiment uses ESNs with 500 reservoir neurons, regardless of the topology. The reservoir weights are generated from a normal distribution N(μ_res, σ²_res), feedback weights from a uniform distribution U(−ω_fb, ω_fb), and input weights from U(−ω_in, ω_in).…”
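A minimal sketch of the weight initialization described in the quote, assuming a single input and a single feedback signal; the default hyperparameter values are placeholders, since these are exactly the quantities being tuned.

```python
import numpy as np

def init_esn_weights(n=500, n_in=1, mu_res=0.0, sigma_res=0.1,
                     omega_in=0.1, omega_fb=0.1, rng=None):
    """Draw ESN weight matrices as described in the quoted setup:
    reservoir weights from N(mu_res, sigma_res^2), input weights from
    U(-omega_in, omega_in), feedback weights from U(-omega_fb, omega_fb).
    All default values here are placeholders, not the tuned settings."""
    rng = np.random.default_rng() if rng is None else rng
    W = rng.normal(mu_res, sigma_res, size=(n, n))        # topology mask applied separately
    W_in = rng.uniform(-omega_in, omega_in, size=(n, n_in))
    W_fb = rng.uniform(-omega_fb, omega_fb, size=(n, 1))
    return W, W_in, W_fb
```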
Echo State Networks represent a type of recurrent neural network with a large randomly generated reservoir and a small number of readout connections trained via linear regression. The most common topology of the reservoir is a fully connected network of up to thousands of neurons. Over the years, researchers have introduced a variety of alternative reservoir topologies, such as a circular network or a linear path of connections. When comparing the performance of different topologies or other architectural changes, it is necessary to tune the hyperparameters for each topology separately, since their properties may differ significantly. The hyperparameter tuning is usually carried out manually by selecting the best performing set of parameters from a sparse grid of predefined combinations. Unfortunately, this approach may lead to underperforming configurations, especially for sensitive topologies. We propose an alternative approach to hyperparameter tuning based on the Covariance Matrix Adaptation Evolution Strategy (CMA-ES). Using this approach, we have improved multiple topology comparison results by orders of magnitude, suggesting that topology alone does not play as important a role as properly tuned hyperparameters.
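A hedged sketch of how such CMA-ES-based tuning could look, using the pycma package as one possible implementation; the objective function below is a hypothetical stand-in for building an ESN from the candidate hyperparameters, training the linear readout, and returning the validation error.

```python
import cma  # pycma package: one possible CMA-ES implementation

def esn_validation_error(hyperparams):
    """Placeholder objective (hypothetical). In the real experiment this would
    decode the candidate vector (e.g. mu_res, sigma_res, omega_in, omega_fb),
    instantiate the reservoir, train the linear readout, and return the
    validation error on the chosen benchmark task."""
    mu_res, sigma_res, omega_in, omega_fb = hyperparams
    # Stand-in smooth function so the sketch is runnable end to end.
    return mu_res ** 2 + (sigma_res - 0.1) ** 2 + (omega_in - 0.1) ** 2 + (omega_fb - 0.1) ** 2

x0 = [0.0, 0.2, 0.2, 0.2]     # illustrative starting hyperparameters
sigma0 = 0.05                 # initial CMA-ES step size
es = cma.CMAEvolutionStrategy(x0, sigma0)
while not es.stop():
    candidates = es.ask()     # sample a population of hyperparameter vectors
    es.tell(candidates, [esn_validation_error(c) for c in candidates])
best_hyperparams = es.result.xbest
```

In the actual study, each topology would get its own CMA-ES run per benchmark, so that every topology is compared at its own best-found configuration rather than at a shared grid point.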