2018
DOI: 10.48550/arxiv.1812.11391
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

SLIM LSTMs

Fathi M. Salem

Abstract: Long Short-Term Memory (LSTM) Recurrent Neural networks (RNNs) rely on gating signals, each driven by a function of a weighted sum of at least 3 components: (i) one of an adaptive weight matrix multiplied by the incoming external input vector sequence, (ii) one adaptive weight matrix multiplied by the previous memory/state vector, and (iii) one adaptive bias vector. In effect, they augment the simple Recurrent Neural Networks (sRNNs) structure with the addition of a "memory cell" and the incorporation of at mo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
2
1
1

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(9 citation statements)
references
References 12 publications
0
9
0
Order By: Relevance
“…This variant form is close to the so-called basic Recurrent Neural Network (bRNN), see [19], [21] for analysis and details.…”
Section: Lstmmentioning
confidence: 91%
See 2 more Smart Citations
“…This variant form is close to the so-called basic Recurrent Neural Network (bRNN), see [19], [21] for analysis and details.…”
Section: Lstmmentioning
confidence: 91%
“…Different variants have been introduced earlier [3], [21]. For LSTM 6, the gating signals are set at constant values as follows:…”
Section: Lstmmentioning
confidence: 99%
See 1 more Smart Citation
“…More recently, a host of new variants with aggressive reduction of parameters of the LSTM layer have shown reasonable initial success, see [6]- [10]. These mosaic of variants are referred to as SLIM LSTMs [11].…”
Section: B Slim Lstm Variants Overviewmentioning
confidence: 99%
“…The overall equations of this standard LSTM layer are described in [2], and the references therein. Here, we follow the presentation in [10], [11], where one splits the 3 gating equations from the memory cell and the"input block" equations for suitability of the development in the next sections. The 3 gating equations are:…”
Section: Introduction a Lstm Architecture Overviewmentioning
confidence: 99%