2022
DOI: 10.48550/arxiv.2212.14052
Preprint

Hungry Hungry Hippos: Towards Language Modeling with State Space Models

Cited by 12 publications (35 citation statements)
References: 0 publications
“…Gated State Spaces (GSS) instead compose the operator via gating and a long convolution parametrized via SSMs. Taking this idea further, Hungry Hungry Hippo (H3) (Dao et al, 2022c), motivated by gaps of GSS on associative recall, extend the mechanism to include an additional gate and a short convolution obtained via a shift SSM. Hyena generalizes this body of work by introducing a recurrence of gates and implicit long convolutions, evaluated efficiently.…”
Section: Subquadratic Operators (mentioning)
confidence: 99%
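
The composition described in this statement can be illustrated with a minimal NumPy sketch for the single-input, single-output case. It assumes the form q * conv_long(conv_short(k) * v), with a one-step shift kernel standing in for the shift SSM and a decaying exponential standing in for the SSM-parametrized long convolution. The function names (`causal_conv`, `h3_like_operator`) and both filters are hypothetical illustrations, not the authors' implementation.

```python
import numpy as np

def causal_conv(u, k):
    """Causal convolution y[t] = sum_{s<=t} k[s] * u[t-s], computed with an FFT."""
    L = len(u)
    n = 2 * L  # zero-pad so the circular convolution does not wrap around
    y = np.fft.irfft(np.fft.rfft(u, n) * np.fft.rfft(k, n), n)
    return y[:L]

def h3_like_operator(q, k, v, short_filter, long_filter):
    """Two gates around two convolutions (illustrative, SISO).

    short_filter: stand-in for the shift-SSM kernel (assumption).
    long_filter:  stand-in for the SSM-parametrized long kernel (assumption).
    """
    kv = causal_conv(k, short_filter) * v    # short convolution, then first gate
    return q * causal_conv(kv, long_filter)  # long convolution, then second gate

# Hypothetical inputs: random projections of a length-8 sequence.
rng = np.random.default_rng(0)
L = 8
q, k, v = rng.standard_normal((3, L))
shift = np.array([0.0, 1.0])   # delays its input by one step
decay = 0.9 ** np.arange(L)    # exponentially decaying long kernel
y = h3_like_operator(q, k, v, shift, decay)
```

Both convolutions cost O(L log L) through the FFT, which is why operators of this kind are grouped under subquadratic alternatives to attention.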
“…Hyena operators build on the H3 mechanism developed by (Dao et al, 2022c). For clarity of exposition, we once again consider the SISO case (D = 1).…”
Section: Hyena Matrices (mentioning)
confidence: 99%