2022
DOI: 10.3390/electronics11213550
|View full text |Cite
|
Sign up to set email alerts
|

EFA-Trans: An Efficient and Flexible Acceleration Architecture for Transformers

Abstract: The topic of transformers is rapidly emerging as one of the most important key primitives in neural networks. Unfortunately, most hardware designs for transformers are deficient, either hardly considering the configurability of the design or failing to realize the complete inference process of transformers. Specifically, few studies have paid attention to the compatibility of different computing paradigms. Thus, this paper presents EFA-Trans, a highly efficient and flexible hardware accelerator architecture fo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
0
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
3

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(2 citation statements)
references
References 20 publications
0
0
0
Order By: Relevance
“…Since the hardware implementation of this function is resource-intensive, researchers have proposed various approaches to optimize it. In papers [29][30][31][32], the exponential function is implemented by storing its points in an internal memory of FPGA (look-up table, LUT). This is a simple and effective solution.…”
Section: Current State Of Hardware Implementation Of the Softmax Acti...mentioning
confidence: 99%
See 1 more Smart Citation
“…Since the hardware implementation of this function is resource-intensive, researchers have proposed various approaches to optimize it. In papers [29][30][31][32], the exponential function is implemented by storing its points in an internal memory of FPGA (look-up table, LUT). This is a simple and effective solution.…”
Section: Current State Of Hardware Implementation Of the Softmax Acti...mentioning
confidence: 99%
“…The wide input range of the SoftMax activation increases the complexity of its hardware implementation. To reduce the range, most of the mentioned papers suggest subtracting the maximum input value from all inputs [29,31,32,34]. This is possible due to Equation (2).…”
Section: Current State Of Hardware Implementation Of the Softmax Acti...mentioning
confidence: 99%