2023
DOI: 10.1145/3586074

A Practical Survey on Faster and Lighter Transformers

Abstract: Recurrent neural networks are effective models to process sequences. However, they are unable to learn long-term dependencies because of their inherent sequential nature. As a solution, Vaswani et al. introduced the Transformer, a model solely based on the attention mechanism that is able to relate any two positions of the input sequence, hence modelling arbitrarily long dependencies. The Transformer has improved the state-of-the-art across numerous sequence modelling tasks. However, its effectiveness comes at t…
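The cost the abstract alludes to stems from the attention mechanism itself: every position is compared against every other. Below is a minimal NumPy sketch of standard scaled dot-product attention, softmax(QK^T / sqrt(d)) V; it illustrates the general mechanism only, and the shapes (n = 512, d = 64) and variable names are illustrative rather than taken from the paper.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    """Scaled dot-product attention: softmax(Q K^T / sqrt(d)) V.

    The score matrix is (n, n), so time and memory grow quadratically with
    the sequence length n -- the bottleneck that faster and lighter
    Transformer variants try to remove.
    """
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)   # (n, n): every position attends to every other
    return softmax(scores) @ V      # (n, d)

# Illustrative shapes only.
n, d = 512, 64
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((n, d)) for _ in range(3))
print(attention(Q, K, V).shape)  # (512, 64)
```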

Cited by 25 publications (13 citation statements)
References 29 publications
“…Additional research efforts, like the Pythia suite, are providing new tools to analyze LLMs and address this issue [66]. Other recent survey papers, such as [67], seek to address the issue of how to apply more efficient transformer methods to NLP tasks. Approaches are grouped together by sparse, factorized attention, and architectural change.…”
Section: How Large?
Mentioning, confidence: 99%
“…Approaches are grouped together by sparse, factorized attention, and architectural change. However, [67] concludes there are, ". .…”
Section: How Large?
Mentioning, confidence: 99%
“…This model has enabled researchers to approach textual data with novel methods and has become increasingly popular over time due to its effectiveness in acquiring contextual word representations, leading to numerous studies in this area. Upon reviewing the literature, it is evident that many studies typically focus on various aspects of transformer models, including their architecture, efficiency, computational power, memory efficiency, and the development of fast and lightweight variants [52]. On the other hand, in other studies, various NLP applications have been explored, including visualization of transformers for NLP [53], examination of pre-training methods used in transformer models [54], usage of transformers for text summarization tasks [55], application of transformer models for detecting different sentiment levels from text-based data [56], and using transformers for extracting useful information from large datasets [57].…”
Section: Deep Learning and Transformers
Mentioning, confidence: 99%
“…Transformers (Vaswani et al., 2017) have emerged as highly effective models for various tasks, but their widespread adoption has been limited by the quadratic cost of the self-attention mechanism and poor performance on long-range tasks. Researchers have pursued diverse approaches to overcome this challenge and to create efficient transformer architectures (Fournier et al., 2021). From the perspective of efficiency, techniques such as sparse attention, low-rank attention (Wang et al., 2020; Winata et al., 2020), kernel-based attention (Choromanski et al., 2020), recurrent mechanisms (Hutchins et al., 2022; Dai et al., 2019), and efficient IO-awareness-based implementation (Dao et al., 2022a) proved efficient.…”
Section: Long Range Transformers
Mentioning, confidence: 99%
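One of the families named in this excerpt, kernel-based attention, can be sketched by replacing the softmax with a feature map phi so that the (n, n) score matrix is never materialised. The NumPy sketch below uses a generic linear-attention formulation with an ELU+1 feature map; it is an illustrative approximation in the spirit of the kernel methods cited above, not the implementation of any specific cited paper, and all names and shapes are assumed for the example.

```python
import numpy as np

def phi(x):
    # ELU(x) + 1: a simple positive feature map (assumed here for illustration).
    return np.where(x > 0, x + 1.0, np.exp(x))

def linear_attention(Q, K, V):
    """Kernel-based (linear) attention.

    out_i = phi(q_i) @ (phi(K)^T V) / (phi(q_i) @ sum_j phi(k_j))

    Only a (d, d) summary of keys and values is built, so the cost is
    O(n * d^2) time and O(d^2) extra memory instead of O(n^2).
    """
    Qp, Kp = phi(Q), phi(K)        # (n, d) positive features
    kv = Kp.T @ V                  # (d, d) key-value summary
    z = Qp @ Kp.sum(axis=0)        # (n,)   per-query normaliser
    return (Qp @ kv) / z[:, None]  # (n, d)

# Illustrative shapes only.
n, d = 2048, 64
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((n, d)) for _ in range(3))
print(linear_attention(Q, K, V).shape)  # (2048, 64)
```

The trade-off is that the feature map only approximates the softmax weighting, which is broadly where accuracy differences between such efficient-attention methods on long-range tasks tend to come from.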