EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020 2020
DOI: 10.4000/books.aaccademia.7092
|View full text |Cite
|
Sign up to set email alerts
|

UNITOR @ Sardistance2020: Combining Transformer-based Architectures and Transfer Learning for Robust Stance Detection

Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks that include an encoder and a decoder. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizabl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 19 publications
(3 citation statements)
references
References 11 publications
0
3
0
Order By: Relevance
“…Works as (Giorgioni et al, 2020;Ferreira and Vlachos, 2019) have proposed Transformer-based architectures combined with data augmentation and fine-tuning. They trained specific sentence classifiers based on UmBERTo using auxiliary datasets from tasks like sentiment analysis, irony detection, and hate-speech detection.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Works as (Giorgioni et al, 2020;Ferreira and Vlachos, 2019) have proposed Transformer-based architectures combined with data augmentation and fine-tuning. They trained specific sentence classifiers based on UmBERTo using auxiliary datasets from tasks like sentiment analysis, irony detection, and hate-speech detection.…”
Section: Related Workmentioning
confidence: 99%
“…Transfer learning (Zhang et al, 2020;Giorgioni et al, 2020) and unsupervised approaches (Darwish et al, 2020;Rashed et al, 2021;Wei et al, 2019) are promising directions for SD, but they still face challenges in achieving comparable results to supervised machine learning approaches, especially in highly polarized environments such as Twitter. This is due to the difficulty of detecting stances in a noisy and polarized platform such as Twitter, where people express their opinions in nuanced and complex ways.…”
Section: Introductionmentioning
confidence: 99%
“…(i) We analyze emerging trends in the use of inti et al, 2020;Sarti, 2020;Giorgioni et al, 2020). This language model has 12-layer, 768hidden, 12-heads, 110M parameters.…”
Section: Introductionmentioning
confidence: 99%