SheffieldVeraAI at SemEval-2023 Task 3: Mono and Multilingual Approaches for News Genre, Topic and Persuasion Technique Classification

Wu, Benjamin M.; Razuvayevskaya, Olesya; Freddy, Heppell,; Leite, João; Scarton, Carolina; Bontcheva, Kalina; Song, Xingyi

doi:10.18653/v1/2023.semeval-1.275

Cited by 3 publications

(4 citation statements)

References 27 publications

(28 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…For sub-task 1, the articles that are longer than 512 tokens are separated into sentences, which are then sampled sequentially from the beginning and the end of the article, preserving the original order, until the maximum of 512 tokens is reached. Such a truncation approach is motivated by our experiments on sub-task 1 data during the competition stage of SemEval 2023 Task 3 [16]. This approach yielded a significant improvement in the F1 macro score over the setting that…”

Section: Methodsmentioning

confidence: 99%

“…It should be noted that three of the systems that participated in the original SemEval-2023 Task 3 [15] evaluation exercise used adapters. Teams HHU [29] and NAP [30] entered only sub-task 3, in which they used adapters, whereas SheffieldVeraAI [16] applied adapters to sub-tasks 1 and 2. Initial performance analysis in these sub-tasks showed the effect of adapters to be inconsistent across the different sub-tasks.…”

Section: Plos Onementioning

confidence: 99%

“…Importantly, for sub-task 3 experiments, we include sentences that do not have assigned labels into the training data by assigning them a vector of zeros to indicate that they do not belong to any class. This approach was shown to significantly improve classification performance on this sub-task in our initial experiments [16,17] The multilabel sub-tasks 2 and 3 use confidence thresholds of 50% and 30%, respectively, after applying a sigmoid activation function to the logits. The confidence threshold for subtask 3 is purposefully lower and was selected according to our previous experiments [17], which revealed that its careful calibration can significantly influence the performance of the model.…”

Section: Plos Onementioning

confidence: 99%

See 2 more Smart Citations

Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification

Razuvayevskaya,

Wu,

Leite

et al. 2024

PLoS ONE

Self Cite

View full text Add to dashboard Cite

Adapters and Low-Rank Adaptation (LoRA) are parameter-efficient fine-tuning techniques designed to make the training of language models more efficient. Previous results demonstrated that these methods can even improve performance on some classification tasks. This paper complements existing research by investigating how these techniques influence classification performance and computation costs compared to full fine-tuning. We focus specifically on multilingual text classification tasks (genre, framing, and persuasion techniques detection; with different input lengths, number of predicted classes and classification difficulty), some of which have limited training data. In addition, we conduct in-depth analyses of their efficacy across different training scenarios (training on the original multilingual data; on the translations into English; and on a subset of English-only data) and different languages. Our findings provide valuable insights into the applicability of parameter-efficient fine-tuning techniques, particularly for multilabel classification and non-parallel multilingual tasks which are aimed at analysing input texts of varying length.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: Plos Onementioning

confidence: 99%

Section: Plos Onementioning

confidence: 99%

See 1 more Smart Citation

Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification

Razuvayevskaya,

Wu,

Leite

et al. 2024

PLoS ONE

Self Cite

View full text Add to dashboard Cite

show abstract

“…Some works relied solely on real-world data to fine-tune a pre-trained language model for propaganda and persuasive technique detection (Costa et al, 2023), whereas others combined it with synthetically augmented data (Hasanain et al, 2023b). The ensemble approach was also investigated, in which various combined pre-trained language models are fine-tuned in a vanilla setting (Purificato and Navigli, 2023), or by using adapters (Wu et al, 2023).…”

Section: Related Workmentioning

confidence: 99%

HTE at ArAIEval Shared Task: Integrating Content Type Information in Binary Persuasive Technique Detection

Hadjer,

Bouklouha

2023

Proceedings of ArabicNLP 2023

View full text Add to dashboard Cite

Propaganda frequently employs sophisticated persuasive strategies in order to influence public opinion and manipulate perceptions. As a result, automating the detection of persuasive techniques is critical in identifying and mitigating propaganda on social media and in mainstream media. This paper proposes a set of transformer-based models for detecting persuasive techniques in tweets and news that incorporate content type information as extra features or as an extra learning objective in a multitask learning setting. In addition to learning to detect the presence of persuasive techniques in text, our best model learns specific syntactic and lexical cues used to express them based on text genre (type) as an auxiliary task. To optimize the model and deal with data imbalance, a focal loss is used. As part of ArabicNLP2023-ArAIEval shared task, this model achieves the highest score in the shared task 1A out of 13 participants, according to the official results, with a micro-F1 of 76.34% and a macro-F1 of 73.21% on the test dataset. 1

show abstract

FrameFinder: Explorative Multi-Perspective Framing Extraction from News Headlines

Reiter-Haas,

Klösch,

Hadler

et al. 2024

Proceedings of the 2024 ACM SIGIR Conference on Human Information Interaction and Retrieval

View full text Add to dashboard Cite

SheffieldVeraAI at SemEval-2023 Task 3: Mono and Multilingual Approaches for News Genre, Topic and Persuasion Technique Classification

Cited by 3 publications

References 27 publications

Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification

Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification

HTE at ArAIEval Shared Task: Integrating Content Type Information in Binary Persuasive Technique Detection

FrameFinder: Explorative Multi-Perspective Framing Extraction from News Headlines

Contact Info

Product

Resources

About