Proceedings of the the 17th International Workshop on Semantic Evaluation (SemEval-2023) 2023
DOI: 10.18653/v1/2023.semeval-1.275
|View full text |Cite
|
Sign up to set email alerts
|

SheffieldVeraAI at SemEval-2023 Task 3: Mono and Multilingual Approaches for News Genre, Topic and Persuasion Technique Classification

Abstract: This paper describes our approach for SemEval-2023 Task 3: Detecting the category, the framing, and the persuasion techniques in online news in a multilingual setup. For Subtask 1 (News Genre), we propose an ensemble of fully trained and adapter mBERT models which was ranked joint-first for German, and had the highest mean rank of multi-language teams. For Subtask 2 (Framing), we achieved first place in 3 languages, and the best average rank across all the languages, by using two separate ensembles: a monoling… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 27 publications
(28 reference statements)
0
3
0
Order By: Relevance
“…For sub-task 1, the articles that are longer than 512 tokens are separated into sentences, which are then sampled sequentially from the beginning and the end of the article, preserving the original order, until the maximum of 512 tokens is reached. Such a truncation approach is motivated by our experiments on sub-task 1 data during the competition stage of SemEval 2023 Task 3 [16]. This approach yielded a significant improvement in the F1 macro score over the setting that…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…For sub-task 1, the articles that are longer than 512 tokens are separated into sentences, which are then sampled sequentially from the beginning and the end of the article, preserving the original order, until the maximum of 512 tokens is reached. Such a truncation approach is motivated by our experiments on sub-task 1 data during the competition stage of SemEval 2023 Task 3 [16]. This approach yielded a significant improvement in the F1 macro score over the setting that…”
Section: Methodsmentioning
confidence: 99%
“…It should be noted that three of the systems that participated in the original SemEval-2023 Task 3 [15] evaluation exercise used adapters. Teams HHU [29] and NAP [30] entered only sub-task 3, in which they used adapters, whereas SheffieldVeraAI [16] applied adapters to sub-tasks 1 and 2. Initial performance analysis in these sub-tasks showed the effect of adapters to be inconsistent across the different sub-tasks.…”
Section: Plos Onementioning
confidence: 99%
See 1 more Smart Citation
“…Some works relied solely on real-world data to fine-tune a pre-trained language model for propaganda and persuasive technique detection (Costa et al, 2023), whereas others combined it with synthetically augmented data (Hasanain et al, 2023b). The ensemble approach was also investigated, in which various combined pre-trained language models are fine-tuned in a vanilla setting (Purificato and Navigli, 2023), or by using adapters (Wu et al, 2023).…”
Section: Related Workmentioning
confidence: 99%