…Almost all teams used Transformer-based models (especially BERT; Devlin et al., 2018), either to obtain embeddings or as a pretrained model (Yoosuf and Yang, 2019; Hou and Chen, 2019). Other teams often used ensembles combining different features and models, such as LSTM-CRF (Gupta et al., 2019), XGBoost (Tayyar Madabushi et al., 2019), and BiLSTM (Vlad et al., 2019).…

Figure 1: Class distribution in the training data, where A is "Loaded Language", B is "Name Calling or Labeling", C is "Repetition", D is "Doubt", E is "Exaggeration or Minimisation", and F represents the remaining 9 classes.