ProSOUL: A Framework to Identify Propaganda From Online Urdu Content

Kausar, Soufia; Tahir, Bilal; Mehmood, Muhammad Amir

doi:10.1109/access.2020.3028131

Cited by 26 publications

(13 citation statements)

References 33 publications


“…We can look at the features from following points of view: Being imitated: Some features are difficult to be mimicked by malicious users (topographical features), while most of them can be imitated easily. Sensitive to time: For instance, [37] shows that Word n-gram features may provide information which is less relevant at that point of time. Required level of computational resources: Some features are readily available, while some others like layer ratio in [38] need processing…”

Section: Proaches (citation type: mentioning)

confidence: 99%

Detecting and Mitigating the Dissemination of Fake News: Challenges and Future Research Opportunities

Shahid, Jamshidi, Hakak, et al. 2024

IEEE Trans. Comput. Soc. Syst.


Fake news is a major threat to democracy (e.g., influencing public opinion), and its impact cannot be overstated, particularly in our current socially and digitally connected society. Research communities from different disciplines (e.g., computer science, political science, information science, and linguistics) have studied the dissemination, detection, and mitigation of fake news; however, it remains challenging to detect and prevent the dissemination of fake news in practice. With AI-powered systems, it is crucial to understand the detector's fake news decisions by means of proper user-friendly explanations when it comes to social media. Hence, in this paper, we systematically survey existing state-of-the-art approaches designed to detect and mitigate the dissemination of fake news, and based on the analysis, we discuss several key challenges and present a potential future research agenda, especially incorporating an explainable AI fake news credibility system.



“…UrduWeb20 is effectively used to develop and test NLP and IR applications for the Urdu language. For instance, UrduWeb20 is employed by Kausar et al [61] for the propaganda detection from the Urdu content. Authors train machine learning models on the gold standard dataset of Urdu content.…”

Section: B. NLP/IR Applications (citation type: mentioning)

confidence: 99%

Corpulyzer: A Novel Framework for Building Low Resource Language Corpora

Tahir, Mehmood 2021

IEEE Access

Self Cite


“…These dense vector representations have been leveraged extensively, for example, as input representations in neural network architectures for NLP tasks [10], e.g., detecting 'fake news' and phenomena related to the setting of this work [29]. In a recent study identifying online propaganda [18], Word2vec embeddings were found to outperform a multilingual version of BERT in Urdu [7], which the authors ascribe to the limited vocabulary of Urdu in the model. In another study, Word2vec has been leveraged as a feature in the detection of fake news where researchers found that it performs well in comparison to other textual features across multiple datasets and languages [9].…”

Section: Deriving Insights From Twitter Data (citation type: mentioning)

confidence: 99%

Deriving Disinformation Insights from Geolocalized Twitter Callouts

Tuxworth, Antypas, Espinosa-Anke, et al. 2021

Preprint


This paper demonstrates a two-stage method for deriving insights from social media data relating to disinformation by applying a combination of geospatial classification and embedding-based language modelling across multiple languages. In particular, the analysis is centered on Twitter and disinformation for three European languages: English, French and Spanish. Firstly, Twitter data is classified into European and non-European sets using BERT. Secondly, Word2vec is applied to the classified texts, resulting in Eurocentric, non-Eurocentric and global representations of the data for the three target languages. This comparative analysis not only demonstrates the efficacy of the classification method but also highlights geographic, temporal and linguistic differences in the disinformation-related media. Thus, the contributions of the work are threefold: (i) a novel language-independent transformer-based geolocation method; (ii) an analytical approach that exploits lexical specificity and word embeddings to interrogate user-generated content; and (iii) a dataset of 36 million disinformation-related tweets in English, French and Spanish.
