2021
DOI: 10.48550/arxiv.2102.04567
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

NELA-GT-2020: A Large Multi-Labelled News Dataset for The Study of Misinformation in News Articles

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(5 citation statements)
references
References 0 publications
0
5
0
Order By: Relevance
“…Studies on this topic have examined the phenomenon of Twitter content being used in news articles, but these studies have been on small data sets of national news (Broersma and Graham 2013;Oschatz, Stier, and Maier 2021), not local news. Because the NELA-Local dataset contains the original URLs to all articles, embedded social media data can be scraped using a similar method to that used in (Gruppi, Horne, and Adalı 2021). If social media content is quoted or used as the source of the article, this will be captured in the article text already in the dataset.…”
Section: Characterizing Hybrid Local Mediamentioning
confidence: 99%
“…Studies on this topic have examined the phenomenon of Twitter content being used in news articles, but these studies have been on small data sets of national news (Broersma and Graham 2013;Oschatz, Stier, and Maier 2021), not local news. Because the NELA-Local dataset contains the original URLs to all articles, embedded social media data can be scraped using a similar method to that used in (Gruppi, Horne, and Adalı 2021). If social media content is quoted or used as the source of the article, this will be captured in the article text already in the dataset.…”
Section: Characterizing Hybrid Local Mediamentioning
confidence: 99%
“…To the best of our knowledge, our data is the most comprehensive corpus of local news outlets available for research. While there are similarly complete news datasets at a national level (Nørregaard, Horne, and Adalı 2019;Gruppi, Horne, and Adalı 2021), no such collections exist at the local level.…”
Section: Introductionmentioning
confidence: 99%
“…Note that such claims are inherently expert annotated. Other sources of claims are social media (Potthast et al, 2018;Shu, Sliva, Wang, Tang and Liu, 2017), news outlets (Horne, Khedr and Adali, 2018;Gruppi, Horne and Adalı, 2021;Nørregaard, Horne and Adalı, 2019), blogs, discussions in QA forums, or similar user-generated publishing platforms (Mihaylova, Nakov, Màrquez, Barrón-Cedeño, Mohtarami, Karadzhov and Glass, 2018).…”
Section: Veracity Predictionmentioning
confidence: 99%