CLIMATE-FEVER: A Dataset for Verification of Real-World Climate Claims

Diggelmann, Thomas; Boyd-Graber, Jordan; Bulian, Jannis; Ciaramita, Massimiliano; Leippold, Markus

doi:10.48550/arxiv.2012.00614

Cited by 20 publications

(28 citation statements)

References 19 publications


“…et al, 2021), Touché (Bondarenko et al, 2020), ArguAna (Wachsmuth et al, 2018), Climate-FEVER (C-FEVER) (Diggelmann et al, 2020), FEVER (Thorne et al, 2018), Quora, SCIDOCS, and SciFact (Wadden et al, 2020).…”

unclassified

ColBERTv2: Effective and Efficient Retrieval via Lightweight Late Interaction

Santhanam, Khattab, Saad-Falcon et al. 2021

Preprint


Neural information retrieval (IR) has greatly advanced search and other knowledge-intensive language tasks. While many neural IR methods encode queries and documents into single-vector representations, late interaction models produce multi-vector representations at the granularity of each token and decompose relevance modeling into scalable token-level computations. This decomposition has been shown to make late interaction more effective, but it inflates the space footprint of these models by an order of magnitude. In this work, we introduce ColBERTv2, a retriever that couples an aggressive residual compression mechanism with a denoised supervision strategy to simultaneously improve the quality and space footprint of late interaction. We evaluate ColBERTv2 across a wide range of benchmarks, establishing state-of-the-art quality within and outside the training domain while reducing the space footprint of late interaction models by 5-8×.


“…many datasets and task variants (Karadzhov et al, 2017; Baly et al, 2018; Augenstein et al, 2019; Hanselowski et al, 2019; Chen et al, 2020; Wadden et al, 2020; Diggelmann et al, 2020; Schuster et al, 2021, inter alia) that have enabled the development of new models and methods for machine reading and comprehension.…”

Section: (Supporting)

mentioning

confidence: 99%

“…The largest claim verification dataset, FEVER, contains 185,000 claims which were manually constructed; annotated with evidence supporting or refuting them from the introductory sections of Wikipedia pages. In contrast, MultiFC (Augenstein et al, 2019)

[table rows interleaved in the quoted text; column headers not included in the excerpt]
(Derczynski et al, 2017) | 330 | yes | no | Twitter
(Baly et al, 2018) | 442 | yes | no | News + fact checking websites
(Thorne and Vlachos, 2018) | 185,445 | no | yes | Wikipedia (constructed claims)
| 1,000 | yes | no | Debate websites
(Augenstein et al, 2019) | 36,534 | yes | no | Fact checking websites
(Hanselowski et al, 2019) | 6,422 | yes | yes | Fact checking websites
(Wadden et al, 2020) | 1,409 | no | yes | Scientific articles
(Diggelmann et al, 2020) | 1,535 | no | yes | Wikipedia (Climate change)

verification through stance classification. While stance classification considers whether claims are supported or refuted by evidence, unlike FEVER, it does not involve document retrieval or evidence extraction.…”

Section: (Supporting)

mentioning

confidence: 99%

“…Assuming that evidence is pre-selected, similar model architectures can be applied to both settings. However, in FEVER, CLIMATE-FEVER (Diggelmann et al, 2020) and SciFact (Wadden et al, 2020), individual evidence sentences must be retrieved from a corpus (such as Wikipedia). These evidence sentences were manually labelled and form part of the scoring protocol for the task.…”

Section: Related Work

mentioning

confidence: 99%


Evidence-based Verification for Real World Information Needs

Glockner, Staliūnaitė, Thorne et al. 2021

Preprint


Claim verification is the task of predicting the veracity of written statements against evidence. Previous large-scale datasets model the task as classification, ignoring the need to retrieve evidence, or are constructed for research purposes, and may not be representative of real-world needs. In this paper, we introduce a novel claim verification dataset with instances derived from search-engine queries, yielding 10,987 claims annotated with evidence that represent real-world information needs. For each claim, we annotate evidence from full Wikipedia articles with both section and sentence-level granularity. Our annotation allows comparison between two complementary approaches to verification: stance classification, and evidence extraction followed by entailment recognition. In our comprehensive evaluation, we find no significant difference in accuracy between these two approaches. This enables systems to use evidence extraction to summarize a rationale for an end-user while maintaining the accuracy when predicting a claim's veracity. With challenging claims and evidence documents containing hundreds of sentences, our dataset presents interesting challenges that are not captured in previous work, as evidenced through transfer learning experiments. We release code and data to support further research on this task.


“…Metadata such as page viewership statistics is helpful to rank webpages [Nie et al, 2019]. However, when search engines are not available, such as in the SCIVER shared task, the majority of effort goes into exploring similarity metrics that are used as a proxy to determine the documents' relevance to a claim. TF-IDF similarity is a common baseline [Wadden et al, 2020, Malon, 2018] and BM25 [Robertson et al, 1994] is demonstrated to be effective [Pradeep et al, 2020].…”

[dataset table interleaved in the quoted passage; column headers not included in the excerpt]
PolitiFact [Vlachos and Riedel, 2014] | 106 claims | Politics | Very small; metadata and evidence of various forms
Emergent [Ferreira and Vlachos, 2016] | 300 claims | News | Very small; 2595 associated documents
LIAR | 12,836 claims | Politics | Medium; metadata
Snopes [Popat et al, 2017] | 4,956 claims | Snopes website | Medium; 30 Google retrieved documents for each claim
FEVER [Thorne et al, 2018a] | 185,445 claims | Wikipedia | Big; associated Wikipedia evidence
LIAR-PLUS [Alhindi et al, 2018] | 12,836 claims | Politics | Medium; automatically extracted justifications
Perspectrum [Chen et al, 2019b] | 907 claims | Debates | Small; evidence and perspectives
UKP Snopes [Hanselowski et al, 2019] | 6,422 claims | Snopes website | Medium; associated evidence
MultiFC [Augenstein et al, 2019] | 34,918 claims | Fact-checking websites | Medium; metadata and 10 Google retrieved webpages for each claim
Scifact [Wadden et al, 2020] | 1,409 claims | Scientific papers | Small; associated documents
PolitiHop [Ostrowski et al, 2020] | 500 claims | Politics | Very small; evidence chains for multi-hop reasoning
WikiFactCheck-English [Sathe et al, 2020] | 124,821 claims | Wikipedia | Big; context and evidence
Climate-FEVER [Diggelmann et al, 2021] | 1,535 claims | Climate | Medium; 7,675 claim-evidence pairs with climate related claims verified against Wikipedia evidence
COVID-Fact [Saakyan et al, 2021] | 4,086 claims | COVID-19 | Medium; 1,296 supported claims from r/COVID19 subreddit and 2,790 automatically generated refuted claims
Vitamin-C [Schuster et al, 2021] | 488,904 pairs | Wikipedia | Big; contrastive evidence from Wikipedia edits
FEVEROUS [Aly et al, 2021] | 87,026 claims | Wikipedia | Biggest; evidence collected from both structured and unstructured information on whole Wikipedia

Section: Evidence Retrieval

mentioning

confidence: 99%

Automated Fact-Checking: A Survey

Zeng, Abumansour, Zubiaga 2021

Preprint


As online false information continues to grow, automated fact-checking has gained an increasing amount of attention in recent years. Researchers in the field of Natural Language Processing (NLP) have contributed to the task by building fact-checking datasets, devising automated fact-checking pipelines and proposing NLP methods to further research in the development of different components. This paper reviews relevant research on automated fact-checking covering both the claim detection and claim validation components.
