“…As mentioned in the introduction, the NLI task (Dagan et al, 2006(Dagan et al, , 2013, sometimes called Recognizing Textual Entailment (RTE), was extensively studied by the NLP community over the past several years as a semantic reasoning benchmark (see Poliak, 2020;Storks et al, 2019, for surveys). The field of fact verification (Vlachos and Riedel, 2014) also recently gained increased attention (Bekoulis et al, 2021;Kotonya and Toni, 2020;Guo et al, 2022;Zeng et al, 2021), sharing similar pair-wise semantic inference challenges, together with evidence retrieval.…”