“…Since WSC was proposed as a benchmark for commonsense reasoning (Levesque et al, 2012), there have been many attempts to improve performance on it, using approaches that include web queries (Rahman and Ng, 2012; Sharma et al, 2015; Emami et al, 2018), external knowledge sources (Sharma, 2019), information extraction and reasoning (Isaak and Michael, 2016), and more (Peng et al, 2015; Liu et al, 2017a,b; Fähndrich et al, 2018; Klein and Nabi, 2019; Zhang et al, 2019, 2020a). Newer approaches use LMs to assign a probability to a sentence by replacing the pronoun with each candidate entity, one at a time, and picking the more probable sentence (Trinh and Le, 2018; Opitz and Frank, 2018; Radford et al, 2019; Kocijan et al, 2019).…”
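The LM-based substitution procedure described in the excerpt can be sketched as follows. This is a minimal illustration, not the method of any cited paper: it assumes a toy add-one-smoothed bigram model in place of a neural LM, and the corpus, the `[PRONOUN]` placeholder, and the function names `train_bigram`, `log_prob`, and `resolve_pronoun` are all hypothetical.

```python
import math
from collections import Counter

# Hypothetical toy corpus standing in for LM training data (assumption).
CORPUS = [
    "the mouse squeaked",
    "the cat purred",
    "the cat chased the mouse",
]


def train_bigram(corpus):
    """Count unigrams and bigrams over whitespace tokens, with a start symbol."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in corpus:
        tokens = ["<s>"] + sentence.lower().split()
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams


def log_prob(sentence, model):
    """Log-probability of a sentence under the add-one-smoothed bigram model."""
    unigrams, bigrams = model
    vocab_size = len(unigrams) + 1  # one extra slot for unseen words
    tokens = ["<s>"] + sentence.lower().split()
    total = 0.0
    for prev, cur in zip(tokens, tokens[1:]):
        total += math.log((bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size))
    return total


def resolve_pronoun(template, candidates, model):
    """Substitute each candidate for the placeholder and keep the best-scoring one."""
    scores = {
        cand: log_prob(template.replace("[PRONOUN]", cand), model)
        for cand in candidates
    }
    best = max(scores, key=scores.get)
    return best, scores


model = train_bigram(CORPUS)
best, scores = resolve_pronoun(
    "the cat chased the mouse because [PRONOUN] squeaked",
    ["the cat", "the mouse"],
    model,
)
print(best, scores)
```

A bigram model only sees adjacent words, so the toy sentence places the disambiguating word directly after the pronoun. Real Winograd schemas generally require longer-range context, which is one reason the approaches cited in the excerpt rely on larger LMs.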