“…Research on lexical entailment using distributional semantics has now spanned more than a decade, and has been approached using both unsupervised (Weeds et al, 2004;Kotlerman et al, 2010;Lenci and Benotto, 2012;Santus, 2013) and supervised techniques (Baroni et al, 2012;Fu et al, 2014;Roller et al, 2014;Weeds et al, 2014;Kruszewski et al, 2015;Turney and Mohammad, 2015;Santus et al, 2016). Most of the work in unsupervised methods is based on the Distributional Inclusion Hypothesis (Weeds et al, 2004;Zhitomirsky-Geffet and Dagan, 2005), which states that the contexts in which a hypernym appear should be a superset over its hyponyms' contexts.…”