“…A score close to 1 indicates that the embedding agrees closely with human judgement. We use the following classic datasets: MC-30 (Miller and Charles, 1991), MEN (Bruni et al., 2014), MTurk-287 (Radinsky et al., 2011), MTurk-771 (Halawi et al., 2012), RG-65 (Rubenstein and Goodenough, 1965), RW (Luong et al., 2013), SimVerb-3500 (Gerz et al., 2016), WordSim-353 (Finkelstein et al., 2001) and YP-130 (Yang and Powers, 2006). We follow the same protocol used by Word2vec and fastText and discard any pair containing a word that is not in our embedding.…”
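
The protocol described above can be illustrated with a minimal sketch. It assumes, as is common for these datasets but not stated in the passage itself, that the score is the Spearman rank correlation between the cosine similarity of each word pair and its human rating. The function names, the toy two-dimensional vectors and the ratings are all hypothetical and serve only to show how pairs with an out-of-vocabulary word are dropped before scoring.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    dev_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    dev_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (dev_x * dev_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def evaluate(embedding, pairs):
    """Score an embedding on (word1, word2, human_rating) triples.

    Pairs containing a word missing from the embedding are discarded.
    Returns the correlation and the number of pairs actually used.
    """
    model_scores, human_scores = [], []
    for w1, w2, rating in pairs:
        if w1 not in embedding or w2 not in embedding:
            continue  # out-of-vocabulary word: skip the whole pair
        model_scores.append(cosine(embedding[w1], embedding[w2]))
        human_scores.append(rating)
    if len(human_scores) < 2:
        raise ValueError("need at least two in-vocabulary pairs")
    return spearman(model_scores, human_scores), len(human_scores)


# Hypothetical toy data, not taken from any of the datasets above.
embedding = {
    "cat": [1.0, 0.0],
    "dog": [0.9, 0.1],
    "car": [0.0, 1.0],
    "truck": [0.2, 0.8],
}
pairs = [
    ("cat", "dog", 9.0),
    ("car", "truck", 8.5),
    ("cat", "car", 1.0),
    ("dog", "truck", 2.0),
    ("cat", "unicorn", 5.0),  # "unicorn" is not in the embedding
]
rho, used = evaluate(embedding, pairs)
print(f"correlation = {rho:.3f} over {used} of {len(pairs)} pairs")
```

Reporting the number of pairs actually used alongside the correlation is a deliberate choice in this sketch: because out-of-vocabulary pairs are discarded, two embeddings with different vocabularies may be scored on different subsets of the same dataset.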