“…These approaches consider gloss prediction based on sense-specific word embeddings (Gadetsky et al., 2018; Kabiri and Cook, 2020; Zhu et al., 2019), and on a word-based context indicating the word sense (Bevilacqua et al., 2020; Gadetsky et al., 2018; Mickus et al., 2019; Yang et al., 2020). The proposed approaches are based either on RNNs (Gadetsky et al., 2018; Kabiri and Cook, 2020; Zhu et al., 2019) or Transformers (Bevilacqua et al., 2020; Mickus et al., 2019). All of the previous approaches rely on word embeddings pre-trained on large corpora, most commonly word2vec (Mikolov et al., 2013).…”