Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence 2020
DOI: 10.24963/ijcai.2020/67
EViLBERT: Learning Task-Agnostic Multimodal Sense Embeddings

Abstract: The problem of grounding language in vision is increasingly attracting scholarly efforts. As of now, however, most of the approaches have been limited to word embeddings, which are not capable of handling polysemous words. This is mainly due to the limited coverage of the available semantically-annotated datasets, hence forcing research to rely on alternative technologies (i.e., image search engines). To address this issue, we introduce EViLBERT, an approach which is able to perform image classificatio…

Cited by 6 publications (8 citation statements). References 22 publications (1 reference statement).
“…The image encoder is used to capture the semantic information contained in the images in a BabelNet synset. Previous studies have shown that images can help learn better semantic representations for concepts and entities (Xie et al., 2017a; Calabrese et al., 2020). We believe that images are also beneficial to SPBS.…”
Section: Image Encoder
confidence: 82%
“…It has been utilized in multiple NLP tasks (Navigli et al., 2021), especially cross-lingual or multilingual tasks, such as multilingual word sense disambiguation (Navigli and Ponzetto, 2012b), cross-lingual lexical entailment (Vyas and Carpuat, 2016) and cross-lingual AMR parsing (Blloshmi et al., 2020). Most of these studies regard BabelNet as a large multilingual sense inventory and utilize the multilingual synonyms and glosses in BabelNet synsets, and some studies also use its images, e.g., Calabrese et al. (2020) learn multimodal sense embeddings with the concepts and images in BabelNet.…”
Section: BabelNet
confidence: 99%
“…We note that different kinds of knowledge are orthogonal to each other and can be exploited in conjunction. For example, token classification models benefit from the logits-adjacency matrix multiplication, binary cross-entropy training, translation-based refinement [Luan et al., 2020] and visual information [Calabrese et al., 2020a].…”
Section: Discussion
confidence: 99%
“…In a different direction, Calabrese et al. [2020a] leverage images from the BabelPic dataset [Calabrese et al., 2020b] to build multimodal gloss vectors, which are shown to be stronger than text-only vectors when used to initialize the weights of the classification matrix (in Eq. 1).…”
Section: Supervised WSD Exploiting Other Knowledge
confidence: 99%
“…using LSTMs (Melamud et al., 2016) or the Transformer architecture (Devlin et al., 2019; Conneau et al., 2020), and are capable of representing words based on the context in which they occur. Contextualized representations have also been used to obtain effective sense embeddings (Loureiro and Jorge, 2019; Scarlini et al., 2020a,b; Calabrese et al., 2020).…”
Section: Introduction
confidence: 99%