Proceedings of the 8th International Workshop on Semantic Evaluation (SemEval 2014) 2014
DOI: 10.3115/v1/s14-2069
LIPN: Introducing a new Geographical Context Similarity Measure and a Statistical Similarity Measure based on the Bhattacharyya coefficient

Abstract: This paper describes the system used by the LIPN team in Task 10, Multilingual Semantic Textual Similarity, at SemEval 2014, in both the English and Spanish sub-tasks. The system uses a support vector regression model that combines different text similarity measures as features. With respect to our 2013 participation, we included a new feature that takes the geographical context into account and a new semantic distance based on the Bhattacharyya distance, calculated on co-occurrence distributions derived from the …
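The abstract's Bhattacharyya-based semantic distance compares two words through their co-occurrence distributions. A minimal sketch of that idea follows; the function names, the context-window scheme, and the use of the negative log of the coefficient as the distance are illustrative assumptions, not the paper's actual implementation.

```python
from collections import Counter
from math import sqrt, log

def cooccurrence_distribution(word, corpus, window=2):
    # Build a normalized co-occurrence distribution for `word` over a
    # tokenized corpus (list of token lists), using a symmetric window.
    # The window size of 2 is an arbitrary choice for illustration.
    counts = Counter()
    for sentence in corpus:
        for i, token in enumerate(sentence):
            if token == word:
                lo, hi = max(0, i - window), i + window + 1
                counts.update(t for j, t in enumerate(sentence[lo:hi], lo) if j != i)
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()} if total else {}

def bhattacharyya_coefficient(p, q):
    # BC(p, q) = sum_w sqrt(p(w) * q(w)); 1.0 for identical distributions,
    # 0.0 for distributions with disjoint support.
    return sum(sqrt(p[w] * q[w]) for w in p.keys() & q.keys())

def bhattacharyya_distance(p, q):
    # One common distance derived from the coefficient: -ln(BC).
    bc = bhattacharyya_coefficient(p, q)
    return float("inf") if bc == 0 else -log(bc)
```

For example, two words that appear in identical contexts yield a coefficient of 1.0 and a distance of 0.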

Cited by 3 publications (4 citation statements). References 3 publications (2 reference statements).
“…Table 3 shows the results of the English subtask, with runs listed in alphabetical order. The correlation in each dataset is given. Participating teams: Bielefeld SC (McCrae et al., 2013), BUAP (Vilariño et al., 2014), DLS@CU (Sultan et al., 2014b), FBK-TR (Vo et al., 2014), IBM EG (no information), LIPN (Buscaldi et al., 2014), Meerkat Mafia (Kashyap et al., 2014), NTNU (Lynum et al., 2014), RTM-DCU (Biçici and Way, 2014), SemantiKLUE (Proisl et al., 2014), StanfordNLP (Socher et al., 2014), TeamZ (Gupta, 2014), UMCC DLSI SemSim (Chavez et al., 2014), UNAL-NLP, UNED (Martinez-Romo et al., 2011), UoW (Rios, 2014). Table 3: English evaluation results. Results at the top correspond to out-of-the-box systems.…”
Section: English Subtask
confidence: 99%
“…Overall, most systems were cross-lingual, relying on different translation approaches, such as 1) translating the test data into English (as the two systems above), then exporting the score obtained for the English sentences back to Spanish, or 2) automatically translating the English training data and learning a classifier directly in Spanish. Buscaldi et al. (2014) supplemented their training dataset with human annotations conducted in Spanish, using definition pairs extracted from a Spanish dictionary. A different angle was explored by Rios (2014), who proposed a multilingual framework using transfer learning across English and Spanish by training on traditional lexical, knowledge-based and corpus-based features.…”
Section: Spanish Subtask
confidence: 99%
“…And select the utterance that has a high n-gram overlap score, and add it to the set of diverse utterances. iii) Iterate over the remaining set of generated utterances; in each iteration, compute the n-gram similarity score (Buscaldi et al., 2013) between the current utterance and the set of diverse utterances, and based on the computed scores, add the utterance with the least n-gram similarity to the set of diverse utterances. Also, during each iteration we check for the stopping criterion, i.e.…”
Section: Figure 6: LLM Prompts
confidence: 99%
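The greedy diverse-utterance selection quoted above (repeatedly adding the remaining candidate least similar to the already-selected set) might be sketched as follows. Using Jaccard overlap over bigrams as the n-gram similarity, and seeding the selection with the first utterance, are assumptions for illustration; the statement does not specify the exact measure of Buscaldi et al. (2013).

```python
def ngram_set(text, n=2):
    # Tokenize on whitespace and collect the set of n-grams.
    tokens = text.split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def ngram_overlap(a, b, n=2):
    # Jaccard overlap of n-gram sets (an assumed similarity measure).
    sa, sb = ngram_set(a, n), ngram_set(b, n)
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)

def select_diverse(utterances, k):
    # Greedy selection: seed with the first utterance (an assumption),
    # then repeatedly add the candidate whose maximum similarity to the
    # selected set is lowest, until k utterances are chosen.
    remaining = list(utterances)
    selected = [remaining.pop(0)]
    while remaining and len(selected) < k:
        best = min(remaining,
                   key=lambda u: max(ngram_overlap(u, s) for s in selected))
        selected.append(best)
        remaining.remove(best)
    return selected
```

A stopping criterion on diversity (as mentioned in the quote) could be added by breaking out of the loop once the lowest achievable similarity exceeds a threshold.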
“…Our participation in SemEval 2015 was focused on solving the technical problems that afflicted our previous participation (Buscaldi et al., 2014) and on including additional alignment-based features, such as the Sultan similarity (Sultan et al., 2014b) and the measure available in CMU Sphinx-4 (Lamere et al., 2003) for speech recognition. We baptised the new system SOPA, from the Spanish word for "soup", since it uses a heterogeneous mix of features.…”
Section: Introduction
confidence: 99%