Proceedings of the Ninth Conference on Computational Natural Language Learning - CONLL '05 2005
DOI: 10.3115/1706543.1706548

New experiments in distributional representations of synonymy

Abstract: Recent work on the problem of detecting synonymy through corpus analysis has used the Test of English as a Foreign Language (TOEFL) as a benchmark. However, this test involves as few as 80 questions, prompting questions regarding the statistical significance of reported results. We overcome this limitation by generating a TOEFL-like test using WordNet, containing thousands of questions and composed only of words occurring with sufficient corpus frequency to support sound distributional comparisons. Experiments…
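
The sample-size concern raised in the abstract can be illustrated with a rough calculation. The sketch below uses the normal approximation to the binomial to estimate a 95% margin of error on a measured accuracy. The accuracy of 0.80 and the question count of 5,000 are hypothetical values chosen for illustration, not figures from the paper, and the function name `margin_of_error` is an assumption.

```python
import math


def margin_of_error(accuracy, n_questions, z=1.96):
    """Approximate 95% margin of error for an accuracy measured on n questions.

    Uses the normal approximation to the binomial distribution, which is
    only a rough guide when n is small or accuracy is near 0 or 1.
    """
    if n_questions <= 0:
        raise ValueError("n_questions must be positive")
    standard_error = math.sqrt(accuracy * (1.0 - accuracy) / n_questions)
    return z * standard_error


# Hypothetical accuracy of 0.80 on the 80-question TOEFL set versus an
# illustrative 5,000-question set (both values are assumptions).
small_test = margin_of_error(0.80, 80)
large_test = margin_of_error(0.80, 5000)
print(f"80 questions:   +/- {small_test:.3f}")
print(f"5000 questions: +/- {large_test:.3f}")
```

Under these assumptions the interval on 80 questions is several times wider than on a test with thousands of questions, which is why small differences between systems are hard to separate on the original benchmark.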

Citations: cited by 44 publications (51 citation statements)
References: 14 publications
“…In the well-known TOEFL evaluation (Freitag et al., 2005) models are required to identify true synonyms to question words from a selection of possible answers. To test our models on this task, for each question in the dataset, we rank the multiple-choice answers according to the cosine similarity between their word embeddings and that of the target word, and choose the highest-ranked option.…”
Section: Toefl Synonym Task (mentioning)
confidence: 99%
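
The ranking procedure described in this citation statement can be sketched as follows. This is a minimal illustration, not the citing authors' code: the toy two-dimensional embeddings, the example question, and the function names `cosine` and `answer_question` are all assumptions made for the example.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def answer_question(target, candidates, embeddings):
    """Pick the candidate whose embedding is most similar to the target's."""
    target_vec = embeddings[target]
    return max(candidates, key=lambda word: cosine(target_vec, embeddings[word]))


# Toy embeddings, invented for illustration only.
embeddings = {
    "big": [1.0, 0.1],
    "large": [0.9, 0.2],
    "small": [-1.0, 0.1],
    "green": [0.0, 1.0],
    "fast": [0.2, -0.9],
}

choice = answer_question("big", ["small", "large", "green", "fast"], embeddings)
print(choice)
```

Accuracy on a full test would then be the fraction of questions for which the chosen option matches the gold synonym.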
“…The B2SG [25] test is similar to the WordNet-Based Synonymy Test [26], but based on the Portuguese part of BabelNet [13] and partially evaluated by humans. It contains frequent Portuguese nouns and verbs (target), each followed by four candidates, from which only one is related, and is organised in six files: two for synonymy, two for hypernymy, and two for antonymy, respectively, between nouns and for verbs.…”
Section: Selecting the Most Similar Word From A Small Set (mentioning)
confidence: 99%
“…As a secondary evaluation, authors often use TOEFL synonymy tests [19,6] where the task is to distinguish the synonym of a given word in a given context, among four candidate items. This is a sort of easier word disambiguation test where the task is to separate a word from unrelated distractors, instead of distinguishing between word senses.…”
Section: Related Work (mentioning)
confidence: 99%
“…We lemmatized both French and English sentences using TreeTagger. Then, we trained in both directions (English-to-French and French-to-English) statistical translation models, running the Giza++ toolkit in its standard setting. Our translation mirror approach makes use of the lexical distribution of the two models, p_e2f and p_f2e, we obtained this way (see tab 3 for an example).…”
Section: Translation Mirrors (mentioning)
confidence: 99%