Word sense disambiguation and text relatedness based on word thesauri

Τσατσαρώνης, Γεώργιος

doi:10.12681/eadd/17724

Cited by 2 publications

References 82 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

Paraphrasing and textual entailment recognition and generation

Μαλακασιώτης¹

View full text Add to dashboard Cite

Paraphrasing methods recognize, generate, or extract phrases, sentences, or longer natural language expressions that convey almost the same information. Textual entailment methods, on the other hand, recognize, generate, or extract pairs of natural language expressions, such that a human who reads (and trusts) the first element of a pair would most likely infer that the other element is also true. Paraphrasing can be seen as bidirectional textual entailment and methods from the two areas are often very similar. Both kinds of methods are useful, at least in principle, in a wide range of natural language processing applications, including question answering, summarization, text generation, and machine translation.In this thesis, we focus on paraphrase and textual entailment recognition, as well as paraphrase generation. We propose three paraphrase and textual entailment recognition methods, experimentally evaluated on existing benchmarks. The key idea is that by capturing similarities at various abstractions of the inputs, we can recognize paraphrases and textual entailment reasonably well. Additionally, we exploit WordNet and use features that operate on the syntactic level of the language expressions. The best of our three recognition methods achieves state of the art results on the widely used MSR paraphrasing corpus, but the simplest of our methods is also a very competitive baseline. On textual entailment datasets, our methods achieve worse results. Nevertheless, they perform reasonably well, despite being simpler than several other proposed methods; therefore, they can be considered as competitive baselines for future work.

show abstract