2013
DOI: 10.5120/11638-7118

A Survey of Text Similarity Approaches

Abstract: Measuring the similarity between words, sentences, paragraphs and documents is an important component in various tasks such as information retrieval, document clustering, word-sense disambiguation, automatic essay scoring, short answer grading, machine translation and text summarization. This survey discusses the existing works on text similarity through partitioning them into three approaches: String-based, Corpus-based and Knowledge-based similarities. Furthermore, samples of combination between these similar…

Cited by 560 publications
(320 citation statements)
References 25 publications
“…Text similarity measurement is a text mining approach that could be overcome this overwhelming problem. Finding the similarity between words is a primary stage for sentence, paragraph and document similarities [2]. Text similarity approach may alleviate people on finding relevant information.…”
Section: Introduction (mentioning)
confidence: 99%
“…Lexical and semantic similarity words is an essential element of sentence, paragraph and document similarity measurement [2]. Lexical similarity a degree of two given string are similar in its character sequence.…”
Section: Introduction (mentioning)
confidence: 99%
“…Text similarity measures have been widely used in several natural language processing applications such as automatic essay grading, paraphrase recognition, etc [1][2][3]. Previous studies on text similarity were mostly concerned about the semantic typing in terms of two mechanisms: the detection of similarity and difference in the form of judgments of likeness in which other potential inconsistency that can be resulted from judgments of difference.…”
Section: Introduction (mentioning)
confidence: 99%
“…Sentence textual similarity is a crucial and a prerequisite subtask for many text processing and NLP tasks including text summarization, document classification, text clustering, topic detection, automatic question answering, automatic text scoring, plagiarism detection, machine translation, conversational agents among others (Ali, Ghosh, & Al-Mamun, 2009; Gomaa & Fahmy, 2013; Haque, Naskar, Way, Costa-Jussà, & Banchs, 2010; K. O'Shea, 2012; Osman, Salim, Binwahlan, Alteeb, & Abuobieda, 2012).…”
Section: Introduction (mentioning)
confidence: 99%