2005
DOI: 10.1007/11564126_22
|View full text |Cite
|
Sign up to set email alerts
|

Mining Paraphrases from Self-anchored Web Sentence Fragments

Abstract: Abstract. Near-synonyms or paraphrases are beneficial in a variety of natural language and information retrieval applications, but so far their acquisition has been confined to clean, trustworthy collections of documents with explicit external attributes. When such attributes are available, such as similar time stamps associated to a pair of news articles, previous approaches rely on them as signals of potentially high content overlap between the articles, often embodied in sentences that are only slight, para… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2010
2010
2015
2015

Publication Types

Select...
2
1
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(4 citation statements)
references
References 15 publications
(21 reference statements)
0
4
0
Order By: Relevance
“…Ganitkevitch et al (2013) use the bilingual pivoting technique (Bannard and Callison-Burch 2005) along with distributional similarity features to extract lexical, and phrasal paraphrases. Some other approaches (Paşca 2005;Lin and Pantel 2001;Berant et al 2012) differ from ours in that, they use manually coded linguistic patterns to align only specific text fragment contexts to generate paraphrases (Paşca 2005), and require language specific resources such as part-of-speech taggers (Paşca 2005) and parsers (Lin and Pantel 2001). Furthermore, the latter two only find alternate constructions with the same content words, such as "X manufactures Y" infers "X's Y factory" (Lin and Pantel 2001).…”
Section: Multi-word Phrasesmentioning
confidence: 95%
See 1 more Smart Citation
“…Ganitkevitch et al (2013) use the bilingual pivoting technique (Bannard and Callison-Burch 2005) along with distributional similarity features to extract lexical, and phrasal paraphrases. Some other approaches (Paşca 2005;Lin and Pantel 2001;Berant et al 2012) differ from ours in that, they use manually coded linguistic patterns to align only specific text fragment contexts to generate paraphrases (Paşca 2005), and require language specific resources such as part-of-speech taggers (Paşca 2005) and parsers (Lin and Pantel 2001). Furthermore, the latter two only find alternate constructions with the same content words, such as "X manufactures Y" infers "X's Y factory" (Lin and Pantel 2001).…”
Section: Multi-word Phrasesmentioning
confidence: 95%
“…The Near Synonym System (NeSS) introduces a new method which differs from other approaches in that it does not require parallel resources, (unlike Barzilay and McKeown 2001;Lin et al 2003;Callison-Burch et al 2006;Ganitkevitch et al 2013) nor does it use pre-determined sets of manually coded patterns (Lin et al 2003;Paşca, 2005). NeSS captures semantic similarity via n-gram distributional methods that implicitly preserve local syntactic structure without parsing, making the underlying method language independent.…”
Section: Ness: Near-synonym Systemmentioning
confidence: 99%
“…Performance of applications relying on natural language processing may suffer from the fact that the processed documents might contain lexically different, yet semantically related, text segments. The task of recognizing synonym text segments, which is better known as paraphrase recognition, or detection, is challenging and difficult to solve, as shown in the work of Pasca (2005). The task itself is important for many text related applications, like summarization (Hirao, Fukusima, Oku-mura, Nobata, & Nanba, 2005), information extraction (Shinyama & Sekine, 2003) and question answering (Pasca, 2003).…”
Section: Paraphrase Recognition and Sentence-to-sentence Similaritymentioning
confidence: 99%
“…The performance of document processing applications relying on natural language processing may suffer from the fact that the processed documents might contain lexically different, yet semantically related, text segments. The task of recognizing pairs of text segments, with identical or almost identical semantics, which is better known as paraphrase detection, is challenging and difficult to solve, as shown in the work of Mihalcea et al [66], and Pasca [85]. The task itself is important for many text related applications, like summarization [38], information extraction [104] and question answering [84].…”
Section: Paraphrasingmentioning
confidence: 99%