Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics - ACL '05 2005
DOI: 10.3115/1219840.1219914

Paraphrasing with bilingual parallel corpora

Abstract: Previous work has used monolingual parallel corpora to extract and generate paraphrases. We show that this task can be done using bilingual parallel corpora, a much more commonly available resource. Using alignment techniques from phrase-based statistical machine translation, we show how paraphrases in one language can be identified using a phrase in another language as a pivot. We define a paraphrase probability that allows paraphrases extracted from a bilingual parallel corpus to be ranked using translation p…

Cited by 322 publications (335 citation statements)
References: 18 publications
“…PPDB is a collection of more than 100 million English paraphrase pairs (Ganitkevitch et al, 2013). These pairs were extracted using a bilingual pivoting technique (Bannard and Callison-Burch, 2005), which assumes that two English phrases that translate to the same foreign phrase have the same meaning. PPDB was updated by Pavlick et al (2015) to assign labels stating the precise entailment relationship between paraphrase pairs (e.g.…”
Section: Data (mentioning)
confidence: 99%
“…This system operates by extracting bilingual correspondences from a bilingual corpus, identifying phrases and words in Language A that all correspond to the same phrase or word in Language B and pivoting on the common phrase or word to extract all Language A words and phrases as paraphrases of each other. More details can be found in [7]. We use the French-English parallel corpus (approx.…”
Section: Raw Para (mentioning)
confidence: 99%
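
The pivoting step described in the statement above can be sketched in a few lines. This is a minimal illustration, not the cited system: the phrase pairs and counts are made-up toy data, the function name `pivot_paraphrases` is hypothetical, and the score (a sum over shared pivot phrases of p(pivot | phrase) times p(candidate | pivot), with probabilities estimated from relative frequencies) is assumed here as the way candidates are ranked.

```python
from collections import defaultdict

# Hypothetical toy data: (language A phrase, language B phrase) -> alignment count.
# Real systems would read these pairs from a phrase table built by word alignment.
ALIGNED_COUNTS = {
    ("under control", "unter kontrolle"): 3,
    ("in check", "unter kontrolle"): 1,
    ("under control", "im griff"): 1,
}


def pivot_paraphrases(aligned_counts):
    """Score language A phrase pairs that share a language B pivot phrase.

    Returns {phrase: {candidate: score}}, where the score sums
    p(pivot | phrase) * p(candidate | pivot) over all shared pivots.
    """
    # Marginal counts used for the relative-frequency estimates.
    count_a = defaultdict(float)
    count_b = defaultdict(float)
    for (a, b), c in aligned_counts.items():
        count_a[a] += c
        count_b[b] += c

    scores = defaultdict(lambda: defaultdict(float))
    for (a1, b1), c1 in aligned_counts.items():
        p_pivot_given_a1 = c1 / count_a[a1]
        for (a2, b2), c2 in aligned_counts.items():
            # Only pair up distinct phrases aligned to the same pivot.
            if b2 != b1 or a2 == a1:
                continue
            p_a2_given_pivot = c2 / count_b[b1]
            scores[a1][a2] += p_pivot_given_a1 * p_a2_given_pivot

    return {a: dict(candidates) for a, candidates in scores.items()}


if __name__ == "__main__":
    for phrase, candidates in pivot_paraphrases(ALIGNED_COUNTS).items():
        ranked = sorted(candidates.items(), key=lambda kv: kv[1], reverse=True)
        print(phrase, "->", ranked)
```

Note that the score is asymmetric: a phrase with few alignments concentrates its probability mass on the pivots it does have, so the two directions of the same pair generally receive different scores.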
“…Then, they translated the sentence using the new phrase table. The paraphrases were generated using bilingual parallel corpora where phrases were extracted when a single source phrase was translated into multiple target phrases [8]. While this method may suffer from errors when substituting the paraphrases into the sentence, syntactic constraints were applied later to improve the results [9].…”
Section: [By Bus] [To Nara] *[Go] (mentioning)
confidence: 99%
“…However, one can also use paraphrases as the input source without changing the translation model. [12] generated a paraphrase lattice for each sentence by building a lattice based on the paraphrase model [8]. Translation is carried out by lattice decoding over the paraphrase lattice with additional features: paraphrase probability, language model score and paraphrase length.…”
Section: Previous Work (mentioning)
confidence: 99%