Proceedings of the Workshop on Data-Driven Methods in Machine Translation - 2001
DOI: 10.3115/1118037.1118043

A best-first alignment algorithm for automatic extraction of transfer mappings from bilingual corpora

Abstract: Translation systems that automatically extract transfer mappings (rules or examples) from bilingual corpora have been hampered by the difficulty of achieving accurate alignment and acquiring high-quality mappings. We describe an algorithm that uses a best-first strategy and a small alignment grammar to significantly improve the quality of the transfer mappings extracted. For each mapping, frequencies are computed and sufficient context is retained to distinguish competing mappings during translation. Variants of…

Cited by 53 publications (51 citation statements)
References: 13 publications
“…Besides being used in SMT, it is also used in translation lexicon building [8], transfer rule learning [9], example-based machine translation [13], translation memory systems [12], etc.…”
Section: Introduction (mentioning)
confidence: 99%
“…This requires syntactically parsing these corpora and aligning them on the constituent level, producing a collection of parallel treebanks. Various systems have been developed for non-terminal constituent alignment, either making use of hand-crafted rules (Menezes and Richardson, 2001; Groves et al., 2004) or fully unsupervised methods (Zhechev and Way, 2008; Lavie et al., 2008). We have opted to use Lingua-Align, a supervised tree-to-tree aligner implementing discriminative models (Tiedemann, 2010).…”
Section: Introduction (mentioning)
confidence: 99%
“…Along with the efforts made to accumulate bilingual corpora for many language pairs, quite a few machine translation (MT) systems that automatically construct their knowledge from corpora have been proposed (Brown et al., 1993; Menezes and Richardson, 2001; Imamura, 2002). However, if we use corpora without any restriction, redundant rules are acquired due to translation varieties.…”
Section: Introduction (mentioning)
confidence: 99%