2017
DOI: 10.1515/pralin-2017-0011

Using Word Embeddings to Enforce Document-Level Lexical Consistency in Machine Translation

Abstract: We integrate new mechanisms in a document-level machine translation decoder to improve the lexical consistency of document translations. First, we develop a document-level feature designed to score the lexical consistency of a translation. This feature, which applies to words that have been translated into different forms within the document, uses word embeddings to measure the adequacy of each word translation given its context. Second, we extend the decoder with a new stochastic mechanism that, at translatio…
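The abstract describes a document-level feature that uses word embeddings to judge how well each translated form of an inconsistently translated word fits its context. The sketch below illustrates one plausible reading of such a feature; the data layout, the cosine-against-mean-context scoring, and all names are assumptions for illustration, not the paper's exact formulation.

```python
# Hedged sketch (not the authors' implementation): a document-level feature that
# scores lexical consistency with word embeddings. Assumed design: for each source
# word translated into more than one target form within a document, every occurrence
# is scored by the cosine similarity between the chosen target form's embedding and
# the mean embedding of its target-side context; the feature is the sum of these scores.
from collections import defaultdict
import numpy as np

def cosine(u, v):
    denom = np.linalg.norm(u) * np.linalg.norm(v)
    return float(u @ v / denom) if denom else 0.0

def lexical_consistency_feature(occurrences, embeddings):
    """occurrences: list of (source_word, target_word, target_context_words) tuples
    covering the whole document; embeddings: dict mapping word -> np.ndarray."""
    forms = defaultdict(set)
    for src, tgt, _ in occurrences:
        forms[src].add(tgt)
    score = 0.0
    for src, tgt, context in occurrences:
        if len(forms[src]) < 2:  # only inconsistently translated words are scored
            continue
        ctx_vecs = [embeddings[w] for w in context if w in embeddings]
        if tgt not in embeddings or not ctx_vecs:
            continue
        score += cosine(embeddings[tgt], np.mean(ctx_vecs, axis=0))
    return score

# Toy usage with random embeddings (real embeddings would be pre-trained).
rng = np.random.default_rng(0)
vocab = ["bank", "river", "money", "shore", "deposit"]
emb = {w: rng.normal(size=50) for w in vocab}
occs = [("banque", "bank", ["money", "deposit"]),
        ("banque", "shore", ["river"])]
print(lexical_consistency_feature(occs, emb))
```

In this reading, a document whose repeated source words are rendered by context-appropriate target forms receives a higher feature value, which a decoder could then weigh against its other scores.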

Cited by 16 publications (8 citation statements); references 4 publications.
“…This word segmentation method is not efficient, but its proposal lays the foundation for Chinese automatic word segmentation technology [7]. Relevant scholars have theorized the Chinese word segmentation method and proposed the "minimum number of words" segmentation theory; that is, each sentence should be segmented with the least number of words [8]. This word segmentation method is an improvement on the "word dictionary" word segmentation method, which has promoted the development of Chinese word segmentation technology.…”
Section: Introduction
confidence: 99%
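The excerpt above recalls the "minimum number of words" criterion: segment each sentence into as few dictionary words as possible. A minimal sketch of that criterion via dynamic programming follows; the toy dictionary, the single-character fallback, and the tie-breaking are illustrative assumptions, not a reconstruction of the cited method.

```python
# Hedged sketch of the "minimum number of words" criterion: segment a sentence
# into the fewest dictionary words via dynamic programming over prefixes.
def min_word_segment(sentence, dictionary, max_word_len=4):
    n = len(sentence)
    best = [float("inf")] * (n + 1)  # best[i] = fewest words covering sentence[:i]
    back = [0] * (n + 1)             # back[i] = start index of the last word used
    best[0] = 0
    for i in range(1, n + 1):
        for j in range(max(0, i - max_word_len), i):
            piece = sentence[j:i]
            # single characters are always allowed as a fallback "word"
            if (piece in dictionary or len(piece) == 1) and best[j] + 1 < best[i]:
                best[i] = best[j] + 1
                back[i] = j
    words, i = [], n
    while i > 0:
        words.append(sentence[back[i]:i])
        i = back[i]
    return list(reversed(words))

# Toy usage with a tiny dictionary of Chinese words.
dic = {"研究", "生命", "起源", "研究生"}
print(min_word_segment("研究生命起源", dic))  # -> ['研究', '生命', '起源'] (3 words)
```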
“…Another line of document-level NMT work (Xiong et al., 2018; Voita et al., 2019b) proposed a two-pass document decoding model inspired by the deliberation network (Xia et al., 2017) in order to incorporate target-side document context. A parallel line of work (Garcia et al., 2017, 2019; Yu et al., 2019) introduced document-level approaches that do not require training a context-conditional NMT model, instead introducing a separate language model to enforce consistency in the outputs of a sentence-level NMT model. Garcia et al. (2019) used a simple n-gram based semantic space language model (Hardmeier et al., 2012) to re-rank the outputs of the sentence-level NMT model inside the beam-search algorithm to enforce document-level consistency.…”
Section: Related Work
confidence: 99%
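The excerpt above describes pairing a sentence-level NMT system with a separate document-level model that re-ranks its hypotheses to enforce consistency. Below is a generic greedy re-ranker under assumed scoring details (a linear interpolation weight and a toy consistency score); it is not the semantic space language model of Hardmeier et al. (2012) nor the beam-search integration of Garcia et al. (2019).

```python
# Hedged sketch of re-ranking for document-level consistency: combine each
# hypothesis' sentence-level NMT score with a document-level consistency score
# and keep the best-scoring hypothesis per sentence. The linear interpolation,
# the weight, and `doc_consistency_score` are illustrative assumptions.
from typing import Callable, List, Tuple

Hypothesis = Tuple[str, float]  # (translation, sentence-level NMT log-score)

def rerank_document(nbest_lists: List[List[Hypothesis]],
                    doc_consistency_score: Callable[[str, List[str]], float],
                    weight: float = 0.3) -> List[str]:
    """Greedily pick one hypothesis per sentence, scoring each candidate against
    the translations already chosen for the preceding sentences."""
    chosen: List[str] = []
    for nbest in nbest_lists:
        best_hyp, best_score = None, float("-inf")
        for text, nmt_score in nbest:
            score = nmt_score + weight * doc_consistency_score(text, chosen)
            if score > best_score:
                best_hyp, best_score = text, score
        chosen.append(best_hyp)
    return chosen

# Toy consistency score: reward reusing words already produced in the document.
def toy_consistency(text: str, context: List[str]) -> float:
    seen = {w for sent in context for w in sent.split()}
    return sum(1.0 for w in text.split() if w in seen)

nbest = [[("the bank approved the loan", -1.0), ("the shore approved the loan", -0.9)],
         [("she went to the bank", -1.2), ("she went to the shore", -1.1)]]
print(rerank_document(nbest, toy_consistency))
```

A real system would plug an actual document-level model into `doc_consistency_score` and tune `weight` on held-out data.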
“…Xiong et al. [29] propose to learn the topic structure of the source document and then map that structure to the target translation. In addition to these approaches leveraging discourse-level linguistic features for document translation, Garcia et al. [4] incorporate new word embedding features into the decoder to improve the lexical consistency of translations. Document-level NMT: In the context of neural machine translation, previous studies first incorporate contextual information into NMT models built on RNN networks.…”
Section: Related Work
confidence: 99%
“…For English-German translation, we used the WMT19 bilingual document-level training data [5], which contains 39k documents with …”
Footnotes: [3] http://nlp.nju.edu.cn/cwmt-wmt; [4] https://www.sogou.com/labs/resource/list_news.php; [5] https://s3-eu-west-1.amazonaws.com/tilde-model/rapid2019.de-en.zip
Section: Experimental Setting
confidence: 99%