Evgeny Matusov scite author profile

This paper describes state-of-the-art interfaces between speech recognition and machine translation. We modify two different machine translation systems to effectively process dense speech recognition lattices. In addition, we describe how to fully integrate speech translation with machine translation based on weighted finite-state transducers. With a thorough set of experiments, we show that both the acoustic model scores and the source language model positively and significantly affect the translation quality. We have found consistent improvements on three different corpora compared with translations of single best recognition results.

Symmetric word alignments for statistical machine translation

Matusov

Zens

Ney

2004

In this paper, we address the word alignment problem for statistical machine translation. We aim at creating a symmetric word alignment allowing for reliable one-to-many and many-to-one word relationships. We perform the iterative alignment training in the source-to-target and the target-to-source direction with the well-known IBM and HMM alignment models. Using these models, we robustly estimate the local costs of aligning a source word and a target word in each sentence pair. Then, we use efficient graph algorithms to determine the symmetric alignment with minimal total costs (i.e., maximal alignment probability). We evaluate the automatic alignments created in this way on the German-English Verbmobil task and the French-English Canadian Hansards task. We show statistically significant improvements of the alignment quality compared to the best results reported so far. On the Verbmobil task, we achieve an improvement of more than 1% absolute over the baseline error rate of 4.7%.
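The idea of turning two directional models into local link costs and then picking a low-cost symmetric alignment can be illustrated with a small sketch. It is not the paper's algorithm: it assumes the two directions are given as hypothetical probability matrices `p_s2t` and `p_t2s`, defines the local cost as the sum of their negative logarithms, and uses a simple heuristic (the union of row and column minima) in place of the graph algorithms mentioned in the abstract. The heuristic covers every word on both sides but does not guarantee minimal total cost.

```python
import math

def symmetric_alignment(p_s2t, p_t2s):
    """Return a set of (source_idx, target_idx) links covering every word.

    p_s2t[j][i] and p_t2s[j][i] are assumed (hypothetical) link
    probabilities for source word j and target word i, one matrix per
    translation direction.
    """
    n_src, n_tgt = len(p_s2t), len(p_s2t[0])
    # Local cost: negative log-probabilities of both directions, summed.
    cost = [[-math.log(p_s2t[j][i]) - math.log(p_t2s[j][i])
             for i in range(n_tgt)] for j in range(n_src)]
    links = set()
    # Each source word links to its cheapest target word.
    for j in range(n_src):
        links.add((j, min(range(n_tgt), key=lambda i: cost[j][i])))
    # Each target word links to its cheapest source word.
    for i in range(n_tgt):
        links.add((min(range(n_src), key=lambda j: cost[j][i]), i))
    return links

# Toy sentence pair with 2 source words and 3 target words.
p_s2t = [[0.8, 0.1, 0.1], [0.1, 0.5, 0.4]]
p_t2s = [[0.9, 0.2, 0.1], [0.1, 0.8, 0.9]]
print(sorted(symmetric_alignment(p_s2t, p_t2s)))
```

In the toy example the second source word ends up linked to two target words, which is the kind of one-to-many relationship a single directional model cannot express.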

Novel reordering approaches in phrase-based statistical machine translation

Kanthak

Vilar

Matusov

et al. 2005

This paper presents novel approaches to reordering in phrase-based statistical machine translation. We perform consistent reordering of source sentences in training and estimate a statistical translation model. Using this model, we follow a phrase-based monotonic machine translation approach, for which we develop an efficient and flexible reordering framework that makes it easy to introduce different reordering constraints. In translation, we apply source sentence reordering at the word level and use a reordering automaton as input. We show how to compute reordering automata on demand using IBM or ITG constraints, and also introduce two new types of reordering constraints. We further add weights to the reordering automata. We present detailed experimental results and show that reordering significantly improves translation quality.
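To make the IBM constraints concrete, the following sketch enumerates the word orders they permit. It assumes the commonly cited formulation in which the next source position to be covered must be one of the first `k` still-uncovered positions. The function name `ibm_permutations` is hypothetical, and the sketch lists the permutations directly rather than building the weighted reordering automaton described in the abstract.

```python
def ibm_permutations(n, k):
    """Yield every order of positions 0..n-1 allowed under IBM constraints.

    Assumed formulation: at each step the next position is chosen among
    the first k positions that have not been covered yet.
    """
    def extend(prefix, uncovered):
        if not uncovered:
            yield tuple(prefix)
            return
        # Only the first k uncovered positions are eligible next.
        for pos in uncovered[:k]:
            rest = [u for u in uncovered if u != pos]
            yield from extend(prefix + [pos], rest)

    yield from extend([], list(range(n)))

# With n=3 and k=2, orders that start with the last word are excluded.
for order in ibm_permutations(3, 2):
    print(order)
```

Setting `k=1` leaves only the monotone order, while `k >= n` allows all permutations, so `k` trades reordering freedom against search space size.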

Speech segmentation and spoken document processing

Ostendorf

Favre

Grishman

et al. 2008

IEEE Signal Process. Mag.

Neural Machine Translation Leveraging Phrase-based Models in a Hybrid Search

Dahlmann

Matusov

Petrushkov

et al. 2017

In this paper, we introduce a hybrid search for attention-based neural machine translation (NMT). A target phrase learned with statistical MT models extends a hypothesis in the NMT beam search when the attention of the NMT model focuses on the source words translated by this phrase. Phrases added in this way are scored with the NMT model, but also with SMT features including phrase-level translation probabilities and a target language model. Experimental results on German→English news domain and English→Russian ecommerce domain translation tasks show that using phrase-based models in NMT search improves MT quality by up to 2.3% BLEU absolute as compared to a strong NMT baseline.
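Scoring a phrase-extended hypothesis with both NMT and SMT features can be pictured as a weighted sum of log-scores. The sketch below is only an illustration of that general idea: the function `hybrid_score`, its arguments, and the default weights are all assumptions and are not taken from the paper, which does not specify its scoring formula in this abstract.

```python
import math

def hybrid_score(nmt_logprob, phrase_prob, lm_prob, weights=(1.0, 0.5, 0.5)):
    """Combine an NMT log-probability with two SMT-style features.

    phrase_prob: assumed phrase-level translation probability.
    lm_prob: assumed target language model probability of the phrase.
    weights: hypothetical feature weights (NMT, phrase model, LM).
    """
    w_nmt, w_tm, w_lm = weights
    return (w_nmt * nmt_logprob
            + w_tm * math.log(phrase_prob)
            + w_lm * math.log(lm_prob))

# Two candidate phrases with the same NMT score but different SMT support.
print(hybrid_score(-2.0, phrase_prob=0.6, lm_prob=0.3))
print(hybrid_score(-2.0, phrase_prob=0.1, lm_prob=0.3))
```

With equal NMT scores, the candidate with stronger phrase-table support receives the higher combined score, so it would be preferred in a beam search that ranks hypotheses this way.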

Start-Before-End and End-to-End: Neural Speech Translation by AppTek and RWTH Aachen University

Bahar

Wilken

Alkhouli

et al. 2020

AppTek and RWTH Aachen University team up to participate in the offline and simultaneous speech translation tracks of IWSLT 2020. For the offline task, we create both cascaded and end-to-end speech translation systems, paying attention to careful data selection and weighting. In the cascaded approach, we combine high-quality hybrid automatic speech recognition (ASR) with Transformer-based neural machine translation (NMT). Our end-to-end direct speech translation systems benefit from pretraining of adapted encoder and decoder components, as well as synthetic data and fine-tuning, and thus are able to compete with cascaded systems in terms of MT quality. For simultaneous translation, we utilize a novel architecture that makes dynamic decisions, learned from parallel data, to determine when to continue feeding on input or generate output words. Experiments with speech and text input show that even at low latency this architecture leads to superior translation results.

Can Neural Machine Translation be Improved with User Feedback?

Kreutzer

Khadivi

Matusov

et al. 2018

We present the first real-world application of methods for improving neural machine translation (NMT) with human reinforcement, based on explicit and implicit user feedback collected on the eBay ecommerce platform. Previous work has been confined to simulation experiments, whereas in this paper we work with real logged feedback for offline bandit learning of NMT parameters. We conduct a thorough analysis of the available explicit user judgments (five-star ratings of translation quality) and show that they are not reliable enough to yield significant improvements in bandit learning. In contrast, we successfully utilize implicit task-based feedback collected in a cross-lingual search task to improve task-specific and machine translation quality metrics.

Customizing Neural Machine Translation for Subtitling

Matusov

Wilken

Georgakopoulou

2019

In this work, we customized a neural machine translation system for translation of subtitles in the domain of entertainment. The neural translation model was adapted to the subtitling content and style and extended by a simple, yet effective technique for utilizing intersentence context for short sentences such as dialog turns. The main contribution of the paper is a novel subtitle segmentation algorithm that predicts the end of a subtitle line given the previous word-level context using a recurrent neural network learned from human segmentation decisions. This model is combined with subtitle length and duration constraints established in the subtitling industry. We conducted a thorough human evaluation with two post-editors (English-to-Spanish translation of a documentary and a sitcom). It showed a notable productivity increase of up to 37% as compared to translating from scratch and significant reductions in human translation edit rate in comparison with the post-editing of the baseline non-adapted system without a learned segmentation model.
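The role of the length constraint in subtitle segmentation can be shown with a deliberately simple sketch. It replaces the paper's learned recurrent model with a greedy rule that breaks a line only when the next word would exceed a character limit. The function name `segment_subtitle` and the default limit of 42 characters are assumptions for illustration, not values stated in the abstract.

```python
def segment_subtitle(words, max_chars=42):
    """Greedily pack words into lines of at most max_chars characters.

    A learned model would instead score each possible break point;
    here the only criterion is the (assumed) line length limit.
    A single word longer than max_chars is kept on its own line.
    """
    lines, current = [], ""
    for word in words:
        candidate = word if not current else current + " " + word
        if len(candidate) <= max_chars or not current:
            current = candidate
        else:
            # The word does not fit: close the line and start a new one.
            lines.append(current)
            current = word
    if current:
        lines.append(current)
    return lines

text = "I never said that we should leave before the storm reaches the coast"
for line in segment_subtitle(text.split(), max_chars=30):
    print(line)
```

A learned segmenter could be combined with such a rule by letting the model propose break points and using the length limit as a hard constraint on which proposals are accepted.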



Integrating Speech Recognition and Machine Translation: Where do We Stand?