N-gram-based Machine Translation

This article describes in detail an n-gram approach to statistical machine translation. The approach is a log-linear combination of a translation model based on n-grams of bilingual units, referred to as tuples, and four specific feature functions. Translation performance, which is at the state of the art, is demonstrated with Spanish-to-English and English-to-Spanish translations of the European Parliament Plenary Sessions (EPPS).

Neural Machine Translation Decoding with Terminology Constraints

Hasler, Gispert, Iglesias et al. 2018

Despite the impressive quality improvements yielded by neural machine translation (NMT) systems, controlling their translation output to adhere to user-provided terminology constraints remains an open problem. We describe our approach to constrained neural decoding based on finite-state machines and multistack decoding which supports target-side constraints as well as constraints with corresponding aligned input text spans. We demonstrate the performance of our framework on multiple translation tasks and motivate the need for constrained decoding with attentions as a means of reducing misplacement and duplication when translating user constraints.

The University of Cambridge’s Machine Translation Systems for WMT18

Stahlberg, Gispert, Byrne 2018

The University of Cambridge submission to the WMT18 news translation task focuses on the combination of diverse models of translation. We compare recurrent, convolutional, and self-attention-based neural models on German-English, English-German, and Chinese-English. Our final system combines all neural models together with a phrase-based SMT system in an MBR-based scheme. We report small but consistent gains on top of strong Transformer ensembles.

Hierarchical Phrase-Based Translation with Weighted Finite-State Transducers and Shallow-n Grammars

Gispert, Iglesias, Blackwood et al. 2010

Computational Linguistics

In this article we describe HiFST, a lattice-based decoder for hierarchical phrase-based translation and alignment. The decoder is implemented with standard Weighted Finite-State Transducer (WFST) operations as an alternative to the well-known cube pruning procedure. We find that the use of WFSTs rather than k-best lists requires less pruning in translation search, resulting in fewer search errors, better parameter optimization, and improved translation performance. The direct generation of translation lattices in the target language can improve subsequent rescoring procedures, yielding further gains when applying long-span language models and Minimum Bayes Risk decoding. We also provide insights as to how to control the size of the search space defined by hierarchical rules. We show that shallow-n grammars, low-level rule catenation, and other search constraints can help to match the power of the translation system to specific language pairs.

Neural Machine Translation by Minimising the Bayes-risk with Respect to Syntactic Translation Lattices

Stahlberg, Gispert, Hasler et al. 2017

We present a novel scheme to combine neural machine translation (NMT) with traditional statistical machine translation (SMT). Our approach borrows ideas from linearised lattice minimum Bayes-risk decoding for SMT. The NMT score is combined with the Bayes-risk of the translation according to the SMT lattice. This makes our approach much more flexible than n-best list or lattice rescoring as the neural decoder is not restricted to the SMT search space. We show an efficient and simple way to integrate risk estimation into the NMT decoder which is suitable for word-level as well as subword-unit-level NMT. We test our method on English-German and Japanese-English and report significant gains over lattice rescoring on several data sets for both single and ensembled NMT. The MBR decoder produces entirely new hypotheses far beyond simply rescoring the SMT search space or fixing UNKs in the NMT output.

A Comparison of Neural Models for Word Ordering

Hasler, Stahlberg, Tomalin et al. 2017

We compare several language models for the word-ordering task and propose a new bag-to-sequence neural model based on attention-based sequence-to-sequence models. We evaluate the model on a large German WMT data set where it significantly outperforms existing models. We also describe a novel search strategy for LM-based word ordering and report results on the English Penn Treebank. Our best model setup outperforms prior work both in terms of speed and quality.

Multi-representation ensembles and delayed SGD updates improve syntax-based NMT

Saunders, Stahlberg, Gispert et al. 2018

We explore strategies for incorporating target syntax into Neural Machine Translation. We specifically focus on syntax in ensembles containing multiple sentence representations. We formulate beam search over such ensembles using WFSTs, and describe a delayed SGD update training procedure that is especially effective for long representations like linearized syntax. Our approach gives state-of-the-art performance on a difficult Japanese-English task.

Domain Adaptive Inference for Neural Machine Translation

Saunders, Stahlberg, Gispert et al. 2019

We investigate adaptive ensemble weighting for Neural Machine Translation, addressing the case of improving performance on a new and potentially unknown domain without sacrificing performance on the original domain. We adapt sequentially across two Spanish-English and three English-German tasks, comparing unregularized fine-tuning, L2 and Elastic Weight Consolidation. We then report a novel scheme for adaptive NMT ensemble decoding by extending Bayesian Interpolation with source information, and show strong improvements across test domains without access to the domain label.

Adrià de Gispert

N-gram-based Machine Translation
