Hermann Ney scite author profile

A Systematic Comparison of Various Statistical Alignment Models

We present and compare various methods for computing word alignments using statistical or heuristic models. We consider the five alignment models presented in Brown, Della Pietra, Della Pietra, and Mercer (1993), the hidden Markov alignment model, smoothing techniques, and refinements. These statistical models are compared with two heuristic models based on the Dice coefficient. We present different methods for combining word alignments to perform a symmetrization of directed statistical alignment models. As the evaluation criterion, we use the quality of the resulting Viterbi alignment compared to a manually produced reference alignment. We evaluate the models on the German-English Verbmobil task and the French-English Hansards task. We perform a detailed analysis of various design decisions of our statistical alignment system and evaluate these on training corpora of various sizes. An important result is that refined alignment models with a first-order dependence and a fertility model yield significantly better results than simple heuristic models. In the Appendix, we present an efficient training algorithm for the alignment models presented.
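
The abstract mentions heuristic models based on the Dice coefficient without spelling them out. A minimal sketch of one such heuristic follows, assuming the usual definition dice(e, f) = 2 C(e, f) / (C(e) + C(f)) over sentence-level co-occurrence counts and a simple "best-scoring source word" linking rule. The function names and the toy corpus are hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import product


def dice_scores(sentence_pairs):
    """Return Dice coefficients for every co-occurring (source, target) word pair."""
    src_count = Counter()   # number of sentence pairs containing each source word
    tgt_count = Counter()   # number of sentence pairs containing each target word
    pair_count = Counter()  # number of sentence pairs containing both words
    for src, tgt in sentence_pairs:
        src_types, tgt_types = set(src), set(tgt)
        src_count.update(src_types)
        tgt_count.update(tgt_types)
        pair_count.update(product(src_types, tgt_types))
    return {
        (s, t): 2.0 * c / (src_count[s] + tgt_count[t])
        for (s, t), c in pair_count.items()
    }


def align(src, tgt, scores):
    """Link each target position to the source position with the highest Dice score."""
    links = []
    for j, t in enumerate(tgt):
        best_i = max(range(len(src)), key=lambda i: scores.get((src[i], t), 0.0))
        links.append((best_i, j))
    return links


# Toy German-English corpus, invented for illustration only.
corpus = [
    (["das", "haus"], ["the", "house"]),
    (["das", "buch"], ["the", "book"]),
    (["ein", "buch"], ["a", "book"]),
]
scores = dice_scores(corpus)
alignment = align(["das", "haus"], ["the", "house"], scores)
```

Each target word is linked independently of its neighbours here, which is the kind of simplification the statistical models with first-order dependence and fertility are reported to improve on.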

The 2005 PASCAL Visual Object Classes Challenge

Everingham et al. 2006

The PASCAL Visual Object Classes Challenge ran from February to March 2005. The goal of the challenge was to recognize objects from a number of visual object classes in realistic scenes (i.e., not pre-segmented objects). Four object classes were selected: motorbikes, bicycles, cars, and people. Twelve teams entered the challenge. In this chapter we provide details of the datasets, algorithms used by the teams, evaluation criteria, and results achieved.

Improved backing-off for M-gram language modeling

From Feedforward to Recurrent LSTM Neural Networks for Language Modeling

Sundermeyer, Ney, Schlüter 2015

IEEE/ACM Trans. Audio Speech Lang. Process.

766

662

LSTM neural networks for language modeling

Sundermeyer, Schlüter, Ney 2012

1,363

511

Neural networks have become increasingly popular for the task of language modeling. Whereas feed-forward networks only exploit a fixed context length to predict the next word of a sequence, conceptually, standard recurrent neural networks can take into account all of the predecessor words. On the other hand, it is well known that recurrent networks are difficult to train and therefore are unlikely to show the full potential of recurrent models. These problems are addressed by the Long Short-Term Memory neural network architecture. In this work, we analyze this type of network on an English and a large French language modeling task. Experiments show improvements of about 8% relative in perplexity over standard recurrent neural network LMs. In addition, we gain considerable improvements in WER on top of a state-of-the-art speech recognition system.
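
For readers unfamiliar with the metric, the following sketch shows how perplexity and a relative perplexity improvement are commonly computed. It assumes the standard definition (the exponential of the negative mean log-probability per word). The helper names are hypothetical and the probabilities are invented for illustration; they are not results from the paper.

```python
import math


def perplexity(word_probs):
    """Perplexity of a test sequence given the model probability of each word."""
    log_sum = sum(math.log(p) for p in word_probs)
    return math.exp(-log_sum / len(word_probs))


def relative_reduction(baseline, improved):
    """Relative reduction of `improved` with respect to `baseline`."""
    return (baseline - improved) / baseline


# Illustrative probabilities only; these are not results from the paper.
baseline_ppl = perplexity([0.1, 0.1, 0.1, 0.1])          # every word at 0.1
improved_ppl = perplexity([0.125, 0.125, 0.125, 0.125])  # every word at 0.125
gain = relative_reduction(baseline_ppl, improved_ppl)
```

With these toy inputs the baseline perplexity is 10 and the improved perplexity is 8, a relative reduction of 20%.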

Joint-sequence models for grapheme-to-phoneme conversion

Bisani, Ney 2008

Speech Communication

496

471

HMM-based word alignment in statistical translation

1996

In this paper, we describe a new model for word alignment in statistical translation and present experimental results. The idea of the model is to make the alignment probabilities dependent on the differences in the alignment positions rather than on the absolute positions. To achieve this goal, the approach uses a first-order Hidden Markov model (HMM) for the word alignment problem, as is done successfully in speech recognition for the time alignment problem. Unlike the time alignment HMM, the model imposes no monotonicity constraint on the possible word orderings. We describe the details of the model and test it on several bilingual corpora.
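
The idea of jump-dependent alignment probabilities can be sketched as a small Viterbi search. This is a minimal illustration under stated assumptions, not the authors' implementation: the transition probability is taken to be a jump weight normalised over all source positions, the start distribution is assumed uniform, and `lex_prob`, `jump_weight`, and the toy lexicon are hypothetical.

```python
import math


def viterbi_alignment(src, tgt, lex_prob, jump_weight):
    """Find the most probable source position for each target word.

    lex_prob(f, e): lexicon probability of target word f given source word e.
    jump_weight(d): positive weight for a jump of d = i - i_prev positions.
    """
    I, J = len(src), len(tgt)

    def log_trans(i_prev, i):
        # Normalise jump weights over all source positions; jumps may be
        # negative, so no monotonicity constraint is imposed.
        norm = sum(jump_weight(k - i_prev) for k in range(I))
        return math.log(jump_weight(i - i_prev) / norm)

    def log_emit(j, i):
        return math.log(lex_prob(tgt[j], src[i]))

    # Uniform start distribution over source positions (an assumption).
    score = [[-math.log(I) + log_emit(0, i) for i in range(I)]]
    back = []
    for j in range(1, J):
        row, ptr = [], []
        for i in range(I):
            best_prev = max(range(I), key=lambda p: score[-1][p] + log_trans(p, i))
            row.append(score[-1][best_prev] + log_trans(best_prev, i) + log_emit(j, i))
            ptr.append(best_prev)
        score.append(row)
        back.append(ptr)

    # Trace back from the best final state.
    i = max(range(I), key=lambda p: score[-1][p])
    path = [i]
    for ptr in reversed(back):
        i = ptr[i]
        path.append(i)
    return path[::-1]


# Toy parameters, invented for illustration only.
LEXICON = {("the", "das"): 0.9, ("house", "haus"): 0.9, ("small", "kleine"): 0.9}


def lex_prob(f, e):
    return LEXICON.get((f, e), 0.01)  # small floor avoids log(0)


def jump_weight(d):
    return 0.5 ** abs(d - 1)  # favours moving one position forward


path = viterbi_alignment(
    ["das", "kleine", "haus"], ["the", "small", "house"], lex_prob, jump_weight
)
```

Because the transition depends only on the jump width `i - i_prev`, the same weights apply at every absolute position, and backward jumps remain possible whenever the lexicon evidence outweighs the jump penalty.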

Improved statistical alignment models

2000
