Interspeech 2007
DOI: 10.21437/interspeech.2007-579

The RWTH 2007 TC-STAR evaluation system for European English and Spanish

Abstract: In this work, the RWTH automatic speech recognition systems developed for the third TC-STAR evaluation campaign 2007 are presented. The RWTH systems make systematic use of internal system combination, combining systems with differences in feature extraction, adaptation methods, and training data used. To take advantage of this, novel feature extraction methods were employed; this year saw the introduction of Gammatone features and MLP based phone posterior features. Further improvements were achieved using uns…

Cited by 25 publications (9 citation statements)
References: 13 publications
“…Using a baseline 1-pass 60k-words recognition system for the automatic transcription of European Parliament Plenary Sessions (EPPS) in English as described in [4], a recognition quality of about 15% word error rate is achievable with a real-time factor of ∼4 (see Figure 2) on the evaluation corpus of the 2007 TC-STAR Evaluation Campaign. This corpus consists of 2.9h of speech with an out-of-vocabulary rate of 1.1%.…”
Section: Results (mentioning)
confidence: 99%
“…The applicability of our toolkit to real-life tasks has been proven by building several large vocabulary systems in recent international research projects. The European English and Spanish recognition systems developed during the TC-STAR Project are based on our RWTH ASR toolkit [4]. These two systems achieved the best results in the 2007 TC-STAR Evaluation Campaign.…”
Section: Introduction (mentioning)
confidence: 99%
“…Experiments were conducted based on the European Parliament plenary sessions (EPPS) corpus, from the TC-STAR project [3].…”
Section: Methods (mentioning)
confidence: 99%
“…In order to successfully use log-linear modeling in a state-of-the-art speech recognition system, it is necessary to reproduce or replace all important methods used in improving such a system. Speaker adaptation is one important method to improve the performance of a speech recognition system, and especially the use of feature space maximum likelihood linear regression (fMLLR) speaker adaptive training (SAT) [2], has proved to be an important part of state-of-the-art systems [3]. Thus, it is important to develop and investigate adaptation methods for log-linear models if they are to replace Gaussian models in a state-of-the-art speech recognition system.…”
Section: Introduction (mentioning)
confidence: 99%
“…questions on central-phoneme, segment, context, position or alternative properties are permitted to be posed in any desired order, and (2) the number of CART leafs, and thus, the tied CD-HMM state emission models, become controllable by keeping the n-best variants instead of solely relying on the construction criteria minimum log-likelihood gain τ ∆ log L and minimum observations τ N S(m) . The single joint CART is fundamental to LVCSR systems in [Lööf et al., 2007], and applied throughout the experimentation described in the subsequent chapters of this thesis.…”
Section: HMM State Tying (mentioning)
confidence: 99%