2002
DOI: 10.1006/csla.2001.0185
|View full text |Cite
|
Sign up to set email alerts
|

An overview of decoding techniques for large vocabulary continuous speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
51
0
2

Year Published

2002
2002
2014
2014

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 89 publications
(53 citation statements)
references
References 38 publications
0
51
0
2
Order By: Relevance
“…Time-asynchronous decoding pursues a depthfirst strategy, in which the best hypotheses are explored forward in time before being compared to their competitors. The distinction between time-synchronous and time-asynchronous decoding is not absolute [12]. In practice, time-synchronous decoders limit the number of hypotheses under examination at any given point using a process called beam pruning, which eliminates unpromising hypotheses from consideration.…”
Section: Recognitionmentioning
confidence: 99%
See 2 more Smart Citations
“…Time-asynchronous decoding pursues a depthfirst strategy, in which the best hypotheses are explored forward in time before being compared to their competitors. The distinction between time-synchronous and time-asynchronous decoding is not absolute [12]. In practice, time-synchronous decoders limit the number of hypotheses under examination at any given point using a process called beam pruning, which eliminates unpromising hypotheses from consideration.…”
Section: Recognitionmentioning
confidence: 99%
“…LVCSR is an incredibly complex process. In [12], a "trilogy" of factors is identified as being responsible for the bestperforming LVCSR systems: combination of multiple algorithms, clever design cooperative with the hardware, careful parameter tuning. If any of these factors goes awry during the process of integrating the ASR and indexing system, sub-optimal performance could result.…”
Section: Strategies For Combining Asr and Irmentioning
confidence: 99%
See 1 more Smart Citation
“…Nevertheless, the complexity of acoustic and language models used in speech recognition tasks still imposes growing requirements for the efficiency and accuracy of LVCSR decoders, and fosters the development of new approaches and techniques such as, e.g. cross-word acoustic models and longspan language models, already resulted in the development of several solutions for the speech-decoding problem [1,2,5,6,8,10,21,22].…”
Section: Introductionmentioning
confidence: 99%
“…Typically these knowledge sources are represented in the form of hidden Markov models (HMM), pronunciation lexica, and N-gram language models. The means for combining these knowledge sources and efficient decoding of the acoustic input is a demanding task and a range of optimisation techniques and heuristics are employed to achieve lower computational and memory requirements with minimal sacrifice to recognition accuracy [1]. In this paper we present the "Juicer" decoding software that has been developed at IDIAP.…”
Section: Introductionmentioning
confidence: 99%