5th European Conference on Speech Communication and Technology (Eurospeech 1997) 1997
DOI: 10.21437/eurospeech.1997-548
|View full text |Cite
|
Sign up to set email alerts
|

Nonlinear discriminant analysis for improved speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
1
0

Year Published

2005
2005
2024
2024

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 24 publications
(3 citation statements)
references
References 0 publications
0
1
0
Order By: Relevance
“…One of the main benefits of the explicit alignment approaches such as CTC, RNN-T, or RNA is that they result in ASR models that are easily amenable to frame-synchronous decoding 6 In this section, we discuss the attention-based encoder-decoder (AED) models (also known as, listen-attendand-spell (LAS)) [15], [16], [53], which employs the attention mechanism [43] to implicitly identify and model the portions of the input acoustics which are relevant to each output unit. These models were first popularized in the context of machine translation [54].…”
Section: Implicit Alignment Modeling Approachesmentioning
confidence: 99%
See 1 more Smart Citation
“…One of the main benefits of the explicit alignment approaches such as CTC, RNN-T, or RNA is that they result in ASR models that are easily amenable to frame-synchronous decoding 6 In this section, we discuss the attention-based encoder-decoder (AED) models (also known as, listen-attendand-spell (LAS)) [15], [16], [53], which employs the attention mechanism [43] to implicitly identify and model the portions of the input acoustics which are relevant to each output unit. These models were first popularized in the context of machine translation [54].…”
Section: Implicit Alignment Modeling Approachesmentioning
confidence: 99%
“…Within the classical approach, deep learning has been introduced into acoustic and language modeling. In acoustic modeling, deep learning has replaced Gaussian mixture distributions (hybrid HMM [4], [5]) or augmented the acoustic feature set (e.g., non-linear discriminant/tandem approach [6], [7]). In language modeling, deep learning has replaced count-based approaches [8], [9], [10].…”
Section: Introductionmentioning
confidence: 99%
“…Within the classical approach, deep learning has been introduced to acoustic and language modeling. In acoustic modeling, deep learning replaced Gaussian mixture distributions (hybrid HMM [3], [4]) or augmented the acoustic feature set (nonlinear disciminant/tandem approach [5], [6]). In language modeling, deep learning replaced count-based approaches [7], [8], [9].…”
Section: Introductionmentioning
confidence: 99%