1986
DOI: 10.1109/tassp.1986.1164911
|View full text |Cite
|
Sign up to set email alerts
|

Frame-specific statistical features for speaker independent speech recognition

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
16
0

Year Published

1988
1988
2011
2011

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 33 publications
(16 citation statements)
references
References 11 publications
0
16
0
Order By: Relevance
“…A significant aspect of our research is that it represents the effort toward the automatic definition of speechdependent acoustic parameters, which are subject to statistical optimization rather than relying on heuristic construction. Along this line, we note an earlier work as a representative of the nonparametric (speech-frame based) approach to this problem [3]. Our own earlier parametric (HMMstate based) approach [7], [24] has been extended in this study from the previous level of MFCC to the present level of log-channel energy computed from DFT's, a step closer toward the most primitive form of the data as speech waveform.…”
Section: Summary and Discussionmentioning
confidence: 88%
See 1 more Smart Citation
“…A significant aspect of our research is that it represents the effort toward the automatic definition of speechdependent acoustic parameters, which are subject to statistical optimization rather than relying on heuristic construction. Along this line, we note an earlier work as a representative of the nonparametric (speech-frame based) approach to this problem [3]. Our own earlier parametric (HMMstate based) approach [7], [24] has been extended in this study from the previous level of MFCC to the present level of log-channel energy computed from DFT's, a step closer toward the most primitive form of the data as speech waveform.…”
Section: Summary and Discussionmentioning
confidence: 88%
“…As described above, the static features are obtained by a linear transformation of andimensional input space for the MFB log channel energies, represented by the vector , to a transformed -dimensional feature space according to (1). Instead of taking the temporal difference of the transformed static features fixed a priori in THMM-1, the dynamic feature vector at frame in THMM-2 is constructed as additional state-dependent, trainable linear combinations of the static features stretching over the interval frames forward and frames backward according to (3) where is the th scalar weighting coefficient associated with the th mixture residing in the Markov state . (Note that in this THMM-2, is trainable, in contrast to THMM-1 where weights are prefixed).…”
Section: B Construction Of State-dependent Joint Transforms For Statmentioning
confidence: 99%
“…As part of a continuing trend to better characterize temporal variations in the signal, higher order time derivatives of signal measurements (Doddington 1989;Bocchieri and Doddington 1986;Furui 1986) were added to the signal model. The absolute measurements previously discussed can be thought of as zero th order derivatives.…”
Section: Differentiationmentioning
confidence: 99%
“…The minimum error is obtained by choosing the ( ) smallest (zero in our case) eigenvalues and their corresponding eigenvectors as the ones to discard [8]. Since the number of largest (nonzero) eigenvalues is limited by the number ( ) when , the dimension of the subspace spanned by the eigenvectors corresponding to the largest eigenvalues can be extended up to ( ).…”
Section: Principal Component Analysismentioning
confidence: 99%