2005
DOI: 10.1109/tsa.2005.848885
|View full text |Cite
|
Sign up to set email alerts
|

Time-domain isolated phoneme classification using reconstructed phase spaces

Abstract: This paper introduces a novel time-domain approach to modeling and classifying speech phoneme waveforms. The approach is based on statistical models of reconstructed phase spaces, which offer significant theoretical benefits as representations that are known to be topologically equivalent to the state dynamics of the underlying production system. The lag and dimension parameters of the reconstruction process for speech are examined in detail, comparing common estimation heuristics for these parameters with cor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
31
0
1

Year Published

2006
2006
2020
2020

Publication Types

Select...
6
2

Relationship

0
8

Authors

Journals

citations
Cited by 43 publications
(32 citation statements)
references
References 27 publications
0
31
0
1
Order By: Relevance
“…This is because the time complexity of the Viterbi algorithm [1], which is used for the recognition of speech, is far greater for the RPS approach, due to the amount of data [30]. For RPS based methods to become useful, this issue must be solved.…”
Section: Applications To Asrmentioning
confidence: 99%
See 2 more Smart Citations
“…This is because the time complexity of the Viterbi algorithm [1], which is used for the recognition of speech, is far greater for the RPS approach, due to the amount of data [30]. For RPS based methods to become useful, this issue must be solved.…”
Section: Applications To Asrmentioning
confidence: 99%
“…Again, GMMs are used to model the RPS features, and are learned using binarysplit EM. The number of mixtures used, which was determined empirically in [30], is 128. The classification accuracy for an RPS of dimension 10, with delta dimensions is 38.81%.…”
Section: Fullband Rpsmentioning
confidence: 99%
See 1 more Smart Citation
“…Recentes pesquisas relacionadas às séries temporais, geradas a partir dos mecanismos de produção da voz humana, têm sido realizadas considerando-se as técnicas da dinâmica não linear e da teoria do caos com objetivos variados, dentre os quais podem ser destacados: classificação de fonemas (Johnson et al, 2005;Kokkinos e Maragos, 2005), reconhecimento automático de locutor (Petry, 2002), discriminação entre vozes saudáveis e patológicas, diagnóstico de patologias laríngeas e avaliação de efeitos de tratamentos clínicos (Dajer, 2006;Henríquez et al, 2009;Jiang et al, 2006;Scalassara et al, 2008;Torres et al, 2003;Zhang e Jiang, 2008).…”
Section: Introductionunclassified
“…The anal-4 ysis may be followed by measurement of invariant quantities on the reconstructed space. Early works in the field employing phase-space reconstruction include [21,22,26,27,17,18,28], whereas recently there has been increasing interest in the area [19,29,30]. These employ concepts on Lyapunov exponents [18,19,29], density models of the phase-space [30], correlation dimension measurements [18,28], especially for fricative consonants [17], or surrogate analysis on the nonlinear dynamics of vowels [31].…”
Section: Introductionmentioning
confidence: 99%