Non-linear speech representation based on local predictability exponents

Khanagha, Vahid; Daoudi, Khalid; Pont, Oriol; Yahia, Hussein; Turiel, Antonio

doi:10.1016/j.neucom.2012.12.061

Cited by 3 publications

(2 citation statements)

References 17 publications

“…However, for the case of speech signals, the direct 1-D adaptation of the same procedure reduces the resulting Γ_r(·) to simple directional finite differences. In [38], we followed a similar path and searched for a Γ_r(·) that results in a relatively more compact MSM from which the whole speech signal can be reconstructed; we used a classical method for reconstruction of a given signal from a subset of its irregularly spaced samples (MSM in our case) and compared various definitions of Γ_r(·) to find the one that results in a more compact MSM from which the signal can be reconstructed with good perceptual quality (evaluated using the PESQ measure of signal quality). As such, the multi-scale integral of the following scale-dependent functional was defined:…”

Section: A. The Choice of Γ_r(·) (mentioning)

confidence: 99%

“…We discussed in [38] that such a definition reduces the effect of inter-sample correlations of the speech signal in the estimation of SEs and we showed that it effectively results in a compact representation of the speech signal. On the other hand, the GCI detection application that we are considering in this paper allows us to provide an intuitive justification for this multiscale measure.…”

Section: A. The Choice of Γ_r(·) (mentioning)

confidence: 99%

Detection of Glottal Closure Instants Based on the Microcanonical Multiscale Formalism

Khanagha; Daoudi; Yahia

2014, IEEE/ACM Trans. Audio Speech Lang. Process.

This paper presents a novel algorithm for automatic detection of Glottal Closure Instants (GCIs) from the speech signal. Our approach is based on a novel multiscale method that relies on precise estimation of a multiscale parameter at each time instant in the signal domain. This parameter quantifies the degree of signal singularity at each sample from a multiscale point of view, and thus its value can be used to classify signal samples accordingly. We use this property to develop a simple algorithm for detection of GCIs and we show that, for the case of clean speech, our algorithm performs almost as well as a recent state-of-the-art method. Next, by performing a comprehensive comparison in the presence of 14 different types of noise, we show that our method is more accurate (particularly for very low SNRs). Our method has lower computational times compared to others and does not rely on an estimate of the pitch period or any critical choice of parameters.
