2014
DOI: 10.1016/j.neucom.2012.12.061
|View full text |Cite
|
Sign up to set email alerts
|

Non-linear speech representation based on local predictability exponents

Abstract: Looking for new perspectives to analyze non-linear dynamics of speech, this paper presents a novel approach based on a microcanonical multiscale formulation which allows the geometric and statistical description of multiscale properties of the complex dynamics. Speech is a complex system whose dynamics can be, to some extent, geometrically and statistically accessed by the computation of Local Predictability Exponents (LPEs) unlocking the determination of the most informative subset (Most Singular Manifold or … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2

Citation Types

0
2
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(2 citation statements)
references
References 17 publications
0
2
0
Order By: Relevance
“…However, For the case of speech signals, the direct 1-D adaptation of the same procedure reduces the resulting Γ r (•) to simple directional finite differences. In [38], we followed a similar path and searched for a Γ r (•) that results in a relatively more compact MSM from which the whole speech signal can be reconstructed; we used a classical method for reconstruction of a given signal from a subset of its irregularly spaced samples (MSM in our case) and compared various definition of Γ r (•) to find the one that results in a more compact MSM from which the signal can be reconstructed with good perceptual quality (evaluated using the PESQ measure of signal quality). As such, the multi-scale integral of the following scaledependent functional was defined:…”
Section: A the Choice Of γ R (•)mentioning
confidence: 99%
See 1 more Smart Citation
“…However, For the case of speech signals, the direct 1-D adaptation of the same procedure reduces the resulting Γ r (•) to simple directional finite differences. In [38], we followed a similar path and searched for a Γ r (•) that results in a relatively more compact MSM from which the whole speech signal can be reconstructed; we used a classical method for reconstruction of a given signal from a subset of its irregularly spaced samples (MSM in our case) and compared various definition of Γ r (•) to find the one that results in a more compact MSM from which the signal can be reconstructed with good perceptual quality (evaluated using the PESQ measure of signal quality). As such, the multi-scale integral of the following scaledependent functional was defined:…”
Section: A the Choice Of γ R (•)mentioning
confidence: 99%
“…We discussed in [38] that such definition reduces the effect of inter-sample correlations of the speech signal in estimation of SEs and we showed that it effectively results in a compact representation of the speech signal. On the other hand, the GCI detection application that we are considering in this paper allows us to provide an intuitive justification for this multiscale measure.…”
Section: A the Choice Of γ R (•)mentioning
confidence: 99%