2007
DOI: 10.1109/tbme.2006.889780
|View full text |Cite
|
Sign up to set email alerts
|

The Use of Wavelet Packet Transform and Artificial Neural Networks in Analysis and Classification of Dysphonic Voices

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
6
0
1

Year Published

2013
2013
2022
2022

Publication Types

Select...
8
2

Relationship

0
10

Authors

Journals

citations
Cited by 30 publications
(7 citation statements)
references
References 12 publications
0
6
0
1
Order By: Relevance
“…The popular parameters in this category are computed using the fractal dimension or the correlation dimension [28], [45], [51]- [53]. The complex measures investigated in several studies consist of the following: the largest Lyapunov exponent, the recurrence period density entropy, Hurst exponent, detrended fluctuation analysis, approximate entropy, sample entropy, modified sample entropy, Gaussian kernel sample entropy, fuzzy entropy, hidden Markov model (HMM) entropy and Shannon HMM entropy [38], [39], [54], [55]. These features capture the dynamic variants/invariants, long-range correlations, regularity or predictability information present in the signal.…”
Section: Introductionmentioning
confidence: 99%
“…The popular parameters in this category are computed using the fractal dimension or the correlation dimension [28], [45], [51]- [53]. The complex measures investigated in several studies consist of the following: the largest Lyapunov exponent, the recurrence period density entropy, Hurst exponent, detrended fluctuation analysis, approximate entropy, sample entropy, modified sample entropy, Gaussian kernel sample entropy, fuzzy entropy, hidden Markov model (HMM) entropy and Shannon HMM entropy [38], [39], [54], [55]. These features capture the dynamic variants/invariants, long-range correlations, regularity or predictability information present in the signal.…”
Section: Introductionmentioning
confidence: 99%
“…Since most of the vocal response perceived by human ear lies in low frequency range, [6] literatures suggested frequency measure based technique like Mel-frequency scale to get a high resolution in low frequency region, and a low resolution in high frequency region. Wavelet packet based feature extraction is also suggested in literature [2][10] [11] [12]. Reference [13] compared seven breathiness measures with glottal to noise excitation ratio which they established as a discriminator for carcinoma, disturbed and normal speech signals.…”
Section: Literature Studymentioning
confidence: 99%
“…In the last decade, interesting works had been proposed on artificial neural networks for classification problems. For instance, in [23] a classification system to identify voice dysphonia via Wavelet Packet Transform and the Best Basis Algorithm is designed. Outstanding results were reported by reaching from 87.5% to 96.8% of accuracy.…”
Section: Introductionmentioning
confidence: 99%