2011
DOI: 10.1109/tbme.2010.2089052
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Detection of Pathological Voices Using Complexity Measures, Noise Parameters, and Mel-Cepstral Coefficients

Abstract: This paper proposes a new approach to improve the amount of information extracted from the speech aiming to increase the accuracy of a system developed for the automatic detection of pathological voices. The paper addresses the discrimination capabilities of 11 features extracted using nonlinear analysis of time series. Two of these features are based on conventional nonlinear statistics (largest Lyapunov exponent and correlation dimension), two are based on recurrence and fractal-scaling analysis, and the rem… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
82
0
1

Year Published

2011
2011
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 131 publications
(83 citation statements)
references
References 43 publications
0
82
0
1
Order By: Relevance
“…The recordings are preprocessed by means of its division into frames with 55ms of length with an overlap of 50%, according to [9]. After, NLD features are calculated for each frame and four statistics are calculated for each feature.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The recordings are preprocessed by means of its division into frames with 55ms of length with an overlap of 50%, according to [9]. After, NLD features are calculated for each frame and four statistics are calculated for each feature.…”
Section: Methodsmentioning
confidence: 99%
“…In this paper, different state of the art NLD features are implemented and their discrimination capability is objectively evaluated on the automatic classification of speech signals from PPD and CS. The set of features considered in this study includes a total of 10 measures which have been used for the automatic detection of different speech disorders such as hypernasality [8] and dysphonia [5], [9]. The features are: correlation dimension, largest Lyapunov exponent, Lempel-Ziv complexity, Hurst exponent, RPDE, DFA, approximate entropy, approximate entropy with Gaussian kernel, sample entropy, sample entropy with Gaussian kernel.…”
Section: Introductionmentioning
confidence: 99%
“…Changes in spectral envelope shape and slope are widely reported as a consequence of voice pathology [13] or Parkinson's [14,15]. The use of mel-frequency cepstral coefficients (MFCC) to describe the short-time speech spectral envelope is very common.…”
Section: Features For Detecting Health Changesmentioning
confidence: 99%
“…From past decade mel frequency cepstral coefficients (MFCCs) have been widely used in pathological voice detection systems with good results. [8][9][10] The main advantage of MFCCs is that, its calculation does not require any previous pitch information, which is a difficult task in the presence of pathology. The basic time domain features and MFCCs can be used in developing classifiers models for detection of pathological voices.…”
Section: Introductionmentioning
confidence: 99%