2019
DOI: 10.1109/access.2019.2911314
|View full text |Cite
|
Sign up to set email alerts
|

A Pathological Multi-Vowels Recognition Algorithm Based on LSP Feature

Abstract: At present, pathological voice recognition is mainly based on the classification of pathological voice. However, almost all the researches are based on the single vowel \a\ samples, but few on multivowels. In addition, the current researches on multi-vowels recognition are mainly for normal voices, which are unsuitable for the speech recognition of normal and pathological multi-vowels simultaneously. This paper concentrates on developing an accurate and robust feature called enhanced-bark line spectrum pair (E… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 16 publications
(1 citation statement)
references
References 28 publications
0
1
0
Order By: Relevance
“…Speech signals in the time domain are transformed into the time-frequency domain because changes in frequencies and in the time of the signals are important cues for further analysis processes. On the basis of spectrograms, feature-emphasis techniques such as mel-frequency cepstral coefficients (MFCCs) [2]- [4] and linear prediction cepstral coefficients (LPCCs) [5], [6] can be applied to obtain significant acoustical feature vectors. Pattern-analysis techniques (e.g., support vector machine [7] for voice pathology detection, Gaussian mixture model [8] for speaker recognition, and deep neural networks [9] for emotion recognition) are then applied to the feature vectors.…”
Section: Introductionmentioning
confidence: 99%
“…Speech signals in the time domain are transformed into the time-frequency domain because changes in frequencies and in the time of the signals are important cues for further analysis processes. On the basis of spectrograms, feature-emphasis techniques such as mel-frequency cepstral coefficients (MFCCs) [2]- [4] and linear prediction cepstral coefficients (LPCCs) [5], [6] can be applied to obtain significant acoustical feature vectors. Pattern-analysis techniques (e.g., support vector machine [7] for voice pathology detection, Gaussian mixture model [8] for speaker recognition, and deep neural networks [9] for emotion recognition) are then applied to the feature vectors.…”
Section: Introductionmentioning
confidence: 99%