2018
DOI: 10.1121/1.5039718
|View full text |Cite
|
Sign up to set email alerts
|

Detection of hypernasality based on vowel space area

Abstract: This study proposes a method for differentiating hypernasal-speech from normal speech using the vowel space area (VSA). Hypernasality introduces extra formant and anti-formant pairs in vowel spectrum, which results in shifting of formants. This shifting affects the size of the VSA. The results show that VSA is reduced in hypernasal-speech compared to normal speech. The VSA feature plus Mel-frequency cepstral coefficient feature for support vector machine based hypernasality detection leads to an accuracy of 86… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 19 publications
(10 citation statements)
references
References 13 publications
0
10
0
Order By: Relevance
“…Based on previous evidence, the following selection of features were computed from each utterance: MFCC coefficients, the first three formants together with their bandwidths (BW) and distances and the VLTHTR [6,10,22,23]. The 13-dimensional MFCC features were calculated using moving Hamming windows.…”
Section: Acoustic Featuresmentioning
confidence: 99%
See 3 more Smart Citations
“…Based on previous evidence, the following selection of features were computed from each utterance: MFCC coefficients, the first three formants together with their bandwidths (BW) and distances and the VLTHTR [6,10,22,23]. The 13-dimensional MFCC features were calculated using moving Hamming windows.…”
Section: Acoustic Featuresmentioning
confidence: 99%
“…In the past there have been many proposals to evaluate HN based on acoustic information. While the results in terms of accuracy are generally excellent, most studies have used only a limited number of utterance types, such as sustained vowels [6][7][8][9][10][11]. This is not clearly compatible with standard clinical recommendations [2], which strongly recommend that patients are evaluated using a variety of phonemes and utterances with varying complexity (e.g., CAPS-A protocol, [2,12]).…”
Section: Introductionmentioning
confidence: 99%
See 2 more Smart Citations
“…The important works on hypernasality detection are based on Teager energy operator (TEO) based feature [11], TEO feature with frequency cepstral coefficient (MFCC) feature [12], pitch-adaptive MFCC feature [13], linear prediction cepstral coefficient (LPCC) feature [14] and features extracted from high spectral resolution group delay spectrum [4] and zero time windowing technique [15], [16]. Besides that the feature set obtained from acoustic, noise and cepstral analysis, nonlinear dynamic and entropy measurements [17], [18], [19], based on energy distribution [20], [21] and using vowel space area (VSA) [22] are also used. The hypernasality detection is also done using recorded sentences speech database using jitter, shimmer, MFCC, bionic wavelet transform entropy and bionic wavelet transform energy features.…”
Section: Introductionmentioning
confidence: 99%