2009
DOI: 10.1109/tbme.2008.2007910
|View full text |Cite
|
Sign up to set email alerts
|

Automatic Modeling of Acoustic Perception of Breathiness in Pathological Voices

Abstract: This paper revisits the modeling of acoustic perceptions of breathy voice (BV) quality for automatic assessment of perturbations in pathologic speech. Several acoustic measures related with the signal periodicity, harmonic components, and aspiration noise are studied to predict breathiness judgments performed on sustained vowel phonations. A novel comprehensive automatic measure is proposed that provides the highest correlation index (88.5%) with breathiness judgment performed by trained specialists on simulat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
10
1

Year Published

2011
2011
2023
2023

Publication Types

Select...
6

Relationship

0
6

Authors

Journals

citations
Cited by 19 publications
(11 citation statements)
references
References 22 publications
0
10
1
Order By: Relevance
“…These findings are in contrast to those of Castillo-Guerra and Ruíz (2009), who compared acoustic measures to perceptual judgments of breathy voice. Two measures were used to assess the ratio between H1 and H2: harmonic energy (HE) which was based on the radiated acoustic signal, and harmonic energy of residue (HE RES ), which was calculated after passage through an inverse filter model.…”
Section: Discussioncontrasting
confidence: 99%
See 1 more Smart Citation
“…These findings are in contrast to those of Castillo-Guerra and Ruíz (2009), who compared acoustic measures to perceptual judgments of breathy voice. Two measures were used to assess the ratio between H1 and H2: harmonic energy (HE) which was based on the radiated acoustic signal, and harmonic energy of residue (HE RES ), which was calculated after passage through an inverse filter model.…”
Section: Discussioncontrasting
confidence: 99%
“…Two measures were used to assess the ratio between H1 and H2: harmonic energy (HE) which was based on the radiated acoustic signal, and harmonic energy of residue (HE RES ), which was calculated after passage through an inverse filter model. The measure HE RES better accounted for variance in the perceptual judgments across multiple types of perturbation, leading the authors to conclude that filtering effects of the vocal tract from the signal improves the accuracy of the information conveyed about breathy voice (Castillo-Guerra & Ruíz, 2009). The most likely explanation for this is that they did not appear to correct the harmonic amplitudes for the influences of F1 in their calculation.…”
Section: Discussionmentioning
confidence: 99%
“…Breath feature is a vocal feature according to the mechanism of speech production, 8 which includes periodic perturbation coefficient, harmonic‐noise ratio, glottic‐noise excitation ratio, harmonic structure energy, harmonic energy residual, and harmonic‐signal ration.…”
Section: Experiments and Evaluationmentioning
confidence: 99%
“…Tone quality feature generally compose of formant frequency, frequency perturbation, and amplitude perturbation 7 . Castillo‐Guerra and Ruiz 8 studied the automatic evaluation model to the quality of acoustic breathing and proposed a tone quality feature for recognizing speech emotion, the feature consists of periodic disturbance coefficient, amplitude disturbance coefficient, harmonic noise ratio, glottic noise excitation ratio, harmonic energy, harmonic energy residue, and harmonic signal ratio. Tone quality feature has a high recognition performance in vigorous active emotions, such as anger and surprise.…”
Section: Introductionmentioning
confidence: 99%
“…In this paper, traditional acoustic measures concerning the signal periodicity, harmonic components, and aspiration noise were calculated in addition to harmonic energy of residue (HE RES ), harmonic-to-signal ratio (HSR), and number of voiced frames (NVF). The best classification performance (88.5%) was achieved with a best subset regression (BSR) analysis using linear combinations of multiple parameters [15]. Wester suggested two methods to automatically classify voice quality: regression analysis and hidden markov models (HMMs).…”
Section: Introductionmentioning
confidence: 99%