2019
DOI: 10.1121/1.5129121
|View full text |Cite
|
Sign up to set email alerts
|

Skewing of the glottal flow with respect to the glottal area measured in natural production of vowels

Abstract: In the production of voiced speech, glottal flow skewing refers to the tilting of the glottal flow pulses to the right, often characterized as a delay of the peak, compared to the glottal area. In the past four decades, several studies have addressed this phenomenon using modeling of voice production with analog circuits and computer simulations. However, previous studies measuring flow skewing in natural production of speech are sparse and they contain little quantitative data about the degree of skewing betw… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 24 publications
(36 reference statements)
0
3
0
Order By: Relevance
“…Furthermore, the exact size of the image and the phonatory step within the laryngoscope video do not matter. On the contrary, many computer vision/machine learning algorithms that aim to segment the glottis or vocal folds have several requirements, such as a specific image resolution (usually around 256 × 256 pixels; Alku et al, 2019;Fehling et al, 2020;Wurzbacher et al, 2008), the high frame rates associated with HSV, and sometimes manual intervention to handpick frames that depict the glottis or certain laryngeal characteristics. Our system is not limited by strict specifications regarding video quality or time within a nasolaryngoscopic video.…”
Section: Relevance For Rating Voice Quality In Adsdmentioning
confidence: 99%
See 1 more Smart Citation
“…Furthermore, the exact size of the image and the phonatory step within the laryngoscope video do not matter. On the contrary, many computer vision/machine learning algorithms that aim to segment the glottis or vocal folds have several requirements, such as a specific image resolution (usually around 256 × 256 pixels; Alku et al, 2019;Fehling et al, 2020;Wurzbacher et al, 2008), the high frame rates associated with HSV, and sometimes manual intervention to handpick frames that depict the glottis or certain laryngeal characteristics. Our system is not limited by strict specifications regarding video quality or time within a nasolaryngoscopic video.…”
Section: Relevance For Rating Voice Quality In Adsdmentioning
confidence: 99%
“…Once segmented, various geometric measures that characterize the shape can be easily computed. These segmentation methods have included edge detection and region growing algorithms (Alku et al, 2019;Chen et al, 2013), pixel thresholding (Wurzbacher et al, 2006), a watershed transform and a linear predictor (Osma-Ruiz et al, 2008), a convolutional neural network-based semantic segmentation (Laves et al, 2019), and a deep convolutional long short-term memory network (Fehling et al, 2020). Many of these studies use high-speed video (HSV) recordings of the larynx.…”
mentioning
confidence: 99%
“…are many recent investigations where GIF methods have been studied jointly with glottal area information extracted using HSV or with physical models of voice production. These investigations have addressed issues, such as the relationship between the glottal flow and glottal area in the presence of source-filter interaction [76], [77] and in phonation onsets [55], the computation of parameter values for physical models [78], [79], and the estimation of subglottal pressure, laryngeal muscle activation, and vocal fold contact pressure [80]. We argue that the strategy used in these investigations to study excitation information of speech (i.e., using GIF jointly with HSV and with physical modeling approaches) will become increasingly important and also increasingly feasible in the future due to the progress in HSV [50], [81], physical modeling [82], [83], and GIF [59], [67], [71].…”
Section: The Ac-flow (Fac) Minimum Flow (F Min ) and The Minimum Of The Derivative (D Min )mentioning
confidence: 99%