Group singing events have been associated with several outbreaks of infection during the coronavirus disease (COVID-19) pandemic (1). This link supports the possibility that aerosols are partly responsible for person-to-person infection. This study aims to analyze the impulse dispersion dynamics of aerosols in professional singers with respect to the differences between singing a text, singing a vowel, and speaking at different levels of loudness. Some of the results of these studies have been previously reported in the form of a preprint (
Laryngeal videoendoscopy is one of the main tools in clinical examinations for voice disorders and voice research. Using high-speed videoendoscopy, it is possible to fully capture the vocal fold oscillations; however, processing the recordings typically involves a time-consuming segmentation of the glottal area by trained experts. Even though automatic methods have been proposed and the task is particularly suited for deep learning methods, there are no public datasets and benchmarks available to compare methods and to allow training of generalizing deep learning models. In an international collaboration of researchers from seven institutions from the EU and USA, we have created BaGLS, a large, multihospital dataset of 59,250 high-speed videoendoscopy frames with individually annotated segmentation masks. The frames are based on 640 recordings of healthy and disordered subjects that were recorded with varying technical equipment by numerous clinicians. The BaGLS dataset will allow an objective comparison of glottis segmentation methods and will enable interested researchers to train their own models and compare their methods.
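An objective comparison of segmentation methods needs a score that relates a predicted mask to its annotated mask. The abstract does not name a metric, so the following is only a minimal sketch that assumes intersection over union (IoU), a common choice for segmentation tasks. The function name `iou` and the nested-list mask layout are illustrative assumptions, not part of the dataset's tooling.

```python
def iou(pred, truth):
    """Intersection over union of two binary masks.

    Masks are nested lists of 0/1 with identical shape (an assumed layout).
    Returns 1.0 when both masks are empty, since they then agree completely.
    """
    inter = 0
    union = 0
    for row_p, row_t in zip(pred, truth):
        for p, t in zip(row_p, row_t):
            if p and t:
                inter += 1  # pixel marked as glottis in both masks
            if p or t:
                union += 1  # pixel marked as glottis in at least one mask
    return 1.0 if union == 0 else inter / union


# Toy 2x2 example: the prediction marks one pixel too many.
predicted = [[0, 1], [1, 1]]
annotated = [[0, 1], [0, 1]]
score = iou(predicted, annotated)
```

Averaging such a per-frame score over a held-out set of frames would give one number per method, which is one possible basis for a benchmark.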
For the investigation of the physical processes of human phonation, inhomogeneous synthetic vocal folds were developed to represent the full fluid-structure-acoustic coupling. They consisted of polyurethane rubber with a stiffness in the range of human vocal folds and were mounted in a channel, shaped like the vocal tract in the supraglottal region. This test facility permitted extensive observations of flow-induced vocal fold vibrations, the periodic flow field, and the acoustic signals in the far field of the channel. Detailed measurements were performed applying particle-image velocimetry, a laser-scanning vibrometer, a microphone, unsteady pressure sensors, and a hot-wire probe, with the aim of identifying the physical mechanisms in human phonation. The results support the existence of the Coanda effect during phonation, with the flow attaching to one vocal fold and separating from the other. This behavior is not linked to one vocal fold and changes stochastically from cycle to cycle. The oscillating flow field generates a tonal sound. The broadband noise is presumed to be caused by the interaction of the asymmetric flow with the downstream-facing surfaces of the vocal folds, analogous to trailing-edge noise.
Quantitative analysis of phonatory characteristics of rabbits has been widely neglected. However, preliminary studies established the rabbit larynx as a potential model of human phonation. This study reports quantitative data on phonation using ex vivo rabbit larynx models to gain more insight into the dependencies among three main components of the phonation process: airflow, vocal fold dynamics, and the acoustic output. Sustained phonation was induced in 11 ex vivo rabbit larynges. For 414 phonatory conditions, vocal fold vibrations, acoustic, and aerodynamic parameters were analyzed as functions of longitudinal vocal fold pre-stress, applied airflow, and glottal closure insufficiency. Dimensions of the vocal folds were measured and histological data were analyzed. Glottal closure characteristics improved for increasing longitudinal pre-stress and applied airflow. For the subglottal pressure signal only the cepstral peak prominence showed dependency on glottal closure. In contrast, vibrational, acoustic, and aerodynamic parameters were found to be highly dependent on the degree of glottal closure: The more complete the glottal closure during phonation, the better the aerodynamic and acoustic characteristics. Hence, complete or at least partial glottal closure appears to enhance acoustic signal quality. Finally, results validate the ex vivo rabbit larynx as an effective model for analyzing the phonatory process.
A hybrid aeroacoustic approach was developed for the efficient numerical computation of human phonation. In the first step, an incompressible flow simulation on a three-dimensional (3D) computational grid, which is capable of resolving all relevant turbulent scales, is performed using STARCCM+ and the finite volume method. In the second step, the acoustic source terms on the flow grid are computed and a conservative interpolation to the acoustic grid is performed. Finally, the perturbed convective wave equation is solved to obtain the acoustic field in 3D with the finite element solver CFS++. The conservative transformation of the acoustic sources from the flow grid to the acoustic grid is a key step that allows coarse acoustic grids without reducing accuracy. For this transformation, two different interpolation strategies are compared and grid convergence is assessed. Overall, 16 simulation setups are compared. The initial (267 000 degrees of freedom) and the optimized (21 265 degrees of freedom) simulation setup were validated by measurements of a synthetic larynx model. To conclude, the total computational time of the acoustic simulation is reduced by 95% compared to the initial simulation setup without a significant reduction of accuracy (7%) in the frequency range of interest.
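The idea of a conservative transfer from a fine flow grid to a coarse acoustic grid can be illustrated in one dimension. The sketch below is not the paper's implementation and is not one of the two strategies it compares; it assumes a simple cell-centroid scheme in which each fine cell's integrated source (value times width) is split between the two nodes of the enclosing coarse element using linear shape functions. Because the two weights sum to one, the total source is preserved. All names (`conservative_transfer`, `fine_centroids`, and so on) are hypothetical.

```python
from bisect import bisect_right


def conservative_transfer(fine_centroids, fine_widths, fine_source, coarse_nodes):
    """Distribute fine-cell source integrals onto coarse-grid nodes (1D sketch).

    coarse_nodes must be sorted and contain at least two entries.
    Returns one nodal load per coarse node; their sum equals the
    integrated source on the fine grid.
    """
    loads = [0.0] * len(coarse_nodes)
    last_element = len(coarse_nodes) - 2
    for x, width, value in zip(fine_centroids, fine_widths, fine_source):
        # Find the coarse element [x0, x1] that contains the fine-cell centroid.
        i = min(max(bisect_right(coarse_nodes, x) - 1, 0), last_element)
        x0, x1 = coarse_nodes[i], coarse_nodes[i + 1]
        # Local coordinate in [0, 1]; clamped for centroids outside the grid.
        xi = min(max((x - x0) / (x1 - x0), 0.0), 1.0)
        integral = value * width  # source integrated over the fine cell
        loads[i] += (1.0 - xi) * integral      # share for the left node
        loads[i + 1] += xi * integral          # share for the right node
    return loads


# Eight fine cells on [0, 1] mapped onto a coarse grid with three nodes.
n = 8
widths = [1.0 / n] * n
centroids = [(k + 0.5) / n for k in range(n)]
source = [float(k * k) for k in range(n)]
nodal_loads = conservative_transfer(centroids, widths, source, [0.0, 0.5, 1.0])
fine_total = sum(s * w for s, w in zip(source, widths))
```

In this toy setup `sum(nodal_loads)` matches `fine_total` up to rounding, which is the property that lets the acoustic grid be much coarser than the flow grid without losing source energy.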
This study presents a framework for a direct comparison of experimental vocal fold dynamics data to a numerical two-mass-model (2MM) by solving the corresponding inverse problem of which parameters lead to similar model behavior. The introduced 2MM features improvements such as a variable stiffness and a modified collision force. A set of physiologically sensible degrees of freedom is presented, and three optimization algorithms are compared on synthetic vocal fold trajectories. Finally, a total of 288 high-speed video recordings of six excised porcine larynges were optimized to validate the proposed framework. Particular focus lay on the subglottal pressure, as the experimental subglottal pressure is directly comparable to the model subglottal pressure. Fundamental frequency, amplitude, and objective function values were also investigated. The employed 2MM is able to replicate the behavior of the porcine vocal folds very well. The fundamental frequency of the model trajectories matches that of the experimental trajectories in [Formula: see text] of the recordings. The relative error of the model trajectory amplitudes is on average [Formula: see text]. The experiments feature a mean subglottal pressure of 10.16 (SD [Formula: see text]) [Formula: see text]; in the model, it was on average 7.61 (SD [Formula: see text]) [Formula: see text]. The model tends to underestimate the subglottal pressure but is capable of inferring trends in it. The average absolute error between the subglottal pressure in the model and the experiment is 2.90 (SD [Formula: see text]) [Formula: see text] or [Formula: see text]. A detailed analysis of the factors affecting the accuracy in matching the subglottal pressure is presented.
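The structure of such an inverse problem, finding the model parameters whose trajectory best matches a measured one, can be sketched with a deliberately simplified stand-in. The code below does not implement the 2MM or any of the three optimization algorithms from the study: it assumes a toy sinusoidal trajectory model with two parameters (frequency and amplitude), a sum-of-squared-differences objective, and a brute-force grid search. All function names and parameter ranges are illustrative assumptions.

```python
import math


def model(freq, amp, times):
    """Toy trajectory model: a pure sinusoid (stand-in for a vocal fold model)."""
    return [amp * math.sin(2.0 * math.pi * freq * t) for t in times]


def objective(params, times, target):
    """Sum of squared differences between model and target trajectories."""
    freq, amp = params
    return sum((m - y) ** 2 for m, y in zip(model(freq, amp, times), target))


def grid_search(times, target, freqs, amps):
    """Return the (freq, amp) pair on the grid with the smallest objective."""
    candidates = ((f, a) for f in freqs for a in amps)
    return min(candidates, key=lambda p: objective(p, times, target))


# Synthetic "measurement": 200 samples at an assumed 4000 Hz frame rate,
# generated from known parameters so the recovery can be checked.
times = [i / 4000.0 for i in range(200)]
target = model(150, 0.8, times)
best = grid_search(times, target, range(100, 201, 10), [0.2, 0.4, 0.6, 0.8, 1.0])
```

Testing on synthetic trajectories with known parameters, as done here, mirrors the idea of validating an optimizer before applying it to recorded data; a real model with more parameters would need a proper optimization algorithm rather than an exhaustive grid.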