As the primary means of communication, voice plays an important role in daily life. Voice also conveys personal information such as social status, personal traits, and the emotional state of the speaker. Mechanically, voice production involves complex fluid-structure interaction within the glottis and its control by laryngeal muscle activation. An important goal of voice research is to establish a causal theory linking voice physiology and biomechanics to how speakers use and control voice to communicate meaning and personal information. Establishing such a causal theory has important implications for clinical voice management, voice training, and many speech technology applications. This paper provides a review of voice physiology and biomechanics, the physics of vocal fold vibration and sound production, and laryngeal muscular control of the fundamental frequency of voice, vocal intensity, and voice quality. Current efforts to develop mechanical and computational models of voice production are also critically reviewed. Finally, issues and future challenges in developing a causal theory of voice production and perception are discussed.
Many previous laboratory investigations of phonation involving physical models, excised larynges, and in vivo canine larynges have failed to fully specify the subglottal system. Many of these same studies have reported a variety of nonlinear phenomena, including bifurcations (e.g., various classes of phonation onset and offset, register changes, frequency jumps), subharmonics, and chaos, and attributed such phenomena to the biomechanical properties of the larynx. However, such nonlinear phenomena may also be indicative of strong coupling between the voice source and the subglottal tract. Consequently, in such studies, it has not been clear whether the underlying mechanisms of such nonlinear phenomena were acoustical, biomechanical, or a coupling of the acoustical and biomechanical systems. Using a physical model of vocal fold vibration, and tracheal tube lengths which have been commonly reported in the literature, it is hypothesized and subsequently shown that such nonlinear phenomena may be replicated solely on the basis of laryngeal interactions with the acoustical resonances of the subglottal system. Recommendations are given for ruling out acoustical resonances as the source of nonlinear phenomena in future laboratory studies of phonation.
Objectives-To assess accuracy of the indentation method for stiffness measurements and to estimate the Young's modulus of the vocal fold using this technique. Study Design-Basic science.Methods-Indentation tests were performed using a range of indenter diameters and indentation depths on single and double layer silicone rubber models with a range of cover layer thicknesses with known geometry and Young's moduli. Measurements were repeated on intact vocal folds and isolated muscle and cover layer samples from three cadaveric human larynges.Results-Indentation on single-layer rubber models yielded Young's moduli with acceptable accuracy when the indentation depth was equal to or smaller than the indenter diameter, and both were smaller than the physical dimensions of the material sample. On two-layer models the stiffness estimation was similarly influenced by indenter diameter and indentation depth, and acceptable accuracy was reached when indentation depth was much smaller than the height of the top cover layer. Measurements on mid-membranous vocal fold tissue revealed location-dependent Young's moduli (in kPa) as follows: intact hemilarynx 8.6 (range 5.3 -13.1), isolated inferior medial surface cover 7.5 (range 7 -7.9), isolated medial surface cover 4.8 (range 3.9-5.7), isolated superior surface cover 2.9 (range 2.7 -3.2), and isolated thyroarytenoid muscle 2.0 (range 1.3 -2.7).Conclusions-Indenter diameter, indentation depth, and material thickness are important parameters in measurement of vocal fold stiffness using the indentation technique. Measurements on human larynges showed location-dependent differences in stiffness. The stiffness of the vocal folds was also found to be higher when the vocal fold structure was still attached to the laryngeal framework as compared to when the vocal fold was separated from the framework.
The goal of this study is to better understand the cause-effect relation between vocal fold physiology and the resulting vibration pattern and voice acoustics. Using a three-dimensional continuum model of phonation, the effects of changes in vocal fold stiffness, medial surface thickness in the vertical direction, resting glottal opening, and subglottal pressure on vocal fold vibration and different acoustic measures are investigated. The results show that the medial surface thickness has dominant effects on the vertical phase difference between the upper and lower margins of the medial surface, closed quotient, H1-H2, and higher-order harmonics excitation. The main effects of vocal fold approximation or decreasing resting glottal opening are to lower the phonation threshold pressure, reduce noise production, and increase the fundamental frequency. Increasing subglottal pressure is primarily responsible for vocal intensity increase but also leads to significant increase in noise production and an increased fundamental frequency. Increasing AP stiffness significantly increases the fundamental frequency and slightly reduces noise production. The interaction among vocal fold thickness, stiffness, approximation, and subglottal pressure in the control of F0, vocal intensity, and voice quality is discussed.
Current theories of voice production depend critically upon knowledge of the near field flow which emanates from the glottis. While most modern theories predict complex, three-dimensional structures in the near field flow, few investigations have attempted to quantify such structures. Using methods of flow visualization and digital particle image velocimetry, this study measured the near field flow structures immediately downstream of a self-oscillating, physical model of the vocal folds, with a vocal tract attached. A spatio-temporal analysis of the structures was performed using the method of empirical orthogonal eigenfunctions. Some of the observed flow structures included vortex generation, vortex convection, and jet flapping. The utility of such data in the future development of more accurate, low-dimensional models of voice production is discussed.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.