A theory of vocal fold oscillation is developed on the basis of the body-cover hypothesis. The cover is represented by a distributed surface layer that can propagate a mucosal surface wave. Linearization of the surface-wave displacement and velocity, and further small-amplitude approximations, yields closed-form expressions for conditions of oscillation. The theory predicts that the lung pressure required to sustain oscillation, i.e., the oscillation threshold pressure, is reduced by reducing the mucosal wave velocity, by bringing the vocal folds closer together and by reducing the convergence angle in the glottis. The effect of vocal tract acoustic loading is included. It is shown that vocal tract inertance reduces the oscillation threshold pressure, whereas vocal tract resistance increases it. The treatment, which is applicable to falsetto and breathy voice, as well as onset or release of phonation in the absence of vocal fold collision, is harmonized with former treatments based on two-mass models and collapsible tubes.
The efficacious effects of a lip semi-occlusion can also be realized for nonoccluded vocal tracts by a combination of vocal fold adduction and epilarynx tube adjustments. It is reasoned that therapy approaches are designed to match the glottal impedance to the input impedance of the vocal tract.
A theory of interaction between the source of sound in phonation and the vocal tract filter is developed. The degree of interaction is controlled by the cross-sectional area of the laryngeal vestibule ͑epilarynx tube͒, which raises the inertive reactance of the supraglottal vocal tract. Both subglottal and supraglottal reactances can enhance the driving pressures of the vocal folds and the glottal flow, thereby increasing the energy level at the source. The theory predicts that instabilities in vibration modes may occur when harmonics pass through formants during pitch or vowel changes. Unlike in most musical instruments ͑e.g., woodwinds and brasses͒, a stable harmonic source spectrum is not obtained by tuning harmonics to vocal tract resonances, but rather by placing harmonics into favorable reactance regions. This allows for positive reinforcement of the harmonics by supraglottal inertive reactance ͑and to a lesser degree by subglottal compliant reactance͒ without the risk of instability. The traditional linear source-filter theory is encumbered with possible inconsistencies in the glottal flow spectrum, which is shown to be influenced by interaction. In addition, the linear theory does not predict bifurcations in the dynamical behavior of vocal fold vibration due to acoustic loading by the vocal tract.
Comparison is drawn between male and female larynges on the basis of overall size, vocal fold membranous length, elastic properties of tissue, and prephonatory glottal shape. Two scale factors are proposed that are useful for explaining differences in fundamental frequency, sound power, mean airflow, and glottal efficiency. Fundamental frequency is scaled primarily according to the membranous length of the vocal folds (scale factor of 1.6), whereas mean airflow, sound power, glottal efficiency, and amplitude of vibration include another scale factor (1.2) that relates to overall larynx size. Some explanations are given for observed sex differences in glottographic waveforms. In particular, the simulated (computer-modeled) vocal fold contact area is used to infer male-female differences in the shape of the glottis. The female glottis appears to converge more linearly (from bottom to top) than the male glottis, primarily because of medial surface bulging of the male vocal folds.
A finite-element model of the vocal fold is developed from basic laws of continuum mechanics to obtain the oscillatory characteristics of the vocal folds. The model is capable of accommodating inhomogeneous, anisotropic material properties and irregular geometry of the boundaries. It has provisions for asymmetry across the midplane, both from the geometric and tension point of view, which enables one to simulate certain kinds of voice disorders due to vocal-fold paralysis. It employs the measured viscoelastic properties of the vocal-fold tissues. The detailed construction of the matrix differential equations of motion is presented followed by the solution scheme. Finally, typical results are presented and validated using an eigenvalue method and a commercial finite-element package (ABAQUS).
Proteoglycans, particularly hyaluronic acid, play important roles in determining biomechanical properties of tissue oscillation. Future research will likely make these proteins of important therapeutic interest.
There have been considerable research efforts in the area of vocal tract modeling but there is still a small body of information regarding direct 3-D measurements of the vocal tract shape. The purpose of this study was to acquire, using magnetic resonance imaging (MRI), an inventory of speaker-specific, three-dimensional, vocal tract air space shapes that correspond to a particular set of vowels and consonants. A set of 18 shapes was obtained for one male subject who vocalized while being scanned for 12 vowels, 3 nasals, and 3 plosives. The 3-D shapes were analyzed to find the cross-sectional areas evaluated within planes always chosen to be perpendicular to the centerline extending from the glottis to the mouth to produce an "area function." This paper provides a speaker-specific catalogue of area functions for 18 vocal tract shapes. Comparisons of formant locations extracted from the natural (recorded) speech of the imaged subject and from simulations using the newly acquired area functions show reasonable similarity but suggest that the imaged vocal tract shapes may be somewhat centralized. Additionally, comparisons of the area functions reported in this study are compared with those from four previous studies and demonstrate general similarities in shape but also obvious differences that can be attributed to differences in imaging techniques, image processing methods, and anatomical differences of the imaged subjects.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.