Using the voices of six subjects, representing various social and educational backgrounds, fifty-four synthetic voices were generated by computer. Each normal voice was both increased and decreased in rate by 121/2, 25, 371/2, and 50 per cent. Judges evaluated the fifty-four voices using a series of adjectives representing two major personality factors of " competence " and " benevolence ". Several statistical analyses were performed, and it was found that the competence factor was much more sensitive to rate manipulations than was the benevolence factor. Ratings of competence were found to increase as rate increases and decrease as rate decreases, in a linear fashion. Benevolence had an inverted U-relationship with speech rate; the highest benevolence ratings occurred with normal speech rate.
A speech analysis-synthesis system was used to manipulate variance of fundamental frequency and a mechanical rate changer was used to manipulate speech rate. The synthesized and altered voices were tested for realism. Synthesized voices were mistaken for normal 50% to 58% of the time and rate-changed voices were mistaken for normal 78% of the time. Additional studies were conducted to test the effects of these acoustical manipulations on the adjective ratings judges made of speakers. Variance of intonation was increased and decreased by 50% for eight speakers. There was a significant trend for increased intonation to cause voices to be rated more "benevolent" by judges and decreased intonation to cause them to be rated less "benevolent." In two additional studies, rate was decreased and increased by varying amounts. Slowing the voices caused them to be rated less "competent." Speeding the voices caused them to be rated less "benevolent." Results were more consistent over speakers for rate manipulations than for intonation manipulations.
The human vocal cord is treated as an elastic material capable of propagating compressional, shear, and surface waves. A mathematical formulation for commonly observed vibrational modes is developed. This includes horizontal as well as vertical tissue displacements. Current vocal cord models are discussed and evaluated in terms of their ability to describe adequately the vibrational degrees of freedom of the cords.Subject Classification: 70.20; 20.40.
Utterances of two adults males were analyzed and synthesized by a fast Fourier Transforms method. Each of the two voices was synthesized in each of the twenty-seven combinations of three levels each of rate, mean FO, and variance of FO (a total of fifty-four "voices" generated from two). The effects of the rate, mean FO, and variance of FO manipulations, the interactive effects of rate and variance of FO, and the effects due to speaker were all statistically significant predictors of personality ratings given the voices. They accounted for 86%, 4%, 3%, 2%, and 1% of the variance, respectively, in competence ratings and 48%, 1%, 6%, 1%, and 8% of the variance, respectively, in benevolence ratings. Increased speaking rate was found to decrease the benevolence ratings, and decreased rate was found to decrease competence ratings. Decreased variance of FO was found to decrease the ratings on both competence and benevolence. Increased mean FO in these mhle voices was also found to decrease competence and benevolence ratings.Subject Classification: 70.30.
The purpose of the present study was to obtain detailed photographic sequences and lip motion data on which lip models for brass instruments may be more accurately based. The study expands upon an earlier study by Martin [J. Acoust. Soc. Am. 13, 305–307 (1942)] by using advanced fiber-optic stroboscopy, a real instrument mouthpiece, and by studying two dynamic levels. The trombone was selected as representative of the brass family because its relatively large mouthpiece permitted the use of an optic probe. Lip motion was observed from the front and side for six notes (Bflat2, F3, Bflat3, D4, F4, Aflat4) played at loud and soft dynamic levels. The video sequences were used to obtain information on lip opening area, lip motion perpendicular to airflow, and lip motion parallel to airflow. The data are tabulated and represented in graphic form.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.