Two types of analyses have been performed on the measured durations of recordings produced by six talkers reading two scripts of approximately 300 words each. The texts, the combined visual–auditory marking technique, and preliminary results were reported earlier by Crystal and House [J. Acoust. Soc. Am. 72, 705–716 (1982)]. The average durations and standard deviations of various classes of speech sounds, as well as individual speech sounds, have been determined and segmental measurements are compared to earlier data and to various pertinent published reports. The histograms of the measured durations of various sounds and categories have been fitted with distributions which are, equivalently, the exit-probability sequence for a Markov chain or the impulse response of an IIR digital-filter network.
The data base, methods for a study of the durations of phonetic units in connected speech, and some preliminary results are described. From readings of two scripts by many talkers, two sets of seven talkers each were selected, based on total reading time, to form a fast group and a slow group of talkers. Using computer graphics and digital playback procedures, the recordings were segmented into breath groups and pauses, and the first four sentences in each script were segmented into phones. The hold and release (that is, plosion and/or frication) portions of stops were identified and measured; less than 50% of the stops included releases. To establish the usefulness of the data base, the first-order statistics of the phonetic segments were determined, and a variety of durational characteristics were compared to existing reports. Analysis of number of breath groups, phonation time, and pause characterized the difference between so-called average fast and average slow talkers; however, no script-independent measure of these variables was found which would accurately predict the classification of individual talkers. The mean durations of various phonetic categories showed essentially the same percentage change when the fast and slow talkers were compared. Preliminary analyses of contextual influences on durations showed some expected changes, and also indicated that certain traditional predictions may not hold for informal connected speech. Gamma functions were fitted to the distributions of durations of various gross categories.
Further analyses have been made on readings of two scripts by six talkers [T. H. Crystal and A. S. House, J. Acoust. Soc. Am. 72, 705-716 (1982); 84, 1932-1935 (1988); 83, 1553-1573 (1988)]. Durations of syllables and stress groups are compared to earlier data and various pertinent published reports, and are used to evaluate reports of articulation rate variability. The average durations of syllables of different complexity have a quasilinear dependency on the number of phones in the syllable, where the linear factor and the vowel durations are functions of stress. The duration of stress groups has a quasilinear dependency on the number of syllables and the number of phones. It was found that variability of articulation rate, measured as the average syllable duration for interpause intervals (runs), is not random, but is the natural consequence of the content of the run. Durations of comparable runs of different talkers are highly correlated. Major determinants of average syllable duration are the average number of phones per syllable and the proportion of either + stress phones or + stress syllables in the run.
Analyses into the effect of syllabic stress on the durations of speech sounds have been performed on the recordings produced by six talkers reading two scripts of approximately 300 words each. The texts, the combined visual-auditory marking technique, and preliminaryresults were reported earlier in Crystal and House [J. Acoust. Soc. Am. 72, 705-716 (1982) ] and further results were reported in Crystal and House [J. Acoust. Soc. Am. 83, 1553-1573 (1988a) ]. The average durations and standard deviations of various classes of speech sounds, as well as individual speech sounds, have been determined in syllables where the stress characteristic is known. Measurements are compared to earlier data and to various pertinent published reports.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.