Three subjects were given extensive practice in discriminating syllables which differed in voice onset time. For these subjects, there were two major findings. First, discrimination of speech follows normal psychophysical laws: long-onset-time stimuli require larger differences than shorter ones for comparable discrimination. Second, the shape of the discrimination function for experienced subjects is more like a leaning W than an inverted V, the usual shape for naive subjects. The data support a model of speech perception with both an acoustic and a phonetic component. The phonetic component is best characterized as a prototype matching process, with the prototype including information on the simultaneity of formant onset.For the last 20 years, the laws governing speech perception have been thought to differ from the laws governing psychophysical perception. In most psychophysical experiments, subjects can discriminate many more stimuli than they can identify. In most speech perception studies, discrimination seems to be bounded by identification; subjects can only discriminate two speech items if they can give them different phonetic labels. Speech perception appears to be categorical. Largely on the basis of this finding, researchers at the Haskins Laboratories spectrographic analysis, these studies established that two acoustic features are most important in the transition from voiced to voiceless stops. In voiced stops, the first formant (Fl) begins at the same time as the higher formants. Removing the initial portion of the first formant, and thereby delaying its onset, leads to the perception of voicelessness. A more realistic continuum is obtained if the higher formants are aspirated (energized by a noise source) during the period of Fl cutback.The Haskins researchers have generally tested discrimination with the ABX paradigm. In this paradigm, subjects hear three syllables per trial. The first two (A and B) always differ from each other, while the third (X) is identical to one of the first two. The subject's task is to determine if X is the same as A orB.Liberman et al. (1961) used this paradigm in their study of voicing. The authors synthesized a continuum of speech syllables which varied in voice onset time (VaT) by varying the Fl cutback and aspiration cues in lO-msec steps. The other parameters were appropriate for an alveolar consonant followed by the vowel 10/, yielding a continuum perceived as Idol at one end (O-msec VaT) and as Itol at the other (60-msec VaT). The data generally indicate better discrimination between phonetic categories than within them, an example of categorical perception.Early Haskins papers (e.g., Liberman et aI., 1967; Liberman et aI., Note 1) cited this finding as evidence for a motor theory of speech perception. In more recent work (e.g., , no specific mechanism has been offered which would produce the categorical results, but the general position of a special speech mode has been maintained. As Liberman (1970) puts it, "The [speech] decoder is not merely an extension ...