In a reverberant environment, sounds reach the ears through several paths. Although the direct sound is followed by multiple reflections, which would be audible in isolation, the first-arriving wavefront dominates many aspects of perception. The "precedence effect" refers to a group of phenomena that are thought to be involved in resolving competition for perception and localization between a direct sound and a reflection. This article is divided into five major sections. First, it begins with a review of recent work on psychoacoustics, which divides the phenomena into measurements of fusion, localization dominance, and discrimination suppression. Second, buildup of precedence and breakdown of precedence are discussed. Third measurements in several animal species, developmental changes in humans, and animal studies are described. Fourth, recent physiological measurements that might be helpful in providing a fuller understanding of precedence effects are reviewed. Fifth, a number of psychophysical models are described which illustrate fundamentally different approaches and have distinct advantages and disadvantages. The purpose of this review is to provide a framework within which to describe the effects of precedence and to help in the integration of data from both psychophysical and physiological experiments. It is probably only through the combined efforts of these fields that a full theory of precedence will evolve and useful models will be developed.
Informational masking ͑IM͒ has a long history and is currently receiving considerable attention. Nevertheless, there is no clear and generally accepted picture of how IM should be defined, and once defined, explained. In this letter, consideration is given to the problems of defining IM and specifying research that is needed to better understand and model IM.
Informational masking was reduced using three stimulus presentation schemes that were intended to perceptually segregate the signal from the masker. The maskers were sets of sinusoids chosen randomly in frequency and intensity on each stimulus interval or, in some conditions, on every masker burst in a series of bursts within intervals. Masker components were excluded from the frequency region surrounding the 1000-Hz signal to minimize the energetic masking. Masked thresholds as great as 60–70 dB above quiet threshold were observed for some subjects in some conditions. It was shown that this informational masking could be reduced as much as 40 dB by: (1) presenting the masker to both ears and signal to one ear; (2) playing different masker samples sequentially in each interval of every trial; or (3) presenting the signal in alternate bursts of multiple, identical masker samples. For the binaural manipulation, informational masking was reduced because the masker and signal were perceived as originating from different interaural locations. In the latter two manipulations, a difference in the spectral or temporal pattern of the signal and masker provided the detection cue. These effects were interpreted as evidence of the importance of perceptual segregation of sounds in noisy listening environments where signal reception is not limited by energetic masking.
A method for calculating psychophysical performance limits based on stochastic neural responses is introduced and compared to previous analytical methods for evaluating auditory discrimination of tone frequency and level. The method uses signal detection theory and a computational model for a population of auditory nerve (AN) fiber responses. The use of computational models allows predictions to be made over a wider parameter range and with more complete descriptions of AN responses than in analytical models. Performance based on AN discharge times (all-information) is compared to performance based only on discharge counts (rate-place). After the method is verified over the range of parameters for which previous analytical models are applicable, the parameter space is then extended. For example, a computational model of AN activity that extends to high frequencies is used to explore the common belief that rate-place information is responsible for frequency encoding at high frequencies due to the rolloff in AN phase locking above 2 kHz. This rolloff is thought to eliminate temporal information at high frequencies. Contrary to this belief, results of this analysis show that rate-place predictions for frequency discrimination are inconsistent with human performance in the dependence on frequency for high frequencies and that there is significant temporal information in the AN up to at least 10 kHz. In fact, the all-information predictions match the functional dependence of human performance on frequency, although optimal performance is much better than human performance. The use of computational AN models in this study provides new constraints on hypotheses of neural encoding of frequency in the auditory system; however, the method is limited to simple tasks with deterministic stimuli. A companion article in this issue ("Evaluating Auditory Performance Limits: II") describes an extension of this approach to more complex tasks that include random variation of one parameter, for example, random-level variation, which is often used in psychophysics to test neural encoding hypotheses.
A model for the subjective lateral position of 500-Hz tones is presented and compared with experimental lateralization data. Previous papers in this series have explicitly described the auditory-nerve response to these stimuli and proposed a binaural displayer that interaurally compares the auditory-nerve firing times. The outputs of the displayer are postulated to represent the only information about detailed firing times that is available to the brain. In the present paper, lateral-position predictions are obtained by a central nonoptimal weighting of these outputs that depends on the interaural intensity difference of the tone. These predictions describe the results of lateralization-matching experiments more accurately and over a wider range of stimulus conditions than previous theories, except for those results which suggest that low-frequency binaural tones can generate multiple perceptual images. The predictions of our model are also consistent with the results of centering and laterality-comparison experiments. It is argued that the data discussed in this paper are generally incompatible with theories that propose a peripheral interaction of interaural timing and intensity information such as the latency hypothesis.
Natural environments typically contain sound sources other than the source of interest that may interfere with the ability of listeners to extract information about the primary source. Studies of speech intelligibility and localization by normal-hearing listeners in the presence of competing speech are reported on in this work. One, two or three competing sentences [IEEE Trans. Audio Electroacoust. 17(3), 225-246 (1969)] were presented from various locations in the horizontal plane in several spatial configurations relative to a target sentence. Target and competing sentences were spoken by the same male talker and at the same level. All experiments were conducted both in an actual sound field and in a virtual sound field. In the virtual sound field, both binaural and monaural conditions were tested. In the speech intelligibility experiment, there were significant improvements in performance when the target and competing sentences were spatially separated. Performance was similar in the actual sound-field and virtual sound-field binaural listening conditions for speech intelligibility. Although most of these improvements are evident monaurally when using the better ear, binaural listening was necessary for large improvements in some situations. In the localization experiment, target source identification was measured in a seven-alternative absolute identification paradigm with the same competing sentence configurations as for the speech study. Performance in the localization experiment was significantly better in the actual sound-field than in the virtual sound-field binaural listening conditions. Under binaural conditions, localization performance was very good, even in the presence of three competing sentences. Under monaural conditions, performance was much worse. For the localization experiment, there was no significant effect of the number or configuration of the competing sentences tested. For these experiments, the performance in the speech intelligibility experiment was not limited by localization ability.
Previous work has indicated that target-masker similarity, as well as stimulus uncertainty, influences the amount of informational masking that occurs in detection, discrimination, and recognition tasks. In each of five experiments reported in this paper, the detection threshold for a tonal target in random multitone maskers presented simultaneously with the target tone was measured for two conditions using the same set of five listeners. In one condition, the target was constructed to be "similar" (S) to the masker; in the other condition, it was constructed to be "dissimilar" (D) to the masker. The specific masker varied across experiments, but was constant for the two conditions. Target-masker similarity varied in dimensions such as duration, perceived location, direction of frequency glide, and spectro-temporal coherence. Group-mean results show large decreases in the amount of masking for the D condition relative to the S condition. In addition, individual differences (a hallmark of informational masking) are found to be much greater in the S condition than in the D condition. Furthermore, listener vulnerability to informational masking is found to be consistent to at least a moderate degree across experiments.
Sensitivity to interaural time difference (ITD) in constant-amplitude pulse trains was measured in four sequentially implanted bilateral cochlear implant (CI) subjects. The sensitivity measurements were made as a function of time beginning directly after the second ear was implanted, continued for periods of months before subjects began wearing bilateral sound processors, and extended for months while the subjects used bilateral sound processors in day-to-day listening. Measurements were also made as a function of the relative position of the left/right electrodes. The two subjects with the shortest duration of binaural deprivation before implantation demonstrated ITD sensitivity soon after second-ear implantation (before receiving the second sound processor), while the other two did not demonstrate sensitivity until after months of daily experience using bilateral processors. The interaural mismatch in electrode position required to decrease ITD sensitivity by a factor of 2 (half-width) for CI subjects was five times greater than the half-width for interaural carrier-frequency disparity in normal-hearing subjects listening to sinusoidally amplitude-modulated high-frequency tones. This large half-width is likely to contribute to poor binaural performance in CI users, especially in environments with multiple broadband sound sources.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.