It is well known that the detection thresholds for stationary auditory and visual signals are lower if the signals are presented bimodally rather than unimodally, provided the signals coincide in time and space. Recent work on auditory-visual motion detection suggests that the facilitation seen for stationary signals is not seen for motion signals. We investigate the conditions under which motion perception also benefits from the integration of auditory and visual signals. We show that the integration of cross-modal local motion signals that are matched in position and speed is consistent with thresholds predicted by a neural summation model. If the signals are presented in different hemi-fields, move in different directions, or both, then behavioural thresholds are predicted by a probability-summation model. We conclude that cross-modal signals have to be co-localised and co-incident for effective motion integration. We also argue that facilitation is only seen if the signals contain all localisation cues that would be produced by physical objects.
Abstract-Based on an information theoretical approach, we investigate feature selection processes in saccadic object and scene analysis. Saccadic eye movements of human observers are recorded for a variety of natural and arti cial test images. These experimental data are used for a statistical evaluation of the xated image regions. Analysis of second-order statistics indicates that regions with higher spatial variance have a higher probability to be xated, but no signi cant differences beyond these variance effects could be found at the level of power spectra. By contrast, an investigation with higher-order statistics, as re ected in the bispectral density, yielded clear structural differences between the image regions selected by saccadic eye movements as opposed to regions selected by a random process. These results indicate that nonredundant, intrinsically two-dimensional image features like curved lines and edges, occlusions, isolated spots, etc. play an important role in the saccadic selection process which must be integrated with top-down knowledge to fully predict object and scene analysis by human observers.
Local intrinsic dimensionality is shown to be an elementary structural property of multidimensional signals that cannot be evaluated using linear filters. We derive a class of polynomial operators for the detection of intrinsically 2-D image features like curved edges and lines, junctions, line ends, etc. Although it is a deterministic concept, intrinsic dimensionality is closely related to signal redundancy since it measures how many of the degrees of freedom provided by a signal domain are in fact used by an actual signal. Furthermore, there is an intimate connection to multidimensional surface geometry and to the concept of ;Gaussian curvature'. Nonlinear operators are inevitably required for the processing of intrinsic dimensionality since linear operators are, by the superposition principle, restricted to OR-combinations of their intrinsically 1-D eigenfunctions. The essential new feature provided by polynomial operators is their potential to act on multiplicative relations between frequency components. Therefore, such operators can provide the AND-combination of complex exponentials, which is required for the exploitation of intrinsic dimensionality. Using frequency design methods, we obtain a generalized class of quadratic Volterra operators that are selective to intrinsically 2-D signals. These operators can be adapted to the requirements of the signal processing task. For example, one can control the "curvature tuning" by adjusting the width of the stopband for intrinsically 1-D signals, or the operators can be provided in isotropic and in orientation-selective versions. We first derive the quadratic Volterra kernel involved in the computation of Gaussian curvature and then present examples of operators with other arrangements of stop and passbands. Some of the resulting operators show a close relationship to the end-stopped and dot-responsive neurons of the mammalian visual cortex.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.