This paper discusses a unified decision theoretic framework f o r automating the establishment of feature point correspondences in a temporally dense sequence of images. The proposed approach, extends a recent sequential detection algorithm [l], t o guide the detection and tracking of object feature points through an image sequence. The resultant extended feature tracks provide robust feature correspondences, for the estimation of three-dimensional structure and motion, over an extended number of image frames.
In real-world decision and control applications requiring motion information as sensory input, the detection and estimation of motion from image sequences is constrained by the available computational resources (time,memory,cpu). In this context, a flexible image processing algorithm which can balance computational resources against desired accuracy in motion estimation has been developed. This algorithm extends the Multistage Hypothesis Testing algorithm [2] to detect and track moving objects in a computationally constrained environment. Unlike standard feature-based or gradient-based techniques for motion analysis, the extended MHT algorithm (1) integrates its motion estimates across multiple image frames, and (2) directs computational resources to the moving regions of interest.The MHT algorithm can be used to detect and track a variety of point and/or linear object features. The initial detection of object features is posed as a rate-constrained detection problem [l]. This ensures that the number of detected features does not overload the computational resources of the motion estimator. The MHT algorithm exploits the timeoptimality of sequential hypothesis testing (Truncated Sequential Probability Ratio Tests) to rapidly search the large space of possible feature trajectories. The resultant detected trajectories are clustered by the hypothesis-tree data structure of the MHT algorithm. This allows for robust motion estimation incorporating information from previous image frames. Multiple trajectory segments are then extended in time, yielding a set of feature trajectories in the image plane. Three-dimensional motion information is then extracted from these 2dimensional feature trajectories. The resulting system provides flexible and efficient motion detection and feature correspondence, within computational constraints, suitable for input to the motion estimation module of an intelligent autonomous system.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.