The early recognition and understanding of the actions performed by pedestrians in traffic scenes leads to an anticipation of pedestrian intentions in advance and helps in the process of collision warning and avoidance in the context of autonomous vehicles. An environment with low visibility conditions such as night-time, fog, heavy rain or smoke increases the number of difficult situations in traffic. A complete and original model for assessing if a pedestrian is engaged in a street cross action using only infrared monocular scene perception is proposed in this paper. The assessment of a street cross action is done by the time series analysis of features like: pedestrian motion, position of pedestrians with respect to the drivable area and their distance with respect to the ego-vehicle. The extraction of these features emerges from the combination of a deep learning based pedestrian detector with an original tracking algorithm, a semantic segmentation of the road surface and a time series long-short term memory network based action recognition. In order to validate the proposed method we introduce a new dataset named CROSSIR. It is formed of pedestrian annotations, action annotations and semantic labels for the road. The CROSSIR dataset is suitable for several common computer vision algorithms: (1) pedestrian detection and tracking algorithms because each pedestrian has a unique identifier over the frames in which it appears; (2) pedestrian action recognition; (3) semantic segmentation of the road pixels in the infrared image.
This paper presents a method for the detection and localization of stop-lines and other horizontal road markings as bicycle or pedestrian crossings encountered in road environments. The few existing stop-line detection approaches found in literature are based on monocular vision, and therefore cannot infer high accuracy information from the visual data. The proposed solution uses a hybrid approach that combines 2D detections in grayscale images with stereo-vision based 3D validations in order to increase the robustness and accuracy of the results. Model based reasoning is also used to eliminate false positive situations and to detect exactly the objects of interest (stop-lines, pedestrian or bicycle crossings). The detected horizontal road markings can be used as landmarks for a very accurate longitudinal localization of the ego vehicle within intersection scenarios compensating missing GPS data or its lack of accuracy.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.