Multi-Step Prediction of Occupancy Grid Maps With Recurrent Neural Networks

Mohajerin, Nima; Rohani, Mohsen

doi:10.1109/cvpr.2019.01085

Cited by 67 publications

(46 citation statements)

References 27 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Для детекции динамических объектов в карте занятости, как правило, используются изображения c камер и нейросетевые подходы [5,6]. Другое семейство алгоритмов для обнаружения динамики также использует сверточные сети, но в качестве входа подаются классические карты занятости [7,8]. В то же время конкурсы по обнаружению и трекингу трехмерных движущихся объектов [9,10] показывают, что подавляющее большинство наиболее точных методов используют данные лидара и выдают информацию в виде вектора с координатами, ориентацией и размерами объекта [11,12].…”

Section: Algorithm For Complexing Multiple Data Sources Into a Single Occupancy Mapunclassified

Algorithm for Complexing Multiple Data Sources Into a Single Occupancy Map

Shepel¹

2021

IZVESTIYA SFedU. ENGINEERING SCIENCES

View full text Add to dashboard Cite

show abstract

Section: Algorithm For Complexing Multiple Data Sources Into a Single Occupancy Mapunclassified

Algorithm for Complexing Multiple Data Sources Into a Single Occupancy Map

Shepel¹

2021

IZVESTIYA SFedU. ENGINEERING SCIENCES

View full text Add to dashboard Cite

show abstract

“…A traditional approach having considered both static and dynamic interactions is a named "Social Force" model (Helbing and Molnar 1995), defining interactions as forces upon the pedestrians. Modern socially-aware methods usually use recurrent neural networks (Mikolov et al 2010) for trajectory prediction (Fernando et al 2018a;Kitani et al 2011;Ryoo et al 2014;Srivastava, Mansimov, and Salakhudinov 2015;Vemula, Muelling, and Oh 2018;Liang et al 2019;Chung et al 2014;Mohajerin and Rohani 2019;Liu et al 2016;Shi et al 2019) and introduce attention mechanism to interaction measure (Bhattacharyya, Fritz, and Schiele 2018;Choi and Dariush 2019) and social behavior understanding (Sadeghian et al 2019;Haddad et al 2019;Al-Molegi, Jabreel, and Martínez-Ballesté 2018;Varshneya and Srinivasaraghavan 2017). Also, Generative Adversarial Network (GAN) model is designed to generate multiple reasonable trajectories (Gupta et al 2018;Fernando et al 2018b;Amirian, Hayet, and Pettré 2019).…”

Section: Research On Trajectory Prediction Task Considering Interactionsmentioning

confidence: 99%

CF-LSTM: Cascaded Feature-Based Long Short-Term Networks for Predicting Pedestrian Trajectory

Yang

2020

AAAI

View full text Add to dashboard Cite

Pedestrian trajectory prediction is an important but difficult task in self-driving or autonomous mobile robot field because there are complex unpredictable human-human interactions in crowded scenarios. There have been a large number of studies that attempt to understand humans' social behavior. However, most of these studies extract location features from previous one time step while neglecting the vital velocity features. In order to address this issue, we propose a novel feature-cascaded framework for long short-term network (CF-LSTM) without extra artificial settings or social rules. In this framework, feature information from previous two time steps are firstly extracted and then integrated as a cascaded feature to LSTM, which is able to capture the previous location information and dynamic velocity information, simultaneously. In addition, this scene-agnostic cascaded feature is the external manifestation of complex human-human interactions, which can also effectively capture dynamic interaction information in different scenes without any other pedestrians' information. Experiments on public benchmark datasets indicate that our model achieves better performance than the state-of-the-art methods and this feature-cascaded framework has the ability to implicitly learn human-human interactions.

show abstract

“…Secondly, there are many factors that are relevant to the trajectory of an agent, such as the nature of the ground, the mental state of the driver and the pedestrian, etc. To address these challenges, many trajectory prediction methods have been proposed which can be roughly classified into two categories, i.e., the coordinate-based methods [1,2,3,4] and the vision-based methods [5,6,7,8,9].…”

Section: Introductionmentioning

confidence: 99%

“…However, the interaction features obtained by these coordinate-based methods do not have any physical meaning that can be visualized for interpretation. On the other hand, the vision-based methods are able to use semantic images, including camera images, LiDAR point cloud data, and Occupancy Grid Maps (OGMs) [5], for global interaction feature extraction. Work [6] generate the prediction sequences in the form of images by Convolutional Neural Networks (CNNs).…”

Section: Introductionmentioning

confidence: 99%

Visionnet: A Coarse-To-Fine Motion Prediction Algorithm Based On Active Interaction-Aware Drivable Space Learning

Ren¹,

Zhu²,

Fan

et al. 2021

2021 IEEE International Conference on Multimedia and Expo (ICME)

View full text Add to dashboard Cite

Trajectory prediction is a fundamental task in many real applications such as autonomous robotics and video surveillance. In this paper, we propose a novel vision-based trajectory prediction method which is able to extract the interactive features by active global interaction-aware drivable space learning. The learned global interaction-aware drivable spaces denote the areas with low occupation probabilities, which provide the regions and directions that the agents can move into. Specifically, our method describes a sequence of motion states, i.e. the location, the velocity and the acceleration, as occupancy grid maps, and then use them to train the deep learning model in the supervised manner. Moreover, an interactive loss for training the inference net of drivable spaces and trajectory prediction net simultaneously is introduced. Comparisons with state-of-the-art methods on benchmark datasets demonstrate the effectiveness of the proposed method.

show abstract

Multi-Step Prediction of Occupancy Grid Maps With Recurrent Neural Networks

Cited by 67 publications

References 27 publications

Algorithm for Complexing Multiple Data Sources Into a Single Occupancy Map

Algorithm for Complexing Multiple Data Sources Into a Single Occupancy Map

CF-LSTM: Cascaded Feature-Based Long Short-Term Networks for Predicting Pedestrian Trajectory

Visionnet: A Coarse-To-Fine Motion Prediction Algorithm Based On Active Interaction-Aware Drivable Space Learning

Contact Info

Product

Resources

About