2018 IEEE International Conference on Robotics and Automation (ICRA) 2018
DOI: 10.1109/icra.2018.8460504
|View full text |Cite
|
Sign up to set email alerts
|

Social Attention: Modeling Attention in Human Crowds

Abstract: Robots that navigate through human crowds need to be able to plan safe, efficient, and human predictable trajectories. This is a particularly challenging problem as it requires the robot to predict future human trajectories within a crowd where everyone implicitly cooperates with each other to avoid collisions. Previous approaches to human trajectory prediction have modeled the interactions between humans as a function of proximity. However, that is not necessarily true as some people in our immediate vicinity… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
498
1

Year Published

2018
2018
2021
2021

Publication Types

Select...
5
2
2

Relationship

0
9

Authors

Journals

citations
Cited by 605 publications
(499 citation statements)
references
References 35 publications
0
498
1
Order By: Relevance
“…Following [11], [13], [14], we compare these methods in terms of the Average Displacement Error (ADE) and Final Displacement Error (FDE). The ADE is defined as the mean Euclidean distance between predicted coordinates and the ground truth.…”
Section: Methodsmentioning
confidence: 99%
“…Following [11], [13], [14], we compare these methods in terms of the Average Displacement Error (ADE) and Final Displacement Error (FDE). The ADE is defined as the mean Euclidean distance between predicted coordinates and the ground truth.…”
Section: Methodsmentioning
confidence: 99%
“…Many approaches jointly model the past motion of multiple agents in the scene to capture interaction between agents [5], [15], [12], [10], [7], [11]. This is typically done by pooling the RNN states of individual agents in a social tensor [5], [12], [11], using graph neural networks [16] or by modeling pairwise distances between agents along with max pooling [8], [10], [7].…”
Section: Related Studiesmentioning
confidence: 99%
“…The model captures different styles of navigation but does not make any differences between structured and unstructured environments. [21] handles prediction using a spatio-temporal graph which models both position evolution and interaction between pedestrians. [9] embodies vislet information within the social-pooling mechanism also relying on mutual faces orientation to augment space perception.…”
Section: Related Workmentioning
confidence: 99%