2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020
DOI: 10.1109/cvpr42600.2020.01136
|View full text |Cite
|
Sign up to set email alerts
|

STINet: Spatio-Temporal-Interactive Network for Pedestrian Detection and Trajectory Prediction

Abstract: Detecting pedestrians and predicting future trajectories for them are critical tasks for numerous applications, such as autonomous driving. Previous methods either treat the detection and prediction as separate tasks or simply add a trajectory regression head on top of a detector. In this work, we present a novel end-to-end two-stage network: Spatio-Temporal-Interactive Network (STINet). In addition to 3D geometry modeling of pedestrians, we model the temporal information for each of the pedestrians. To do so,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
32
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 64 publications
(32 citation statements)
references
References 23 publications
(44 reference statements)
0
32
0
Order By: Relevance
“…A new trend in representation of the acting agents for neural network models, were graph models [102][103][104][105][106][107][108][109]. Describing the connections between objects in a graph structure in opposition to an occupancy grid, is a huge advantage if there are only sparse connections between the objects.…”
Section: Gnns Attention and New Use Casesmentioning
confidence: 99%
“…A new trend in representation of the acting agents for neural network models, were graph models [102][103][104][105][106][107][108][109]. Describing the connections between objects in a graph structure in opposition to an occupancy grid, is a huge advantage if there are only sparse connections between the objects.…”
Section: Gnns Attention and New Use Casesmentioning
confidence: 99%
“…Future predictions are highly uncertain because of the unknown intents and behaviors of the agents [14,33,17,21,28,38]. In the field of autonomous driving, to model the high degree of multimodality, implicitly using latent variables is a popular approach [15,35,27,29].…”
Section: Related Workmentioning
confidence: 99%
“…The simple output of cutting-edge Crowd Human identification algorithms is provided. While significant progress has been achieved in pedestrian recognition [28], [29], identification in congested settings remains difficult. The conventional Non-Maximum Suppression (NMS) [30] has significant problems due to the severe occlusion of pedestrians.…”
Section: Pedestrian Detectionmentioning
confidence: 99%