2021 IEEE International Conference on Robotics and Automation (ICRA) 2021
DOI: 10.1109/icra48506.2021.9561754
|View full text |Cite
|
Sign up to set email alerts
|

Probabilistic 3D Multi-Modal, Multi-Object Tracking for Autonomous Driving

Abstract: Multi-object tracking is an important ability for an autonomous vehicle to safely navigate a traffic scene. Current state-of-the-art follows the tracking-by-detection paradigm where existing tracks are associated with detected objects through some distance metric. The key challenges to increase tracking accuracy lie in data association and track life cycle management. We propose a probabilistic, multi-modal, multiobject tracking system consisting of different trainable modules to provide robust and data-driven… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
64
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 125 publications
(64 citation statements)
references
References 27 publications
0
64
0
Order By: Relevance
“…Furthermore, the tracking method proposed in [7] uses a heuristic to create tracks and a greedy matching algorithm based on the Euclidean distance to associate CenterPoint object detections to tracks. Chiu et al [14] follows a similar strategy but makes use of a hybrid distance that combines the Mahalanobis distance and the so-called deep feature distance. Finally, the method introduced by Zaech et al [15] utilizes a network flow formulation and transforms the DA problem into a classification problem.…”
Section: B Performance Evaluationmentioning
confidence: 99%
“…Furthermore, the tracking method proposed in [7] uses a heuristic to create tracks and a greedy matching algorithm based on the Euclidean distance to associate CenterPoint object detections to tracks. Chiu et al [14] follows a similar strategy but makes use of a hybrid distance that combines the Mahalanobis distance and the so-called deep feature distance. Finally, the method introduced by Zaech et al [15] utilizes a network flow formulation and transforms the DA problem into a classification problem.…”
Section: B Performance Evaluationmentioning
confidence: 99%
“…For example, CenterPoint [1] achieved this by estimating the velocity of each object. Other methods [16], [2] exploited appearance information from camera to enhance the association quality. SimTrack [17] avoided heuristic matching step by joint detection and tracking in an end-to-end manner.…”
Section: B 3d Multi-object Trackingmentioning
confidence: 99%
“…In addition to bounding box information, multiple works include 2D/3D appearance features to substantiate pair-wise affinity representation [23]. Deep learning allows to learn semantic features via encoding: Popular methods [12], [22] use image classification networks as encoders for representing image data and PointNet [24] architectures for learning point cloud features.…”
Section: Related Workmentioning
confidence: 99%
“…While the detections are often derived only from a single sensor modality such as LiDAR or camera, the entirety of modalities can still be utilized for improved similarity finding of detections in the tracking task. As discussed in multiple prior works [22], [23], [26] sensor fusion is still an ongoing research matter. Our approach fuses 3D pose & motion features (3D-PM) from bounding boxes with 2D as well as 3D appearance features from (surround) cameras (2D-A), LiDAR (3D-AL) as well as radar sensors (3D-AR).…”
Section: A Feature Representationmentioning
confidence: 99%
See 1 more Smart Citation