2022
DOI: 10.1007/978-3-031-20047-2_1
|View full text |Cite
|
Sign up to set email alerts
|

ByteTrack: Multi-object Tracking by Associating Every Detection Box

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
274
1

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 732 publications
(427 citation statements)
references
References 62 publications
2
274
1
Order By: Relevance
“…We also evaluated a few recent state-of-the-art models on the PersonPath22 dataset, including 1), zero-shot IdFree [52] model in which the embedding component is trained without any person identity annotation, 2), TrackFormer [41] whose underlying detector is transformer based Detr [10], as well as 3), Byte-Track [67] that is based on the state-of-the-art singe-tage YOLOX detector [28]. As can be clearly seen, ByteTrack achieves the best MOTA and IDF1, and the zero-shot IDFree model outperforms most recent state-of-the-art tracking models even though it is not trained on the target PersonPath22 dataset.…”
Section: Model Evaluationmentioning
confidence: 99%
See 1 more Smart Citation
“…We also evaluated a few recent state-of-the-art models on the PersonPath22 dataset, including 1), zero-shot IdFree [52] model in which the embedding component is trained without any person identity annotation, 2), TrackFormer [41] whose underlying detector is transformer based Detr [10], as well as 3), Byte-Track [67] that is based on the state-of-the-art singe-tage YOLOX detector [28]. As can be clearly seen, ByteTrack achieves the best MOTA and IDF1, and the zero-shot IDFree model outperforms most recent state-of-the-art tracking models even though it is not trained on the target PersonPath22 dataset.…”
Section: Model Evaluationmentioning
confidence: 99%
“…ByteTrack [67]. For ByteTrack, the detector is YOLOX [28] with YOLOX-X as the backbone and COCO-pretrained model as the initialized weights.…”
Section: D2 Implementation Detailsmentioning
confidence: 99%
“…A sample image and its corresponding annotation are shown in Figure 2A. The dataset consists of eight parts, with Four state-of-the-art MOT methods are tested on the proposed dataset, which are ByteTrack (Zhang et al, 2021a), ByteTrack with NSA Kalman filter (Du et al, 2021), FairMOT (Zhang et al, 2021b), and SORT (Bewley et al, 2016) 2 . They are finetuned on our dataset using their default hyperparameters.…”
Section: Image Annotation and Dataset Constructionmentioning
confidence: 99%
“…All raw videos were recorded in outdoor scenarios. For each video, we first performed ByteTrack [48] to generate human bounding boxes with unified IDs, and used them to crop out RGB sequences of each identity, which is then split into several short sequences (about 200 frames). After that, we merged the sequences of the same identity from different videos and manually labeled the clothes IDs.…”
Section: The Rccvreid Datasetmentioning
confidence: 99%