2023
DOI: 10.48550/arxiv.2302.11867
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Transformers in Single Object Tracking: An Experimental Survey

Abstract: Single object tracking is a well-known and challenging research topic in computer vision. Over the last two decades, numerous researchers have proposed various algorithms to solve this problem and achieved promising results. Recently, Transformer-based tracking approaches have ushered in a new era in single object tracking due to their superior tracking robustness. Although several survey studies have been conducted to analyze the performance of trackers, there is a need for another survey study after the intr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
6
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 93 publications
0
6
0
Order By: Relevance
“…However, the large number of matrix multiplication computations required by the attention mechanism used dramatically slows down the inference while improving the performance; as in, for example, the recent SeqTrack [14] and MixFormer [15], which, although they achieved 72.5% and 70.0% success rates with the LaSOT dataset, respectively, utilized 535.85 G and 113.02 G model flops and 308.98 M and 195.40 M parameters for the best results. At the same time, their speeds were only 5.81 and 8.02 FPS [16].…”
Section: Introductionmentioning
confidence: 92%
See 1 more Smart Citation
“…However, the large number of matrix multiplication computations required by the attention mechanism used dramatically slows down the inference while improving the performance; as in, for example, the recent SeqTrack [14] and MixFormer [15], which, although they achieved 72.5% and 70.0% success rates with the LaSOT dataset, respectively, utilized 535.85 G and 113.02 G model flops and 308.98 M and 195.40 M parameters for the best results. At the same time, their speeds were only 5.81 and 8.02 FPS [16].…”
Section: Introductionmentioning
confidence: 92%
“…Recently, Yan [33] introduced Transformer into target tracking and achieved good performance. Transformer-based tracking can be divided into two categories [16]. One is CNN Transformer-based trackers [33][34][35][36].…”
Section: Transformer-based Trackermentioning
confidence: 99%
“…And fully-transformer based trackers like SwinTrack [27], MixFormer [19], Pro-ContEXT [28] and VideoTrack [29]. The state-ofthe-art tracker tends to utilize a fully-transformer architecture as it achieves the best performance [30], while CNN transformer architecture offers a balanced performance between speed and accuracy [30].…”
Section: Transformer-based Trackersmentioning
confidence: 99%
“…According to the type of the framework, the SOT algorithms using Transformer can be classified as CNN-Transformer based trackers and fully-Transformer based trackers (Thangavel et al 2023).…”
Section: Transformer-based Trackersmentioning
confidence: 99%