2021
DOI: 10.48550/arxiv.2112.07957
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

FEAR: Fast, Efficient, Accurate and Robust Visual Tracker

Abstract: We present FEAR, a novel, fast, efficient, accurate, and robust Siamese visual tracker. We introduce an architecture block for object model adaption, called dual-template representation, and a pixel-wise fusion block to achieve extra flexibility and efficiency of the model. The dualtemplate module incorporates temporal information with only a single learnable parameter, while the pixel-wise fusion block encodes more discriminative features with fewer parameters compared to standard correlation modules. By plug… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 51 publications
(141 reference statements)
0
2
0
Order By: Relevance
“…As shown in Table 8, there are gaps between transformer based models and our ResNet-based model (CLNet*-BAN), since former ones have stronger backbone. Besides, we also show the results of several efficient trackers: E.T.Track [91], LightTrack [92], and FEAR-L [93], which have real-time speeds on the edge-platforms. In future work, we will try to incorporate our CLNet into these recent models with more efficient backbones, to explore a more generalized and practical solution.…”
Section: More Discussionmentioning
confidence: 99%
“…As shown in Table 8, there are gaps between transformer based models and our ResNet-based model (CLNet*-BAN), since former ones have stronger backbone. Besides, we also show the results of several efficient trackers: E.T.Track [91], LightTrack [92], and FEAR-L [93], which have real-time speeds on the edge-platforms. In future work, we will try to incorporate our CLNet into these recent models with more efficient backbones, to explore a more generalized and practical solution.…”
Section: More Discussionmentioning
confidence: 99%
“…In [36], they suggested adaptively employing the level set segmentation and bounding box regression techniques to achieve a tight enclosing box, and designing a CNN to determine if the target is occluded. Recent attempts [37][38][39][40][41] have been made to realize higher performance of object tracking by using new schemes.…”
Section: Deep Visual Trackingmentioning
confidence: 99%