2022
DOI: 10.48550/arxiv.2201.08673
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Exploring Fusion Strategies for Accurate RGBT Visual Object Tracking

Abstract: We address the problem of multi-modal object tracking in video and explore various options of fusing the complementary information conveyed by the visible (RGB) and thermal infrared (TIR) modalities including pixel-level, feature-level and decision-level fusion. Specifically, different from the existing methods, paradigm of image fusion task is heeded for fusion at pixel level. Featurelevel fusion is fulfilled by attention mechanism with channels excited optionally. Besides, at decision level, a novel fusion s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 62 publications
(117 reference statements)
0
3
0
Order By: Relevance
“…In the VOT2020-RGBT challenge, the core metric to evaluate the tracking performance is EAO. The four trackers used in the comparative experiment are: AFAT [44] with single modality (RGB and infrared) as input, combined RFN and AFAT (RFNT) [15], M2C2Frgbt [40], and the decisionlevel fusion tracker (DFAT) [43].…”
Section: The Tracking Results On Vot2020-rgbtmentioning
confidence: 99%
See 1 more Smart Citation
“…In the VOT2020-RGBT challenge, the core metric to evaluate the tracking performance is EAO. The four trackers used in the comparative experiment are: AFAT [44] with single modality (RGB and infrared) as input, combined RFN and AFAT (RFNT) [15], M2C2Frgbt [40], and the decisionlevel fusion tracker (DFAT) [43].…”
Section: The Tracking Results On Vot2020-rgbtmentioning
confidence: 99%
“…In order to apply our fusion framework to the multimodal object tracking task, a state-of-the-art RGBT tracker (DFAT) [43] is adopted as the baseline tracker. The DFAT won the third place in the evaluation on the public dataset and was the winning tracker in the VOT2020-RGBT challenge.…”
Section: Experiments On Rgbt Object Tracking Taskmentioning
confidence: 99%
“…To employ the temporal continuity in a video sequence, the history information was integrated to obtain fusion features by computing the adaptive weights of previous frames [23]. Tang et al [24] proposed multiple fusion strategies from different perspectives (including pixel-level, feature-level and decision-level) to boost the performance of multi-modal object tracking in video.…”
Section: Feature Aggregation Methods For Rgb-t Object Trackingmentioning
confidence: 99%