2019
DOI: 10.1007/978-3-030-11009-3_7
|View full text |Cite
|
Sign up to set email alerts
|

Towards a Better Match in Siamese Network Based Visual Object Tracker

Abstract: Recently, Siamese network based trackers have received tremendous interest for their fast tracking speed and high performance. Despite the great success, this tracking framework still suffers from several limitations. First, it cannot properly handle large object rotation. Second, tracking gets easily distracted when the background contains salient objects. In this paper, we propose two simple yet effective mechanisms, namely angle estimation and spatial masking, to address these issues. The objective is to ex… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
38
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
2

Relationship

2
5

Authors

Journals

citations
Cited by 48 publications
(38 citation statements)
references
References 31 publications
0
38
0
Order By: Relevance
“…Every patch of the same size as the target gets a similarity score, and the one with the highest score is identified as the new target location. There are also a great number of follow-up work [16,54,51], among which SA-Siam [18,17] and SiamRPN [27,60] are most related to ours.…”
Section: Related Workmentioning
confidence: 74%
See 2 more Smart Citations
“…Every patch of the same size as the target gets a similarity score, and the one with the highest score is identified as the new target location. There are also a great number of follow-up work [16,54,51], among which SA-Siam [18,17] and SiamRPN [27,60] are most related to ours.…”
Section: Related Workmentioning
confidence: 74%
“…However, it is difficult to attend to both requirements in a single network. SA-Siam [18] and Siam-BM [17] adopt a two-branch network to encode images into two embedding spaces, one for semantic similarity (more robust) and the other for appearance similarity (more discriminative). This typical parallel structure does not take advantage of the innate proposal capability of Figure 2.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, our threebranch variant significantly outperforms the very recent and top performing DaSiamRPN [63], achieving a EAO of 0.380 while running at 55 frames per second. Even without box regression branch, our simpler two-branch variant (SiamMask-2B) achieves a high EAO of 0.334, which is in par with SA Siam R [15] and superior to any other real-time method in the published literature. Finally, in SiamMask-Opt, the strategy proposed in [54] to find the optimal rotated rectangle from a binary mask brings the best overall performance (and a particularly high accuracy), but comes at a significant computational cost.…”
Section: How Much Does the Object Representation Matter?mentioning
confidence: 90%
“…Our method is motivated by the success of fast tracking approaches based on fullyconvolutional Siamese networks [3] trained offline on millions of pairs of video frames (e.g. [28,63,15,60]) and by the very recent availability of YouTube-VOS [58], a large video dataset with pixel-wise annotations. We aim at retaining the offline trainability and online speed of these methods while at the same time significantly refining their representation of the target object, which is limited to a simple axis-aligned bounding box.…”
Section: Initmentioning
confidence: 99%