2019
DOI: 10.1007/978-3-030-11018-5_57
|View full text |Cite
|
Sign up to set email alerts
|

Deep Learning of Appearance Models for Online Object Tracking

Abstract: This paper introduces a novel deep learning based approach for vision based single target tracking. We address this problem by proposing a network architecture which takes the input video frames and directly computes the tracking score for any candidate target location by estimating the probability distributions of the positive and negative examples. This is achieved by combining a deep convolutional neural network with a Bayesian loss layer in a unified framework. In order to deal with the limited number of p… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
15
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 23 publications
(15 citation statements)
references
References 32 publications
0
15
0
Order By: Relevance
“…Other approaches use covariance matrix representation, pixel comparison representation, SIFT-like features, or pose features [25,83,29,22,50]. Recently, deep neural network architectures have been used for modeling appearance [21,36,84]. In these architectures, high-level features are extracted by convolutional neural networks trained for a specific task.…”
Section: Appearance Modelmentioning
confidence: 99%
“…Other approaches use covariance matrix representation, pixel comparison representation, SIFT-like features, or pose features [25,83,29,22,50]. Recently, deep neural network architectures have been used for modeling appearance [21,36,84]. In these architectures, high-level features are extracted by convolutional neural networks trained for a specific task.…”
Section: Appearance Modelmentioning
confidence: 99%
“…The ImageNet+CF variant employs features taken from a network trained to solve the ImageNet classification challenge [28]. The results show that these features, which are often the first choice for combining CFs with CNNs [7,9,22,26,32,36], are significantly worse than those learned by CFNet and the Baseline experiment. The particularly poor performance of these features at deeper layers is somewhat unsurprising, since these layers are expected to have greater invariance to position when trained for classification.…”
Section: Feature Transfer Experimentsmentioning
confidence: 99%
“…The simplest approach is to disregard the lack of a-priori knowledge and adapt a pre-trained deep convolutional neural network (CNN) to the target, for example by using stochastic gradient descent (SGD), the workhorse of deep network optimization [32,26,36]. The extremely limited training data and large number of parameters make this a difficult learning problem.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Many approaches rely on appearance [17,29,57,59,11,33,47], motion [13], or social cues [20,44]. They are mostly used to associate pairs of detections, and only account for very short-term correlations.…”
Section: Related Workmentioning
confidence: 99%