Self-taught learning of a deep invariant representation for visual tracking via temporal slowness principle

Kuen, Jason; Lim, Kian Ming; Lee, Chin Poo

doi:10.1016/j.patcog.2015.02.012

Cited by 36 publications

(24 citation statements)

References 43 publications

(58 reference statements)

“…For complex applications, however, imbedding proposed models in current security systems becomes necessary, such as compressive sensing for sparse tracking [18] (it can be improved as locally compressive sensing within ROI), VIBE algorithm for real-time object detection from a moving camera [19], Adaboost algorithm for noise-detection in ROI [20], optical flow for robots' recognition of environments [21], SVM clustering for accidents classification [22], deep learning algorithms for anomaly detection, crowd analysis, and hierarchical tracking within ROI [23][24][25][26][27]. Objects understanding and detection in dynamic environment changes are usually based on the adaptive background subtraction and other objects recognition methods [17,21,35][65][66][67][68].…”

Section: Simulation and Discussion (mentioning)

confidence: 99%

“…Numerous algorithms have been developed to tackle video recognition challenges in various environments; however, a full understanding of environmental implications to video recognition efficiency demands learning models with universal significance (ignoring uncontrolled differences in real scenarios) [18][19][20][21][22][23][24][25][26][27]. That is the essential reason why the current online algorithms, even for latest algorithms, for example, the latest models for tackling crowd segmentation for the high-dimensional, large-scale anomaly detection, still encounter considerable uncertainties [23,24].…”

Section: Introduction (mentioning)

confidence: 99%

“…How to evaluate and compute the regulated attention in implications of the surrounding environment changes and, furthermore, how to define the warning level of EC-implications to video recognition efficiency should be research priorities towards precision security in intelligent surveillance [21][22][23][24][25][26][27]. It has been widely recognized that video surveillance should consider the implications of surrounding environments changes to video surveillance, but until now, there are still no models for a universal evaluation of EC-implications to video recognition efficiency [4][12][13][14][15][16][17][18][19][20][21][22][23][24][25][26][27]. To solve the unresolved issues associated with uncontrolled EC-implications, various novel optimization models were proposed and further applied in current learning systems [13][14][15].…”

Section: Introduction (mentioning)

confidence: 99%

Precision Security: Integrating Video Surveillance with Surrounding Environment Changes

et al. 2018

Video surveillance plays a vital role in maintaining the social security although, until now, large uncertainty still exists in danger understanding and recognition, which can be partly attributed to intractable environment changes in the backgrounds. This article presents a brain-inspired computing of attention value of surrounding environment changes (EC) with a processes-based cognition model by introducing a ratio value of EC-implications within considered periods. Theoretical models for computation of warning level of EC-implications to the universal video recognition efficiency (quantified as time cost of implication-ratio variations from to +1 , = 1, 2, . . .) are further established. Imbedding proposed models into the online algorithms is suggested as a future research priority towards precision security for critical applications and, furthermore, schemes for a practical implementation of such integration are also preliminarily discussed.

“…Indeed, it is more effective to exploit target specific representations through a learning process rather than using a fixed set of pre-defined features [178]. Deep neural networks, especially convolutional neural networks (CNN), have recently been proposed for moving object tracking, which effectively use category specific features for tracking and show some promising results even in the case of complex moving camera [179,180,181].…”

Section: Extract Target Features (mentioning)

confidence: 99%

New trends on moving object detection in video images captured by a moving camera: A survey

Yazdi; Bouwmans. Computer Science Review, 2018.

195

This paper presents a survey on the latest methods of moving object detection in video sequences captured by a moving camera. Although many studies have reviewed the methods of object detection and background subtraction for a fixed camera, no survey presents a complete review of the existing methods for the moving-camera case. Most methods in this field can be classified into four categories: modelling based background subtraction, trajectory classification, low rank and sparse matrix decomposition, and object tracking. We discuss each category in detail and present the main methods that proposed improvements to the general concept of the techniques. We also present challenges and main concerns in this field as well as performance metrics and some benchmark databases available to evaluate the performance of different moving object detection algorithms.

“…Although the currently used hand-crafted features produce acceptable tracking results, it is always preferred to leverage more descriptive features. Therefore, it is more beneficial to exploit target-specific representations through a learning process rather than using a fixed set of pre-defined features [23].…”

Section: Introduction (mentioning)

confidence: 99%

Deep Learning of Appearance Models for Online Object Tracking

Zhai; Chen; Mori; et al. Lecture Notes in Computer Science, 2019.

This paper introduces a novel deep learning based approach for vision based single target tracking. We address this problem by proposing a network architecture which takes the input video frames and directly computes the tracking score for any candidate target location by estimating the probability distributions of the positive and negative examples. This is achieved by combining a deep convolutional neural network with a Bayesian loss layer in a unified framework. In order to deal with the limited number of positive training examples, the network is pre-trained offline for a generic image feature representation and then is fine-tuned in multiple steps. An online fine-tuning step is carried out at every frame to learn the appearance of the target. We adopt a two-stage iterative algorithm to adaptively update the network parameters and maintain a probability density for target/non-target regions. The tracker has been tested on the standard tracking benchmark and the results indicate that the proposed solution achieves state-of-the-art tracking results.
