Learning a Robust Society of Tracking Parts Using Co-occurrence Constraints

Burceanu, Elena; Leordeanu, Marius

doi:10.1007/978-3-030-11009-3_9

Cited by 9 publications

(8 citation statements)

References 42 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…VOT‐2017 dataset : In the more important and difficult benchmark, VOT‐2017 [25], several trackers are compared with our tracker, these trackers include the very top state‐of‐the‐art methods: STP [42], CFWCR [43], convolutional features for correlation filters (CFCF) [57], ECO [45], CCOT [44] and other recent methods: RCPF [58], unified convolutional tracker (UCT) [59], SPCT [60], SiamFC [33], Staple [1] and DPT [49].…”

Section: Methodsmentioning

confidence: 99%

“…VOT-2016 dataset: We compare our tracker with 22 state-ofthe-art trackers on the VOT-2016 benchmark including society of tracking parts (STP) [42], CFWCR [43], ECO [45], CCOT [44], tree-structured convolutional neural network (TCNN) [48], SSAT [24], DPT [49], SiamFC [33], deepMKCF [50], new scale adaptive and multiple feature (NSAMF) [51], colour-aware complex cell tracker (CCCT) [52], structure output deep learning tracker (SO-DLT) [31], HCF [20], DAT [53], scale adaptive mean-shift (ASMS) [54], KCF [4], SAMF [51], DSST [3], tracking with Gaussian processes regression (TGPR) [55], multiple instance learning (MIL) [16], structured output tracking with kernels (STRUCK) [17] and incremental learning for visual tracking (IVT) [56]. Table 2 shows the results of our tracker and other trackers.…”

Section: State-of-the-art Comparisonmentioning

confidence: 99%

See 1 more Smart Citation

Adaptive convolutional layer selection based on historical retrospect for visual tracking

Tang

Zhang

et al. 2019

IET Computer Vision

View full text Add to dashboard Cite

Visual tracking has recently gained a great advance with the use of the convolutional neural network (CNN). Usually, existing CNN-based trackers exploit the features from a single layer or a certain combination of multiple layers. However, these features only characterise an object from an invariable aspect and cannot adapt to scene variation, which limits the performance of such trackers. To overcome this limitation, the authors study the problem from a new perspective and propose a novel convolutional layer selection method. To obtain robust appearance representation, they investigate the advantages of features extracted from different convolutional layers. To determine the correctness of the tracking prediction and updated model, they design a verification mechanism based on historical retrospect, which can estimate the deviation for each layer by bidirectionally locating the target. Meanwhile, the deviation works as the layer-wise selection criteria. Extensive evaluations on the OTB-2013, visual object tracking (VOT)-2016 and VOT-2017 benchmarks demonstrate that the proposed tracker performs favourably against several state-of-the-art trackers. The results are presented in terms of EAO, Av and Rv. The best, second and third results are marked underline, bold italic, italic, respectively. Bold represents our method and results.

show abstract

Section: Methodsmentioning

confidence: 99%

Section: State-of-the-art Comparisonmentioning

confidence: 99%

Adaptive convolutional layer selection based on historical retrospect for visual tracking

Tang

Zhang

et al. 2019

IET Computer Vision

View full text Add to dashboard Cite

show abstract

“…Using the orientation and magnitude of extracted tracklets, one-dimensional descriptors were derived and fed into one-class support vector machine (SVM) classifier for abnormality detection. Recently, Burceanu and Leordeanu [9] proposed a neural network object tracker with two pathways; the FilterParts and the ConvNetPart. The first pathway is robust to background noises while the second one is robust to object appearance changes over time.…”

Section: Related Workmentioning

confidence: 99%

Graph-based topic models for trajectory clustering in crowd videos

Ghamdi

Gotoh

2020

Machine Vision and Applications

View full text Add to dashboard Cite

Probabilistic topic modelings, such as latent Dirichlet allocation (LDA) and correlated topic models (CTM), have recently emerged as powerful statistical tools for processing video content. They share an important property, i.e., using a common set of topics to model all data. However such property can be too restrictive for modeling complex visual data such as crowd scenes where multiple fields of heterogeneous data jointly provide rich information about objects and events. This paper proposes graphbased extensions of LDA and CTM, referred to as GLDA and GCTM, to learn and analyze motion patterns by trajectory clustering in a highly cluttered and crowded environment. Unlike previous works that relied on a scene prior, we apply a spatio-temporal graph (STG) to uncover the spatial and temporal coherence between the trajectories of crowd motion during the learning process. The presented models advance the conventional approaches by integrating a manifold-based clustering as initialization and iterative statistical inference as optimization. The output of GLDA and GCTM are mid-level features that represent the motion patterns used later to generate trajectory clusters. Experiments on three different datasets show the effectiveness of the approaches in trajectory clustering and crowd motion modeling.

show abstract

“…• Robust target representation: Providing a powerful target representation is the main advantage of employing CNNs for visual tracking. To achieve the goal of learning generic representations for target modeling and constructing a more robust target models, the main contributions of methods are classified into: i) offline training of CNNs on large-scale datasets for visual tracking [63], [68], [80], [89], [97], [100], [101], [104], [112], [116], [135], [137], [142], [144], [153], [165], [168], [169], [173], ii) designing specific deep convolutional networks instead of employing pre-trained models [63], [68], [70], [72], [73], [75], [76], [80], [82], [89], [97], [100], [101], [104], [105], [108], [112], [116], [127], [135], [137], [141], [142], [144], [146], [150],…”

Section: Convolutional Neural Network (Cnn)mentioning

confidence: 99%

Deep Learning for Visual Tracking: A Comprehensive Survey

Marvasti-Zadeh,

Cheng,

Ghanei-Yakhdan

et al. 2019

Preprint

View full text Add to dashboard Cite

Visual target tracking is one of the most sought-after yet challenging research topics in computer vision. Given the ill-posed nature of the problem and its popularity in a broad range of real-world scenarios, a number of large-scale benchmark datasets have been established, on which considerable methods have been developed and demonstrated with significant progress in recent yearspredominantly by recent deep learning (DL)-based methods. This survey aims to systematically investigate the current DL-based visual tracking methods, benchmark datasets, and evaluation metrics. It also extensively evaluates and analyzes the leading visual tracking methods. First, the fundamental characteristics, primary motivations, and contributions of DL-based methods are summarized from six key aspects of: network architecture, network exploitation, network training for visual tracking, network objective, network output, and the exploitation of correlation filter advantages. Second, popular visual tracking benchmarks and their respective properties are compared, and their evaluation metrics are summarized. Third, the state-of-the-art DL-based methods are comprehensively examined on a set of well-established benchmarks of OTB2013, OTB2015, VOT2018, and LaSOT. Finally, by conducting critical analyses of these stateof-the-art methods both quantitatively and qualitatively, their pros and cons under various common scenarios are investigated. It may serve as a gentle use guide for practitioners to weigh on when and under what conditions to choose which method(s). It also facilitates a discussion on ongoing issues and sheds light on promising research directions.

show abstract

Learning a Robust Society of Tracking Parts Using Co-occurrence Constraints

Cited by 9 publications

References 42 publications

Adaptive convolutional layer selection based on historical retrospect for visual tracking

Adaptive convolutional layer selection based on historical retrospect for visual tracking

Graph-based topic models for trajectory clustering in crowd videos

Deep Learning for Visual Tracking: A Comprehensive Survey

Contact Info

Product

Resources

About