2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.41
|View full text |Cite
|
Sign up to set email alerts
|

Tracking the Untrackable: Learning to Track Multiple Cues with Long-Term Dependencies

Abstract: The majority of existing solutions to the Multi-Target Tracking (MTT) problem do not combine cues in a coherent end-to-end fashion over a long period of time. However, we present an online method that encodes long-term temporal dependencies across multiple cues. One key challenge of tracking methods is to accurately track occluded targets or those which share similar appearance properties with surrounding objects. To address this challenge, we present a structure of Recurrent Neural Networks (RNN) that jointly… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

4
406
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 491 publications
(410 citation statements)
references
References 81 publications
4
406
0
Order By: Relevance
“…One pronounced example is ROLO [33], which uses YOLOv1 as its feature extractor, combined with LSTMs. Similarly, [34] uses VGG-16 for feature extraction and inputs the 500x1 feature vector into an LSTM. LSTM networks have been shown to provide lower Mean Squared Error in single object and fewer ID switches in multi-object tasks.…”
Section: Related Work a Pedestrian Detection Re-identificationmentioning
confidence: 99%
“…One pronounced example is ROLO [33], which uses YOLOv1 as its feature extractor, combined with LSTMs. Similarly, [34] uses VGG-16 for feature extraction and inputs the 500x1 feature vector into an LSTM. LSTM networks have been shown to provide lower Mean Squared Error in single object and fewer ID switches in multi-object tasks.…”
Section: Related Work a Pedestrian Detection Re-identificationmentioning
confidence: 99%
“…Furthermore, it is worth noting that an association/correspondence strategy based only on object locations is likely to fail to track humans. In this case, more elaborated models considering explicitly the appearance should be taken into account as, for instance, using bi-directional long short-term memories to handle appearance changes [41,42].…”
Section: Object Tracking and Final Augmented Representationmentioning
confidence: 99%
“…Our approach to cell tracking is motivated by the now classic work of Jaqaman et al 32 and recent work applying deep learning to object tracking 33 . In these works, object tracking is treated as a linear assignment problem (Figure 2a).…”
Section: Tracking Single Cells With Deep Learning and Linear Programmingmentioning
confidence: 99%
“…Here, we take a supervised deep learning approach to learn an optimal cost function for the linear assignment framework. Our approach was inspired by previous work applying deep learning to object tracking 33 . Building on this work, we make adaptations to deal with the unique features of live-cell imaging data ( Figure 1c).…”
Section: Tracking Single Cells With Deep Learning and Linear Programmingmentioning
confidence: 99%