Automatic adaptation of a generic pedestrian detector to a specific traffic scene

Wang, Meng; Wang, Xiaogang

doi:10.1109/cvpr.2011.5995698

Cited by 190 publications

(171 citation statements)

References 22 publications

Supporting

Mentioning

168

Contrasting

Unclassified

Order By: Relevance

“…As can be observed in Section 4.3, the overwhelming majority of the state-of-the-art research for domain adaptation of object detectors in videos use self-training in one form or another [29,30,31,32,33,36,37,38,40,42,11,10]. In order to adapt a generic pedestrian detector to a specific scene, a typical system would run the generic detector on some frames in a video, then score each detection using some heuristics and afterwards, add the most confident positive and negative detections to the original dataset for retraining.…”

Section: Discussionmentioning

confidence: 99%

“…In each iteration, positive and negative examples are collected by filtering with a variety of cues, added to the current dataset and a new classifier is trained. Figure taken from [11]. many situations.…”

Section: Domain Adaptation For Object Detection In Videosmentioning

confidence: 99%

“…As with other iterative self-training algorithms, the algorithm requires setting the threshold for selecting confident detections. Moreover, the system requires the use of multiple source domains which may not be feasible in Figure 12: An iterative self-training technique of Wang and Wang [11]. In each iteration, positive and negative examples are collected by filtering with a variety of cues, added to the current dataset and a new classifier is trained.…”

Section: Domain Adaptation For Object Detection In Videosmentioning

confidence: 99%

“…The method proposed by Wang and Wang [11] iteratively improves a generic pedestrian detector by selecting new confident examples to add to the current dataset for retraining at every iteration. In order to collect examples for each self-training iteration, their oracle is a combination of vehicle and pedestrian paths, multiple different cues such as bounding box locations and sizes, background subtraction, thresholds, filters and hier-archical clustering.…”

Section: Domain Adaptation For Object Detection In Videosmentioning

confidence: 99%

“…(a) CUHK Square dataset [10] (b) MIT Traffic dataset [11] (c) PETS 2009 dataset [12] Figure 2: Random samples from scene-specific pedestrian datasets (only pedestrians, i.e. positive examples, are shown).…”

Section: Scene-specific Detectorsmentioning

confidence: 99%

See 4 more Smart Citations

Adapting pedestrian detectors to new domains: A comprehensive review

Htike

Hogg

2016

Engineering Applications of Artificial Intelligence

View full text Add to dashboard Cite

Successful detection and localisation of pedestrians is an important goal in computer vision which is a core area in Artificial Intelligence. State-of-the-art pedestrian detectors proposed in literature have reached impressive performance on certain datasets. However, it has been pointed out that these detectors tend not to perform very well when applied to specific scenes that differ from the training datasets in some ways. Due to this, domain adaptation approaches have recently become popular in order to adapt existing detectors to new domains to improve the performance in those domains. There is a real need to review and analyse critically the state-of-the-art domain adaptation algorithms, especially in the area of object and pedestrian detection. In this paper, we survey the most relevant and important state-of-the-art results for domain adaptation for image and video data, with a particular focus on pedestrian detection. Related areas to domain adaptation are also included in our review and we make observations and draw conclusions from the representative papers and give practical recommendations on which methods should be preferred in different situations that practitioners may encounter in real-life.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Domain Adaptation For Object Detection In Videosmentioning

confidence: 99%

Section: Domain Adaptation For Object Detection In Videosmentioning

confidence: 99%

Section: Domain Adaptation For Object Detection In Videosmentioning

confidence: 99%

Section: Scene-specific Detectorsmentioning

confidence: 99%

See 3 more Smart Citations

Adapting pedestrian detectors to new domains: A comprehensive review

Htike

Hogg

2016

Engineering Applications of Artificial Intelligence

View full text Add to dashboard Cite

show abstract

Crowd Counting and Profiling: Methodology and Evaluation

Loy

Chen

Gong

et al. 2013

The International Series in Video Computing

171

View full text Add to dashboard Cite

Video imagery based crowd analysis for population profiling and density estimation in public spaces can be a highly effective tool for establishing global situational awareness. Different strategies such as counting by detection and counting by clustering have been proposed, and more recently counting by regression has also gained considerable interest due to its feasibility in handling relatively more crowded environments. However, the scenarios studied by existing regression-based techniques are rather diverse in terms of both evaluation data and experimental settings. It can be difficult to compare them in order to draw general conclusions on their effectiveness. In addition, contributions of individual components in the processing pipeline such as feature extraction and perspective normalisation remain unclear and less well studied. This study describes and compares the state-of-the-art methods for video imagery based crowd counting, and provides a systematic evaluation of different methods using the same protocol. Moreover, we evaluate critically each processing component to identify potential bottlenecks encountered by existing techniques. Extensive evaluation is conducted on three public scene datasets, including a new shopping centre environment with labelled ground truth for validation. Our study reveals new insights into solving the problem of crowd analysis for population profiling and density estimation, and considers open questions for future studies.

show abstract

Where Are the Blobs: Counting by Localization with Point Supervision

Laradji

Rostamzadeh²,

Pinheiro³

et al. 2018

Lecture Notes in Computer Science

175

119

View full text Add to dashboard Cite

Object counting is an important task in computer vision due to its growing demand in applications such as surveillance, traffic monitoring, and counting everyday objects. State-of-the-art methods use regression-based optimization where they explicitly learn to count the objects of interest. These often perform better than detection-based methods that need to learn the more difficult task of predicting the location, size, and shape of each object. However, we propose a detectionbased method that does not need to estimate the size and shape of the objects and that outperforms regression-based methods. Our contributions are three-fold: (1) we propose a novel loss function that encourages the network to output a single blob per object instance using pointlevel annotations only; (2) we design two methods for splitting large predicted blobs between object instances; and (3) we show that our method achieves new state-of-the-art results on several challenging datasets including the Pascal VOC and the Penguins dataset. Our method even outperforms those that use stronger supervision such as depth features, multi-point annotations, and bounding-box labels.

show abstract

Automatic adaptation of a generic pedestrian detector to a specific traffic scene

Cited by 190 publications

References 22 publications

Adapting pedestrian detectors to new domains: A comprehensive review

Adapting pedestrian detectors to new domains: A comprehensive review

Crowd Counting and Profiling: Methodology and Evaluation

Where Are the Blobs: Counting by Localization with Point Supervision

Contact Info

Product

Resources

About