Guiding Visual Surveillance by Tracking Human Attention

Benfold, Ben; Reid, Ian

doi:10.5244/c.23.14

Cited by 87 publications

(65 citation statements)

References 17 publications

(20 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…corresponding facial landmarks such as eyes and lips to a set of trained poses. Recent studies have attempted to estimate head pose in low-resolution images [8] as well as crowded surveillance videos [52]. In addition to head pose, body posture configuration [46] and gait [49] may also play an important role in human intent inference.…”

Section: Intent Profilingmentioning

confidence: 99%

“…Alert is generated for an immediate review by security personnel if multiple persons enter a restricted area while only one of them is authorised by the access control system, e.g. Mate video analytics 8 . 4 A set of real-world datasets and alarm definitions are released as the Image Library for Intelligent Detection Systems (i-LIDS), a UK government Home Office Scientific Development Branch (HOSDB) benchmark for video analytics systems [63], which has also been adopted by the US National Institute of Standards and Technology (NIST).…”

Section: Current Systemsmentioning

confidence: 99%

See 1 more Smart Citation

Security and Surveillance

Gong

Loy

Xiang

2011

Visual Analysis of Humans

View full text Add to dashboard Cite

Human eyes are highly efficient devices for scanning through a large quantity of low-level visual sensory data and delivering selective information to one's brain for high-level semantic interpretation and gaining situational awareness. Over the last few decades, the computer vision community has endeavoured to bring about similar perceptual capabilities to artificial visual sensors. Substantial efforts have been made towards understanding static images of individual objects and the corresponding processes in the human visual system. This endeavour is intensified further by the need for understanding a massive quantity of video data, with the aim to comprehend multiple entities not only within a single image but also over time across multiple video frames for understanding their spatio-temporal relations. A significant application of video analysis and understanding is intelligent surveillance, which aims to interpret automatically human activity and detect unusual events that could pose a threat to public security and safety.

show abstract

Section: Intent Profilingmentioning

confidence: 99%

Section: Current Systemsmentioning

confidence: 99%

Security and Surveillance

Gong

Loy

Xiang

2011

Visual Analysis of Humans

View full text Add to dashboard Cite

show abstract

“…For example, in [65], and, independently, in [66] the idea was to infer what part of the scene is seen more frequently by people, thus creating a sort of interest maps. This may serve to highlight individuals that are focused on particular portions of the environment for a long time: if the observed target is critical (for example, an ATM machine) a threatening behavior could be inferred (PROBLEM 1).…”

Section: Face and Gaze Behaviormentioning

confidence: 99%

Human behavior analysis in video surveillance: A Social Signal Processing perspective

et al. 2013

View full text Add to dashboard Cite

The analysis of human activities is one of the most intriguing and important open issues for the automated video surveillance community. Since few years ago, it has been handled following a mere Computer Vision and Pattern Recognition perspective, where an activity corresponded to a temporal sequence of explicit actions (run, stop, sit, walk, etc.). Even under this simplistic assumption, the issue is hard, due to the strong diversity of the people appearance, the number of individuals considered (we may monitor single individuals, groups, crowd), the variability of the environmental conditions (indoor/outdoor, different weather conditions), and the kinds of sensors employed. More recently, the automated surveillance of human activities has been faced considering a new perspective, that brings in notions and principles from the social, affective, and psychological literature, and that is called Social Signal Processing (SSP). SSP employs primarily nonverbal cues, most of them are outside of conscious awareness, like face expressions and gazing, body posture and gestures, vocal characteristics, relative distances in the space and the like. This paper is the first review analyzing this new trend, proposing a structured snapshot of the state of the art and envisaging novel challenges in the surveillance domain where the cross-pollination of Computer Science technologies and Sociology theories may offer valid investigation strategies.

show abstract

“…These benchmarks are composed of training and testing sets [36][37][38][39][40][41] with public detections, given by Aggregate Channel Features (ACF) pedestrian detector [42] in the case of the 2DMOT2015 and a Deformable Part Model (DPM) [43] for the MOT16. The metrics employed by these benchmarks are based on the widely accepted CLEARMOT metrics [44].…”

Section: Benchmarks Metrics and Parameter Tuningmentioning

confidence: 99%

Improving Multi-frame Data Association with Sparse Representations for Robust Near-online Multi-object Tracking

Fagot-Bouquet

Audigier

Dhome

et al. 2016

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Abstract. Multiple Object Tracking still remains a difficult problem due to appearance variations and occlusions of the targets or detection failures. Using sophisticated appearance models or performing data association over multiple frames are two common approaches that lead to gain in performances. Inspired by the success of sparse representations in Single Object Tracking, we propose to formulate the multi-frame data association step as an energy minimization problem, designing an energy that efficiently exploits sparse representations of all detections. Furthermore, we propose to use a structured sparsity-inducing norm to compute representations more suited to the tracking context. We perform extensive experiments to demonstrate the effectiveness of the proposed formulation, and evaluate our approach on two public authoritative benchmarks in order to compare it with several state-of-the-art methods.

show abstract

Guiding Visual Surveillance by Tracking Human Attention

Cited by 87 publications

References 17 publications

Security and Surveillance

Security and Surveillance

Human behavior analysis in video surveillance: A Social Signal Processing perspective

Improving Multi-frame Data Association with Sparse Representations for Robust Near-online Multi-object Tracking

Contact Info

Product

Resources

About