Proceedings of the 15th ACM International Conference on Multimedia 2007
DOI: 10.1145/1291233.1291388
|View full text |Cite
|
Sign up to set email alerts
|

Audio-visual multi-person tracking and identification for smart environments

Abstract: This paper presents a novel system for the automatic and unobtrusive tracking and identification of multiple persons in an indoor environment. Information from several fixed cameras is fused in a particle filter framework to simultaneously track multiple occupants. A set of steerable fuzzycontrolled pan-tilt-zoom cameras serves to smoothly track persons of interest and opportunistically capture facial closeups for face identification. In parallel, speech segmentation, sound source localization and speaker iden… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
26
0

Year Published

2008
2008
2012
2012

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 31 publications
(26 citation statements)
references
References 23 publications
0
26
0
Order By: Relevance
“…In this context, robust and accurate tracking results have been demonstrated, e.g. [2,3], and current goals are to recover the pose of objects in addition to their localization, exploit other modalities such as audio [4], and characterize people activities. Another set of environement are open spaces.…”
Section: Key Factors and Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…In this context, robust and accurate tracking results have been demonstrated, e.g. [2,3], and current goals are to recover the pose of objects in addition to their localization, exploit other modalities such as audio [4], and characterize people activities. Another set of environement are open spaces.…”
Section: Key Factors and Related Workmentioning
confidence: 99%
“…Alternatively, one can represent people using sets of local templates or patches and geometric information, as proposed by Leibe et al [29]. In addition, other modalities such as audio microphone arrays can be used for localization, especially in smart rooms [4].…”
Section: Key Factors and Related Workmentioning
confidence: 99%
“…More recently Bernardin and Stiefelhagen [2] have implemented a system for the simultaneous tracking and incremental multimodal identification of multiple users in a smart environment which fuses person track information, localized speaker ID and high definition visual ID cues opportunistically to gradually refine the global scene model and thus increase the system's confidence in the set of recognized identities. The improvements of combining acoustic features and 2D face images for identification of participants in a smart room environment are discussed in [15].…”
Section: Biometrics In Smart Environmentsmentioning
confidence: 99%
“…In this context, robust and accurate tracking results have been demonstrated, e.g. [Bernardin et al, 2006, Fleuret et al, 2008, and current goals consists of improving tracking robustness under higher crowding levels, recovering the pose of objects in addition to their localization, exploiting other modalities such as audio [Bernardin and Stiefelhagen, 2007], and characterizing people activities.…”
Section: Introductionmentioning
confidence: 99%