Proceedings of the 7th International Conference on Multimodal Interfaces 2005
DOI: 10.1145/1088463.1088477
|View full text |Cite
|
Sign up to set email alerts
|

A joint particle filter for audio-visual speaker tracking

Abstract: In this paper, we present a novel approach for tracking a lecturer during the course of his speech. We use features from multiple cameras and microphones, and process them in a joint particle filter framework. The filter performs sampled projections of 3D location hypotheses and scores them using features from both audio and video. On the video side, the features are based on foreground segmentation, multiview face detection and upper body detection. On the audio side, the time delays of arrival between pairs … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
39
0

Year Published

2006
2006
2010
2010

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 78 publications
(39 citation statements)
references
References 24 publications
(25 reference statements)
0
39
0
Order By: Relevance
“…Most proposed systems for this type of application rely on continuous tracking of identified people in the room. Important approaches that are previously proposed and successfully used are Kalman filters [25] and particle filters [34]. However, tracking multiple people for a long period is difficult, there is little possibility of recovering from mistakes.…”
Section: Feature Based Identification Modulementioning
confidence: 99%
See 2 more Smart Citations
“…Most proposed systems for this type of application rely on continuous tracking of identified people in the room. Important approaches that are previously proposed and successfully used are Kalman filters [25] and particle filters [34]. However, tracking multiple people for a long period is difficult, there is little possibility of recovering from mistakes.…”
Section: Feature Based Identification Modulementioning
confidence: 99%
“…They were previously employed for audio-visual multi-person tracking [20,55], multimodal event detection [54] and for active speaker tracking [14,34] successfully. In [55] the PF model incorporates the 3D location and velocity of tracked objects, and also employed for calibrating the sensors.…”
Section: Particle Filter-based Trackingmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, over the last few years, the particle filter framework is reported to be effective for tracking people [1,2,3,4,5,6,7,8,9,10,12,13]. The particle filter is a Bayesian sequential importance sampling technique, which recursively approximates the posterior distribution using a finite set of weighted samples.…”
Section: Introductionmentioning
confidence: 99%
“…In [8,9] particle filters are applied to track persons in 3D using a top-down approach. Each particle is evaluated for all available cameras by using cascaded classifiers as proposed by Viola and Jones [10].…”
Section: Related Workmentioning
confidence: 99%