2012 19th IEEE International Conference on Image Processing
DOI: 10.1109/icip.2012.6467059

Interaction recognition in wide areas using audiovisual sensors

Abstract: We present an event recognition framework to detect interactions among objects, for example people, using a network of cameras and associated microphone pairs. The complementarity of the video and audio modalities is exploited to cover wide areas. In particular, object movements in portions of the scene that are not covered by the cameras' fields of view are estimated using the input from microphones. After estimating trajectories using audio-visual features, we recognize interactions based on a Coupled Hidden…

Citations: cited by 2 publications (1 citation statement)
References: 12 publications
“…Moreover, in order to increase the performance achieved by pure vision systems in real scenarios, multi-modal sensing should be considered [183], [184]. Examples include the integration of 2D laser scanners with cameras [183] or cameras and microphones [185].…”
Section: Discussion (citation type: mentioning)
confidence: 99%