2011 IEEE/RSJ International Conference on Intelligent Robots and Systems 2011
DOI: 10.1109/iros.2011.6094558
|View full text |Cite
|
Sign up to set email alerts
|

Intelligent Sound Source Localization and its application to multimodal human tracking

Abstract: We have assessed robust tracking of humans based on intelligent Sound Source Localization (SSL) for a robot in a real environment. SSL is fundamental for robot audition, but has three issues in a real environment: robustness against noise with high power, lack of a general framework for selective listening to sound sources, and tracking of inactive and/or noisy sound sources. To address the first issue, we extended Multiple SIgnal Classification by incorporating Generalized EigenValue Decomposition (GEVD-MUSIC… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
25
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 45 publications
(25 citation statements)
references
References 10 publications
0
25
0
Order By: Relevance
“…In such a case, it can be difficult to easily identify the noise and signal spaces from the eigenvalue decomposition of the array cross-correlation matrix Γ M . For this reason, the GEVD-MUSIC (Generalized Eigen Value Decomposition-MUSIC) is proposed [66]. It consists in defining an additional freely-tunable correlation matrix Γ N for the frequency k 0 , and solving the new GEVD problem…”
Section: Musicmentioning
confidence: 99%
“…In such a case, it can be difficult to easily identify the noise and signal spaces from the eigenvalue decomposition of the array cross-correlation matrix Γ M . For this reason, the GEVD-MUSIC (Generalized Eigen Value Decomposition-MUSIC) is proposed [66]. It consists in defining an additional freely-tunable correlation matrix Γ N for the frequency k 0 , and solving the new GEVD problem…”
Section: Musicmentioning
confidence: 99%
“…First, the location information of the persons can be used as multi-modal sensing and analysis of poster conversations with smart posterboard a constraint in the speaker localization. This is a straightforward multi-modal integration in speaker diarization [8,9]. In this study, furthermore, we investigate the use of eyegaze information for speaker diarization as it is shown in Section IV that eye-gaze information is useful for predicting turn-taking by the audience.…”
Section: B) Multi-modal Sensingmentioning
confidence: 99%
“…Acoustic source localization has been largely restricted to estimating azimuth [26][27][28][29][30][31][32][33] on the assumption of zero elevation, except where audition has been fused with vision for estimates also of elevation [34,35,37,38]. Information gathered as the head is turned has been exploited either to locate the azimuth at which ITD reduces to zero thereby determining the azimuthal direction to a source, or to resolve the front-back ambiguity associated with estimating only azimuth [28][29][30][31][32][33][34]39,40].…”
Section: Introductionmentioning
confidence: 99%