2019
DOI: 10.1109/jstsp.2019.2903472
|View full text |Cite
|
Sign up to set email alerts
|

Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments

Abstract: We address the problem of online localization and tracking of multiple moving speakers in reverberant environments. The paper has the following contributions. We use the direct-path relative transfer function (DP-RTF), an interchannel feature that encodes acoustic information robust against reverberation, and we propose an online algorithm well suited for estimating DP-RTFs associated with moving audio sources. Another crucial ingredient of the proposed method is its ability to properly assign DP-RTFs to audio… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
43
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 39 publications
(44 citation statements)
references
References 38 publications
1
43
0
Order By: Relevance
“…The present work is also related to previous work in localizing sounds in visual inputs [20,14,22,9,8,24,4,35], which aims to identify which pixels in a video are associated with an object making a particular sound.…”
Section: Sound Localizationmentioning
confidence: 97%
“…The present work is also related to previous work in localizing sounds in visual inputs [20,14,22,9,8,24,4,35], which aims to identify which pixels in a video are associated with an object making a particular sound.…”
Section: Sound Localizationmentioning
confidence: 97%
“…The proposed vM-VEM tracker yields the lowest false alarm (FA) rate of 5.9% and MAE of 2.6, and the second lowest MD rate of 23.9%. The GM-FO variant of [16] yields an MD rate of 22.3% since it uses velocity information to smooth the trajectories. This illustrates the advantage of the von-Mises distribution to model directional data (DOA).…”
Section: Experimental Evaluationmentioning
confidence: 99%
“…By using the fact that the prior distribution on z tm is denoted by p(z tm = n) = π n , we can now write the a posteriori distribution as q(z tm = n) ∝ π n β tmn with: β tmn = ω tm κ y A(ω tm κ y ) cos(y tm − µ tn ) n = 0 1/2π n = 0 , thus leading to the results in (16) and (3).…”
Section: Appendix B Derivation Of the E-z Stepmentioning
confidence: 99%
See 2 more Smart Citations