2022 30th European Signal Processing Conference (EUSIPCO) 2022
DOI: 10.23919/eusipco55093.2022.9909764
|View full text |Cite
|
Sign up to set email alerts
|

Binaural source localization using deep learning and head rotation information

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
2

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 23 publications
0
1
0
Order By: Relevance
“…To the authors' best knowledge, no study has been done on regression-based binaural DOA estimation that would be tested for a large number of rooms and a full range of azimuth angles. We investigated binaural localization in a scenario with a rotating head, showing that head rotation information significantly improves the estimation precision [44]. This work is a direct continuation of that study.…”
Section: A Sound Source Localizationmentioning
confidence: 81%
See 3 more Smart Citations
“…To the authors' best knowledge, no study has been done on regression-based binaural DOA estimation that would be tested for a large number of rooms and a full range of azimuth angles. We investigated binaural localization in a scenario with a rotating head, showing that head rotation information significantly improves the estimation precision [44]. This work is a direct continuation of that study.…”
Section: A Sound Source Localizationmentioning
confidence: 81%
“…Although they demonstrated the benefit of using motion-based cues in binaural DOAE systems, the investigation was limited to azimuth rotation angles comprised in the range of ±90º. In [44], we first proposed a DNN system that takes advantage of head movement information and explicitly estimates the DOA for an unlimited range of azimuth and elevation angles. In this study, we extend this approach to scenarios with a moving listener.…”
Section: Binaural Doae With Head Rotation or A Moving Listenermentioning
confidence: 99%
See 2 more Smart Citations
“…The dataset used for experiments follows the same setup as in [37]. Briefly, anechoic speech recordings obtained from the TIMIT dataset [38] are convolved with the simulated omnidirectional RIRs from an image-source room simulator for shoebox geometries [39].…”
Section: A Synthetic Datasetmentioning
confidence: 99%