2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2014
DOI: 10.1109/icassp.2014.6854122
|View full text |Cite
|
Sign up to set email alerts
|

Gaussian process models for HRTF based 3D sound localization

Abstract: The human ability to localize sound-source direction using just two receivers is a complex process of direction inference from spectral cues of sound arriving at the ears. While these cues can be described using the well-known head-related transfer function (HRTF) concept, it is unclear as to how densely HRTF must be sampled and whether a higher-order representation is employed in localization. We propose a class of binaural sound source localization models to answer these two questions. First, using the sound… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 10 publications
(6 citation statements)
references
References 27 publications
(39 reference statements)
0
6
0
Order By: Relevance
“…To overcome the need of a complex explicit sound propagation model, a number of supervised approaches to SSL have been recently proposed. These methods use either artificial neural networks [2], manifold learning [8], [9] or regression [5], [9], [19], [22], [10], first to learn a mapping from binaural features to the (1D or 2D) direction of a single source, and second to infer an unknown source direction from binaural observations. These methods have the advantage that an explicit HRTF model is replaced by an implicit one, embedded in the parameters learned during the training stage.…”
Section: A Related Work and Contributionsmentioning
confidence: 99%
See 1 more Smart Citation
“…To overcome the need of a complex explicit sound propagation model, a number of supervised approaches to SSL have been recently proposed. These methods use either artificial neural networks [2], manifold learning [8], [9] or regression [5], [9], [19], [22], [10], first to learn a mapping from binaural features to the (1D or 2D) direction of a single source, and second to infer an unknown source direction from binaural observations. These methods have the advantage that an explicit HRTF model is replaced by an implicit one, embedded in the parameters learned during the training stage.…”
Section: A Related Work and Contributionsmentioning
confidence: 99%
“…For a single spatially-narrow emitter, the ILD and IPD depend on the emitter's position relative to the head, namely the 2D directional vector formed by azimuth and elevation. Binaural features have hence been used for single sound source localization (single-SSL), e.g., [2], [3], [4], [5], [6], [7], [8], [9], [10], [11]. Matters are more complex when multiple sound sources, emitting from different directions, are simultaneously active.…”
Section: Introductionmentioning
confidence: 99%
“…The magnitude ratio (MR) is a bounded form of the ILD [8]. As the ILD can result in extreme values, the MR is used in its place.…”
Section: Proposed Methodsmentioning
confidence: 99%
“…is taken to eliminate background noise. One of the current state-of-the-art binaural sound localisation framework is called Head-Related Transfer Function (HRTF) [105], [106]. The idea is to recover how human ears receive and perceive sound, and treat the human way of localising sound as a transfer function.…”
Section: Maritime Object Localisation By Soundmentioning
confidence: 99%