2013 IEEE/RSJ International Conference on Intelligent Robots and Systems 2013
DOI: 10.1109/iros.2013.6696771
|View full text |Cite
|
Sign up to set email alerts
|

A learning-based approach to robust binaural sound localization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
39
0

Year Published

2015
2015
2022
2022

Publication Types

Select...
5
1
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 32 publications
(39 citation statements)
references
References 15 publications
0
39
0
Order By: Relevance
“…One solution could consist in learning the head effect in realistic conditions. Such an idea was successfully assessed in [29] through a dedicated neural network able to generalize learning to new acoustic conditions. One can also cite [30], or [31], where the iCub humanoid robot's head was endowed with two pinnae.…”
Section: Horizontal Localizationmentioning
confidence: 99%
“…One solution could consist in learning the head effect in realistic conditions. Such an idea was successfully assessed in [29] through a dedicated neural network able to generalize learning to new acoustic conditions. One can also cite [30], or [31], where the iCub humanoid robot's head was endowed with two pinnae.…”
Section: Horizontal Localizationmentioning
confidence: 99%
“…Finally, in what respect to the experimental setup, most works use simulated data either for training or for training and testing [44][45][46][47][48][49][50][51][52][54][55][56][57][58][59], usually by convolving clean (anechoic) speech with impulse responses (room, head related, or DOA related (azimuth, elevation)). Only some of them actually face real recordings [44,45,53,55,56], which in our opinion is a must to be able to assess the actual impact of the proposals in real conditions. So, in this paper we describe, for the first time in the literature to the best of our knowledge, a CNN architecture in which we directly exploit the raw acoustic signal to be provided to the neural network, with the objective of directly estimating the three dimensional position of an acoustic source in a given environment.…”
Section: State Of the Artmentioning
confidence: 99%
“…Alternatively, multidimensional features can be extracted as sets of frequency-dependent components, which require a signal frequency-dependent decomposition. This can be done through FFTs [2,[34][35][36] or filterbanks, notably of gammatone filters [4,5,37,38].…”
Section: Speaker Localizationmentioning
confidence: 99%
“…It undertakes tasks like sound processing and de-noising (speech, music, and environmental sounds), sound source localization, separation and identification. A growing field of CASA research is binaural computer audition [2][3][4][5]. Relying on signals acquired inside a human-like head and ears, it attempts to create a computational reproduction of the human auditory system stages.…”
Section: Introductionmentioning
confidence: 99%