2021
DOI: 10.48550/arxiv.2109.03465
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Survey of Sound Source Localization with Deep Learning Methods

Pierre-Amaury Grumiaux,
Srđan Kitić,
Laurent Girin
et al.

Abstract: This article is a survey on deep learning methods for single and multiple sound source localization. We are particularly interested in sound source localization in indoor/domestic environment, where reverberation and diffuse noise are present. We provide an exhaustive topography of the neural-based localization literature in this context, organized according to several aspects: the neural network architecture, the type of input features, the output strategy (classification or regression), the types of data use… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
14
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 12 publications
(14 citation statements)
references
References 187 publications
(461 reference statements)
0
14
0
Order By: Relevance
“…However, these approaches are sensitive to room acoustics and noisy backgrounds and may be unreliable when multiple sources are present. More recently, machine learning has been used for direction of arrival estimation with some success [12,13,19]. Although these methods improve upon the traditional approaches, the lack of visual information limits the efficacy of these systems in real-word settings.…”
Section: Related Workmentioning
confidence: 99%
“…However, these approaches are sensitive to room acoustics and noisy backgrounds and may be unreliable when multiple sources are present. More recently, machine learning has been used for direction of arrival estimation with some success [12,13,19]. Although these methods improve upon the traditional approaches, the lack of visual information limits the efficacy of these systems in real-word settings.…”
Section: Related Workmentioning
confidence: 99%
“…Among the most widely used methods are narrowband realizations of steered response power (SRP) [29,30], where the direction is determined by maximizing the output power of a beamformer, as well as subspace decomposition-based methods like MUSIC [31]. Alternatively, a deep learning approach can also be used [32].…”
Section: Doa-guided Source Separationmentioning
confidence: 99%
“…Estimating the direction of arrival (DOA) of a sound while also classifying the type of event is an important type of frontend processing for a wide variety of monitoring and robotic applications. Due to its wide applicability, this task, often referred to in the literature as sound event localization and detection (SELD), has recently seen a surge of interest [1][2][3][4]. However, SELD remains challenging because sound sources can move, cease to produce sound, have their positions obscured by room reverberation, and are often mixed with interfering sounds.…”
Section: Introductionmentioning
confidence: 99%