2020
DOI: 10.3390/s20133768
|View full text |Cite
|
Sign up to set email alerts
|

Configuration-Invariant Sound Localization Technique Using Azimuth-Frequency Representation and Convolutional Neural Networks

Abstract: Deep neural networks (DNNs) have achieved significant advancements in speech processing, and numerous types of DNN architectures have been proposed in the field of sound localization. When a DNN model is deployed for sound localization, a fixed input size is required. This is generally determined by the number of microphones, the fast Fourier transform size, and the frame size. if the numbers or configurations of the microphones change, the DNN model should be retrained because the size of the input fe… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 17 publications
(27 reference statements)
0
1
0
Order By: Relevance
“…To improve SSL performance in noisy and reverberant environments, the steering vector phase difference enhancement is realized by the deep neural network [ 17 ]. The SSL system invariant to the configuration was developed by Chun et al based on the azimuth-frequency representation and convolutional neural networks [ 18 ].…”
Section: Introductionmentioning
confidence: 99%
“…To improve SSL performance in noisy and reverberant environments, the steering vector phase difference enhancement is realized by the deep neural network [ 17 ]. The SSL system invariant to the configuration was developed by Chun et al based on the azimuth-frequency representation and convolutional neural networks [ 18 ].…”
Section: Introductionmentioning
confidence: 99%