The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2018
DOI: 10.1109/taslp.2018.2835723
|View full text |Cite
|
Sign up to set email alerts
|

PSD Estimation and Source Separation in a Noisy Reverberant Environment Using a Spherical Microphone Array

Abstract: In this paper, we propose an efficient technique for estimating individual power spectral density (PSD) components, i.e., PSD of each desired sound source as well as of noise and reverberation, in a multi-source reverberant sound scene with coherent background noise. We formulate the problem in the spherical harmonics domain to take the advantage of the inherent orthogonality of the spherical harmonics basis functions and extract the PSD components from the crosscorrelation between the different sound field mo… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
3
2
1

Relationship

3
3

Authors

Journals

citations
Cited by 21 publications
(12 citation statements)
references
References 34 publications
0
12
0
Order By: Relevance
“…The resulting ERPs show the expected P3 response for target tones only (Polich, 2007). Advancing from this simple validation task to everyday life settings, PSD information could be used to differentiate between different sound sources (e.g., Fahim, Samarasinghe, & Abhayapala, 2018). For example, in a two speaker scenario, PSD can be used to identify which speaker (low vs. high voice) is currently talking.…”
Section: Discussionmentioning
confidence: 98%
“…The resulting ERPs show the expected P3 response for target tones only (Polich, 2007). Advancing from this simple validation task to everyday life settings, PSD information could be used to differentiate between different sound sources (e.g., Fahim, Samarasinghe, & Abhayapala, 2018). For example, in a two speaker scenario, PSD can be used to identify which speaker (low vs. high voice) is currently talking.…”
Section: Discussionmentioning
confidence: 98%
“…Figure 1b depicts the localization accuracy of all estimators with a function of the reverberation levels in the range T60 = {0.2, 0.3, 0.4, 0.5, 0.6} s. All the estimators are still useable even when the T60 = 0.6 s. A stronger reverberation time implies the direct-path is contaminated by the acoustic reflections, thus all the estimators' localization accuracy degrades. It is observed 3 https://www.audiolabs-erlangen.de/fau/professor/habets/software/rirgenerator that the performance of the 'Gradient descent' estimator deteriorates more severally than the '2-D search' and 'Decoupled' estimators in more reverberant environments. This may be attributed to the sensitivity of gradient descent search in (13) to the acoustic reflections as compared with that are matching between the estimated and the theoretical RHC.…”
Section: Methodsmentioning
confidence: 99%
“…In the past decades, source direction-of-arrival (DOA) estimation [1,2] has been extensively investigated in the research community since it is an essential component in many spatial signal processing techniques and applications including source dereverberation, speech separation [3], automatic speech recognition [4] and automated camera steering [5].…”
Section: Introductionmentioning
confidence: 99%
“…Using (13) in (12) and then by comparing it with (3), we obtain an analytical expression for α nm in a reverberant room as [56] α…”
Section: B Rtf In the Spatial Domainmentioning
confidence: 99%