The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2018
DOI: 10.1109/icassp.2018.8462639
|View full text |Cite
|
Sign up to set email alerts
|

Semi-Supervised Learning with Deep Neural Networks for Relative Transfer Function Inverse Regression

Abstract: supervised learning with deep neural networks for relative transfer function inverse regression. ABSTRACTPrior knowledge of the relative transfer function (RTF) is useful in many applications but remains little studied. In this paper, we propose a semi-supervised learning algorithm based on deep neural networks (DNNs) for RTF inverse regression, that is to generate the full-band RTF vector directly from the source-receiver pose (position and orientation). Two typical scenarios are discussed: training on labele… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
7
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 23 publications
(31 reference statements)
0
7
0
Order By: Relevance
“…The phase features are closely related to IPD, which are thus further activated by sine and cosine functions as is done in Eq. (10). The phase branch is also followed by a convolutional layer with 64 3 × 3 kernels, a BN and a ReLU activation function.…”
Section: Dp-rtf Learningmentioning
confidence: 99%
See 1 more Smart Citation
“…The phase features are closely related to IPD, which are thus further activated by sine and cosine functions as is done in Eq. (10). The phase branch is also followed by a convolutional layer with 64 3 × 3 kernels, a BN and a ReLU activation function.…”
Section: Dp-rtf Learningmentioning
confidence: 99%
“…The complementarity of the two types of difference features contributes the fusion of time and intensity difference information. A typical fused feature is relative transfer function (RTF) [9], [10] which encodes time and intensity difference in its argument and magnitude respectively.…”
Section: Introductionmentioning
confidence: 99%
“…Moreover, inter-channel intensity difference (IID) is computed as the energy ratio of the signals captured by two microphones. Relative transfer function (RTF) [10,11] encodes time and intensity information in its argument and magnitude respectively, which is the ratio between the acoustic transfer functions of the two channels. Other high-level localization features include the cross-correlation function (CCF) [3], the eigen vectors of spatial correlation matrix associated with signal subspace [12], and so forth.…”
Section: Introductionmentioning
confidence: 99%
“…Under the dual-stage localization framework, deep neural network (DNN) can be used to either extract localization features [3,4], or build the mapping from the localization features to source location [5,6]. Commonly used localization feature includes inter-channel time difference (ITD) [7], inter-channel phase difference (IPD) [8], inter-channel intensity difference (IID), relative transfer function (RTF) [9,10], etc. The source can be easily localized with aforementioned localization features under a noisefree and anechoic condition.…”
Section: Introductionmentioning
confidence: 99%