2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2014
DOI: 10.1109/icassp.2014.6854447
|View full text |Cite
|
Sign up to set email alerts
|

HRTF magnitude synthesis via sparse representation of anthropometric features

Abstract: We propose a method for the synthesis of the magnitudes of Head-related Transfer Functions (HRTFs) using a sparse representation of anthropometric features. Our approach treats the HRTF synthesis problem as finding a sparse representation of the subject's anthropometric features w.r.t. the anthropometric features in the training set. The fundamental assumption is that the magnitudes of a given HRTF set can be described by the same sparse combination as the anthropometric data. Thus, we learn a sparse vector th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
27
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 51 publications
(29 citation statements)
references
References 18 publications
2
27
0
Order By: Relevance
“…In order to design it properly, the knowledge of subjective perception should be considered. Since log-magnitude spectra preserves all of the perceptuallyrelevant information which is contained in a measured HRTF for a position [21], we design loss function derived from log-spectral distortion (LSD) which represents the difference between HRTFs on a logarithmic basis from human hearing, and has been widely used for objective evaluation of HRTFs models [25][26] [27]. LSD expresses the distortion between the estimated and the measured HRTFs.…”
Section: Universal Model Trainingmentioning
confidence: 99%
“…In order to design it properly, the knowledge of subjective perception should be considered. Since log-magnitude spectra preserves all of the perceptuallyrelevant information which is contained in a measured HRTF for a position [21], we design loss function derived from log-spectral distortion (LSD) which represents the difference between HRTFs on a logarithmic basis from human hearing, and has been widely used for objective evaluation of HRTFs models [25][26] [27]. LSD expresses the distortion between the estimated and the measured HRTFs.…”
Section: Universal Model Trainingmentioning
confidence: 99%
“…year # subjects # meas. # pairs ARI˚ [23] 2010 135 1150 138000 CIPIC [24] 2001 45 1250 56250 ITA˚˚ [25] 2016 46 2304 110592 Microsoft [9] 2015 252 400 100800 RIEC [26] 2014 105 865 90825…”
Section: Classification Performancementioning
confidence: 99%
“…They show that in a localization test, the model demonstrated qualitatively similar performance to a human subject. A related area of active research aims to personalize generic HRTFs given a user's anthropometric features [9,10,11].…”
Section: Introductionmentioning
confidence: 99%
“…Hu et al [11] conducted a nonlinear mapping from eight independent anthropometric features to the twelve weight vectors of the principal components by using back-propagation neural network to obtain individualized HRTFs. Under the assumption that the linear weighting relationship between HRTF magnitudes are the same for different subjects and anthropometric features, Bilinski et al [12] proposed the sparse efficient vectors, which can be used to represent anthropometric features of target subject with that of available subjects in the database. The individualized HRTF magnitudes could be obtained by multiplying the sparse vectors to the HRTF magnitudes in the database.…”
Section: Introductionmentioning
confidence: 99%