This article details an investigation into the perceptual effects of different rendering strategies when synthesizing loudspeaker array room impulse responses (RIRs) using microphone array RIRs in a parametric fashion. The aim of this rendering task is to faithfully reproduce the spatial characteristics of a captured space, encoded within the input microphone array RIR (or the spherical harmonic RIR derived from it), over a loudspeaker array. For this study, a higherorder formulation of the Spatial Impulse Response Rendering (SIRR) method is introduced and subsequently employed to investigate the perceptual effects of the following rendering configurations: the spherical harmonic input order, frequency resolution, and utilizing dedicated diffuse stream rendering. Formal listening tests were conducted using a 64-channel loudspeaker array in an anechoic chamber, where simulated reference scenarios were compared against the outputs of different methods and rendering configurations. The test results indicate that dedicated diffuse stream rendering and higher analysis orders both yield noticeable perceptual improvements, particularly when employing problematic transient stimuli as input. Additionally, it was found that the frequency resolution employed during rendering has only a minor influence over the perceived accuracy of the reproduction in comparison to the other two tested attributes.
In this article, the application of spatial covariance matching is investigated for the task of producing spatially enhanced binaural signals using head-worn microphone arrays. A two-step processing paradigm is followed, whereby an initial estimate of the binaural signals is first produced using one of three suggested binaural rendering approaches. The proposed spatial covariance matching enhancement is then applied to these estimated binaural signals with the intention of producing refined binaural signals that more closely exhibit the correct spatial cues as dictated by the employed sound-field model and associated spatial parameters. It is demonstrated, through objective and subjective evaluations, that the proposed enhancements in the majority of cases produce binaural signals that more closely resemble the spatial characteristics of simulated reference signals when the enhancement is applied to and compared against the three suggested starting binaural rendering approaches. Furthermore, it is shown that the enhancement produces spatially similar output binaural signals when using these three different approaches, thus indicating that the enhancement is general in nature and could, therefore, be employed to enhance the outputs of other similar binaural rendering algorithms.
The purpose of this article is to detail and evaluate three alternative approaches to soundfield visualization, which all employ the use of spatially localized active-intensity (SLAI) vectors. These SLAI vectors are of particular interest, as they allow direction-of-arrival (DoA) estimates to be extracted in multiple spatially localized sectors, such that a sound source present in one sector has reduced influence on the DoA estimate made in another sector. These DoA estimates may be used to visualize the sound-field by either: (I) directly depicting the estimates as icons, with their relative size dictated by the corresponding energy of each sector; (II) generating traditional activity maps via histogram analysis of the DoA estimates; or (III) by using the DoA estimates to reassign energy and subsequently sharpen traditional beamformer-based activity maps. Since the SLAI-based DoA estimates are continuous, these approaches are inherently computationally efficient, as they forego the need for dense scanning grids to attain high-resolution imaging. Simulation results also show that these SLAI-based alternatives outperform traditional active-intensity and beamformer-based approaches, for the majority of cases.
A method for computing and sharpening angular spectra, derived from low-order ambisonic signals, is presented in this paper, which is intended for high-resolution directional sound-field visualisation. The method relies on a re-assignment principle, whereby the directional energy for each grid point is assigned to a new direction, which corresponds to a direction-of-arrival (DoA) estimate within a spatially-localised region, centred around the respective grid point. This leads to the concentration of energy around the true sources, and hence, to sharper angular spectra than that of steered response power (SRP) beamformers of maximum directivity, with the same order of ambisonic input. It is demonstrated that the proposed method, when using loworder input, can achieve similar results to the SRP approach of much higher order.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.