Symeon Delikaris-Manias scite author profile

This work proposes a novel method for 3D direction of arrival (DOA) estimation based on the sound intensity vector estimation, via the encoding of the signals of a spherical microphone array from the space domain to the spherical harmonic domain. The sound intensity vector is estimated on detected single source zones (SSZs), where one source is dominant. A smoothed 2D histogram of these estimates reveals the DOA of the present sources and through an iterative process, accurate 3D DOA information can be obtained. The performance of the proposed method is demonstrated through simulations in various signal-to-noise ratio and reverberation conditions.

show abstract

Laser-induced acoustic point source for accurate impulse response measurements within the audible bandwidth

Bolaños

Delikaris-Manias

Pulkki

et al. 2014

View full text Add to dashboard Cite

Laser induced air breakdown is proposed as a sound source for accurate impulse response measurements. Within the audible bandwidth, the source is repeatable, broadband, and omnidirectional. The applicability of the source was evaluated by measuring the impulse response of a room. The proposed source provides a more accurate temporal and spatial representation of room reflections than conventional loudspeakers due to its omnidirectionality, negligible size and short pulse duration.

show abstract

3D DOA estimation of multiple sound sources based on spatially constrained beamforming driven by intensity vectors

Pavlidi¹,

Delikaris-Manias²,

Pulkki³

et al. 2016

View full text Add to dashboard Cite

Sound source localization in three dimensions with microphone arrays is an active field of research, applicable in sound enhancement, source separation, and sound field analysis. In this contribution we propose a method for three dimensional multiple sound source localization in reverberant environments. We employ a spatially constrained steered response beamformer on a spherical sector centered at the direction of arrival (DOA) estimates of the intensity vector. Experiments are performed in both simulated and real acoustical environments with a spherical microphone array for multiple sound sources under different reverberation and signal-to-noise ratio (SNR) conditions. The performance of the proposed method is compared with our previously proposed work and a subspace method in the spherical harmonic domain. The results demonstrate a significant improvement in terms of localization accuracy.Index Termsdirection of arrival, 3D, multiple sources, spherical microphone array processing, sound intensity

show abstract

Cross Pattern Coherence Algorithm for Spatial Filtering Applications Utilizing Microphone Arrays

Delikaris-Manias

Pulkki

2013

IEEE Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

Signal-Dependent Spatial Filtering Based on Weighted-Orthogonal Beamformers in the Spherical Harmonic Domain

Delikaris-Manias

Vilkamo

Pulkki

2016

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

Spatial filtering with microphone arrays is a technique that can be utilized to obtain the signal of a target sound source from a specific direction. Typical approaches in the field of audio underperform in practical environments with multiple sound sources and diffuse sound. In this contribution we propose a post-filtering technique to suppress the effect of interferers and diffuse sound. The proposed technique utilizes the cross-spectral estimates of the output of two beamformers to formulate a timefrequency soft masker. The beamformers' outputs are used only for parameter estimation and not for generating an audio signal. Two sets of beamformer weights, a constant and an adaptive, are applied to the microphone array signals for the parameter estimation. The weights of the constant beamformer are designed such that they provide a spatially narrow beam pattern that is time and frequency invariant, having a unity gain towards the direction of interest. The weights of the adaptive beamformer are formulated using linearly constrained optimization with the constraint of weighted orthogonality with respect to the constant beamformer weights, as well as the unity gain towards the look direction. The orthogonality constraint provides diffuse sound suppression while the unity gain distortionless response. The cross spectrum of these two beamformers provides the target energy at a given look direction for the post filter. The study focuses on compact microphone arrays with which the typical beamforming techniques feature a trade-off between noise amplification and spatial selectivity, especially in the low frequency region. The proposed method is evaluated with instrumental measures and listening tests under different reverberation times, in dual and multi-talker scenarios. The evaluation shows that the proposed method provides a better performance when compared with a previous state-of-the-art spatial filter based on cross-pattern coherence, a linearly constrained beamformer and a Wiener postfilter.

show abstract

Beamforming with a volumetric array of massless laser spark sources—Application in reflection tracking

Eskelinen

Hæggström

Delikaris-Manias

et al. 2015

View full text Add to dashboard Cite

A volumetric array of laser-induced air breakdown sparks is used to produce a directional and steerable acoustic source. The laser breakdown array element is broadband, point-like, and massless. It produces an impulse-like waveform in midair, thus generating accurate spatio-temporal information for acoustic beamforming. A laser-spark scanning setup and the concept of a massless steerable source are presented and evaluated with a cubic array by using an off-line far field delay-and-sum beamforming method. This virtual acoustic array with minimal source influence can, for instance, produce narrow transmission beams to obtain localized and directional impulse response information by reflection tracking.

show abstract

Applications of Spatially Localized Active-Intensity Vectors for Sound-Field visualization

McCormack¹,

Delikaris-Manias²,

Politis³

et al. 2019

J. Audio Eng. Soc.

View full text Add to dashboard Cite

The purpose of this article is to detail and evaluate three alternative approaches to soundfield visualization, which all employ the use of spatially localized active-intensity (SLAI) vectors. These SLAI vectors are of particular interest, as they allow direction-of-arrival (DoA) estimates to be extracted in multiple spatially localized sectors, such that a sound source present in one sector has reduced influence on the DoA estimate made in another sector. These DoA estimates may be used to visualize the sound-field by either: (I) directly depicting the estimates as icons, with their relative size dictated by the corresponding energy of each sector; (II) generating traditional activity maps via histogram analysis of the DoA estimates; or (III) by using the DoA estimates to reassign energy and subsequently sharpen traditional beamformer-based activity maps. Since the SLAI-based DoA estimates are continuous, these approaches are inherently computationally efficient, as they forego the need for dense scanning grids to attain high-resolution imaging. Simulation results also show that these SLAI-based alternatives outperform traditional active-intensity and beamformer-based approaches, for the majority of cases.

show abstract

DOA estimation with histogram analysis of spatially constrained active intensity vectors

Delikaris-Manias¹,

Pavlidi²,

Mouchtaris³

et al. 2017

View full text Add to dashboard Cite

The active intensity vector (AIV) is a common descriptor of the sound field. In microphone array processing, AIV is commonly approximated with beamforming operations and utilized as a direction of arrival (DOA) estimator. However, in its original form, it provides inaccurate estimates in sound field conditions where coherent sound sources are simultaneously active. In this work we utilize a higher order intensitybased DOA estimator on spatially-constrained regions (SCR) to overcome such limitations. We then apply 1-dimensional (1D) histogram processing on the noisy estimates for multiple DOA estimation. The performance of the estimator is shown with a 7-channel microphone array, fitted on a rigid mobile-like device, in reverberant conditions and under different signal-to-noise ratios.

show abstract

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.