This paper presents a robust and precise method for solving the permutation problem of frequency-domain blind source separation. It is based on two previous approaches: the direction of arrival estimation and the inter-frequency correlation. We discuss the advantages and disadvantages of the two approaches, and integrate them to exploit their respective advantages. We also present a closed form formula to estimate the directions of source signals from a separating matrix obtained by ICA. Experimental results show that our method solved permutation problems almost perfectly for a situation that two sources were mixed in a room whose reverberation time was 300 ms.
This paper addresses the determined blind source separation problem and proposes a new effective method unifying independent vector analysis (IVA) and nonnegative matrix factorization (NMF). IVA is a state-of-the-art technique that utilizes the statistical independence between sources in a mixture signal, and an efficient optimization scheme has been proposed for IVA. However, since the source model in IVA is based on a spherical multivariate distribution, IVA cannot utilize specific spectral structures such as the harmonic structures of pitched instrumental sounds. To solve this problem, we introduce NMF decomposition as the source model in IVA to capture the spectral structures. The formulation of the proposed method is derived from conventional multichannel NMF (MNMF), which reveals the relationship between MNMF and IVA. The proposed method can be optimized by the update rules of IVA and single-channel NMF. Experimental results show the efficacy of the proposed method compared with IVA and MNMF in terms of separation accuracy and convergence speed.Index Terms-Blind source separation, determined, independent vector analysis, nonnegative matrix factorization.
Abstract-This paper presents new formulations and algorithms for multichannel extensions of non-negative matrix factorization (NMF). The formulations employ Hermitian positive semidefinite matrices to represent a multichannel version of non-negative elements. Multichannel Euclidean distance and multichannel Itakura-Saito (IS) divergence are defined based on appropriate statistical models utilizing multivariate complex Gaussian distributions. To minimize this distance/divergence, efficient optimization algorithms in the form of multiplicative updates are derived by using properly designed auxiliary functions. Two methods are proposed for clustering NMF bases according to the estimated spatial property. Convolutive blind source separation (BSS) is performed by the multichannel extensions of NMF with the clustering mechanism. Experimental results show that 1) the derived multiplicative update rules exhibited good convergence behavior, and 2) BSS tasks for several music sources with two microphones and three instrumental parts were evaluated successfully.
This paper proposes a two-stage method for the blind separation of convolutively mixed sources. We employ time-frequency masking, which can be applied even to an underdetermined case where the number of sensors is insuf¿cient for the number of sources. In the ¿rst stage of the method, frequency bin-wise mixtures are classi¿ed based on Gaussian mixture model ¿tting. In the second stage, the permutation ambiguities of the bin-wise classi¿ed signals are aligned by clustering the posterior probability sequences calculated in the ¿rst stage. Experimental results for separating four speeches with three microphones under reverberant conditions show the superiority of the proposed method over existing methods based on time-difference-of-arrival estimations or signal envelope clustering.
Magnetic Weyl semimetals have novel transport phenomena related to pairs of Weyl nodes in the band structure. Although the existence of Weyl fermions is expected in various oxides, the evidence of Weyl fermions in oxide materials remains elusive. Here we show direct quantum transport evidence of Weyl fermions in an epitaxial 4d ferromagnetic oxide SrRuO3. We employ machine-learning-assisted molecular beam epitaxy to synthesize SrRuO3 films whose quality is sufficiently high to probe their intrinsic transport properties. Experimental observation of the five transport signatures of Weyl fermions—the linear positive magnetoresistance, chiral-anomaly-induced negative magnetoresistance, π phase shift in a quantum oscillation, light cyclotron mass, and high quantum mobility of about 10,000 cm2V−1s−1—combined with first-principles electronic structure calculations establishes SrRuO3 as a magnetic Weyl semimetal. We also clarify the disorder dependence of the transport of the Weyl fermions, which gives a clear guideline for accessing the topologically nontrivial transport phenomena.
Abstract. This article provides an overview of the first stereo audio source separation evaluation campaign, organized by the authors. Fifteen underdetermined stereo source separation algorithms have been applied to various audio data, including instantaneous, convolutive and real mixtures of speech or music sources. The data and the algorithms are presented and the estimated source signals are compared to reference signals using several objective performance criteria.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.