Surrogate Source Model Learning for Determined Source Separation

Scheibler, Robin; Togami, Masahito

doi:10.1109/icassp39728.2021.9414255

Cited by 13 publications

(5 citation statements)

References 27 publications

(56 reference statements)

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…( 1 ) 音源分離における空間フィルタをニューラルネットワークによって推論 (112)， (113) ( 2 ) ブラインド音源分離の音源モデルへの深層学習の導入 ( 114) - (120) ( 3 ) 位相差などの複数マイクロホンの信号から得た情報をニューラルネットワークの特徴量として利用 (103)， (121) ( 4 ) ビームフォーミングで用いる共分散行列をニューラルネットワークを介して求めた時間周波数マスクを用いて推定 (122)…”

Section: の様々な組み合わせ方が検討されている．unclassified

音源分離技術の基礎と動向

Togami

2023

IEICE Fundamentals Review

Self Cite

View full text Add to dashboard Cite

Sound source separation, which separates multiple sound sources from a mixture, has continued to evolve by incorporating beamforming techniques in wireless communication, signal processing, optimization techniques based on probabilistic models, and deep learning techniques. This paper prondes an overview of sound source separation techniques for multiple microphones based on a spatial model and a probabilistic sound source model, for a single microphone with deep learning, and for multiple microphones using a deep-learning-based sound source model and a spatial model.

show abstract

Section: の様々な組み合わせ方が検討されている．unclassified

音源分離技術の基礎と動向

Togami

2023

IEICE Fundamentals Review

Self Cite

View full text Add to dashboard Cite

show abstract

“…Let a k,i t be the mixing vector after i deflation steps computed on the tth block via (8). Due to the orthogonality of w k,i and a k,i t , the subtraction is achieved through…”

Section: E Re-estimation Of the Soi On Extraction Failure: Deflationmentioning

confidence: 99%

“…The recovered frequency components have a random order and all components corresponding to the wide-band source need to be identified in order to reconstruct it in the time-domain. To alleviate this drawback, the independent vector analysis (IVA, [7], [8]) has been proposed. It binds together the frequency components corresponding to a single source using higherorder dependencies among them.…”

Section: Introductionmentioning

confidence: 99%

Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification

Málek

Janský

Koldovsky

et al. 2022

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

This manuscript proposes a novel robust procedure for the extraction of a speaker of interest (SOI) from a mixture of audio sources. The estimation of the SOI is performed via independent vector extraction (IVE). Since the blind IVE cannot distinguish the target source by itself, it is guided towards the SOI via frame-wise speaker identification based on deep learning. Still, an incorrect speaker can be extracted due to guidance failings, especially when processing challenging data. To identify such cases, we propose a criterion for nonintrusively assessing the estimated speaker. It utilizes the same model as the speaker identification, so no additional training is required. When incorrect extraction is detected, we propose a "deflation" step in which the incorrect source is subtracted from the mixture and, subsequently, another attempt to extract the SOI is performed. The process is repeated until successful extraction is achieved. The proposed procedure is experimentally tested on artificial and real-world datasets containing challenging phenomena: source movements, reverberation, transient noise, or microphone failures. The method is compared with state-of-theart blind algorithms as well as with current fully supervised deep learning-based methods.

show abstract

“…Recently, it was shown that DNNs can estimate statistics to control step-sizes [61], [62] or estimate entire updates [63] for a single-channel AEC. Similarly, past work has used DNNs to predict updates for the internal statistics of multi-channel beamformers [64] and to learn source-models for multi-channel source separation [65]. These works differ from hybrid approaches in that they leverage neural networks to update or control AFs directly and thus focus on improving the performance of AFs themselves.…”

Section: Introductionmentioning

confidence: 99%

Meta-AF: Meta-Learning for Adaptive Filters

Casebeer

Bryan

Smaragdis

2023

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

Adaptive filtering algorithms are pervasive throughout signal processing and have had a material impact on a wide variety of domains including audio processing, telecommunications, biomedical sensing, astrophysics and cosmology, seismology, and many more. Adaptive filters typically operate via specialized online, iterative optimization methods such as least-mean squares or recursive least squares and aim to process signals in unknown or nonstationary environments. Such algorithms, however, can be slow and laborious to develop, require domain expertise to create, and necessitate mathematical insight for improvement. In this work, we seek to improve upon hand-derived adaptive filter algorithms and present a comprehensive framework for learning online, adaptive signal processing algorithms or update rules directly from data. To do so, we frame the development of adaptive filters as a metalearning problem in the context of deep learning and use a form of self-supervision to learn online iterative update rules for adaptive filters. To demonstrate our approach, we focus on audio applications and systematically develop meta-learned adaptive filters for five canonical audio problems including system identification, acoustic echo cancellation, blind equalization, multi-channel dereverberation, and beamforming. We compare our approach against common baselines and/or recent state-ofthe-art methods. We show we can learn high-performing adaptive filters that operate in real-time and, in most cases, significantly outperform each method we compare against -all using a single general-purpose configuration of our approach.

show abstract

Surrogate Source Model Learning for Determined Source Separation

Cited by 13 publications

References 27 publications

音源分離技術の基礎と動向

音源分離技術の基礎と動向

Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification

Meta-AF: Meta-Learning for Adaptive Filters

Contact Info

Product

Resources

About