Guoliang Cheng scite author profile

Guoliang Cheng

4 Publications

4 Citation Statements Received

81 Citation Statements Given

Affiliations

Nanjing University, Institute of Acoustics

Publications

Semi-Blind Source Separation for Nonlinear Acoustic Echo Cancellation

Cheng, Liao, Chen et al. 2021, IEEE Signal Process. Lett.

The mismatch between the numerical and actual nonlinear models is a challenge for nonlinear acoustic echo cancellation (NAEC) when a nonlinear adaptive filter is used. To alleviate this problem, we propose an effective method based on semi-blind source separation (SBSS), which uses a basis-generic expansion of the memoryless nonlinearity and then merges the unknown nonlinear expansion coefficients into the echo path. By regarding all the basis functions of the far-end input signal as known equivalent reference signals, an SBSS updating algorithm is derived based on the constrained scaled natural gradient strategy. Unlike commonly used adaptive algorithms, the proposed SBSS is based on the independence between the near-end signal and the reference signals, and is less sensitive to the mismatch of nonlinearity between the numerical and actual models. Experimental results with both simulated and real captured data validate the efficacy of the proposed method in NAEC.
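The idea of merging the expansion coefficients into the echo path can be illustrated with a short numerical sketch. It is not the paper's algorithm: the odd power-series basis, the coefficients `a`, and the echo path `h` are all assumptions chosen for illustration. The sketch only shows that "nonlinearity followed by one echo path" equals "one scaled echo path per basis function", which is what lets each basis function act as a known reference signal.

```python
import numpy as np

def basis_expand(x, orders=(1, 3, 5)):
    """Stack basis functions of the far-end signal as rows.

    A power series with odd orders is an assumed basis; the source only
    says the expansion is basis-generic.
    """
    return np.stack([x ** p for p in orders])

rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, 256)                      # synthetic far-end signal
a = np.array([1.0, -0.3, 0.05])                      # hypothetical expansion coefficients
h = rng.standard_normal(16) * 0.5 ** np.arange(16)   # hypothetical decaying echo path

refs = basis_expand(x)                               # shape (3, 256): reference signals

# Echo modeled as a memoryless nonlinearity followed by a linear echo path.
echo_direct = np.convolve(a @ refs, h)

# Same echo with each coefficient merged into its own echo path a[p] * h.
echo_merged = sum(np.convolve(refs[p], a[p] * h) for p in range(len(a)))

max_diff = float(np.max(np.abs(echo_direct - echo_merged)))
print(f"max difference between the two models: {max_diff:.2e}")
```

Because convolution is linear, the two formulations agree to floating-point precision, so the unknown coefficients never need to be estimated separately from the paths.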

Multichannel Variational Autoencoder-Based Speech Separation in Designated Speaker Order

Liao, Cheng, Ruan et al. 2022, Symmetry

The multichannel variational autoencoder (MVAE) integrates a rule-based update of the separation matrix with a deep generative model and has proved to be a competitive speech separation method. However, the output (global) permutation ambiguity still exists and turns out to be a fundamental problem in applications. In this paper, we address this problem by employing two dedicated encoders. One encodes the speaker identity to guide the output sorting, and the other encodes the linguistic information for the reconstruction of the source signals. Instance normalization (IN) and adaptive instance normalization (adaIN) are applied to the networks to disentangle the speaker representations from the content representations. The separated sources are arranged in the designated order by a symmetric permutation alignment scheme. In the experiments, we test the proposed method with different gender combinations and under various reverberant conditions, and generalize it to unseen speakers. The results validate its reliable sorting accuracy and good separation performance. The proposed method outperforms the baseline methods and maintains stable performance, achieving over 20 dB SIR improvement even in highly reverberant environments.
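For readers unfamiliar with the two normalization layers named above, here is a minimal NumPy sketch of their standard definitions. The abstract does not say where in the networks they sit; the feature layout `(channels, frames)` and the assumption that `gamma` and `beta` are derived from a speaker embedding are illustrative choices, not details from the paper.

```python
import numpy as np

def instance_norm(x, eps=1e-5):
    """Normalize each channel over the time axis; x has shape (channels, frames).

    This removes per-utterance channel statistics from the features.
    """
    mean = x.mean(axis=1, keepdims=True)
    std = np.sqrt(x.var(axis=1, keepdims=True) + eps)
    return (x - mean) / std

def adaptive_instance_norm(x, gamma, beta, eps=1e-5):
    """Re-inject per-channel statistics: gamma scales and beta shifts each channel.

    gamma and beta have shape (channels,) and are assumed here to come
    from a speaker embedding.
    """
    return gamma[:, None] * instance_norm(x, eps) + beta[:, None]

rng = np.random.default_rng(0)
features = rng.standard_normal((4, 200)) * 3.0 + 1.5   # synthetic content features
gamma = np.array([0.5, 1.0, 2.0, 1.5])                 # hypothetical speaker scales
beta = np.array([0.0, -1.0, 0.5, 2.0])                 # hypothetical speaker shifts

out = adaptive_instance_norm(features, gamma, beta)
```

After `adaptive_instance_norm`, each channel of `out` has approximately mean `beta` and standard deviation `gamma`, regardless of the statistics of the input features.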

Efficient independent vector extraction of dominant source (L)

Liao, Cheng et al. 2022

The complete decomposition performed by blind source separation is computationally demanding and superfluous when only the speech of one specific target speaker is desired. This letter proposes a computationally efficient blind source extraction method based on the fast fixed-point optimization algorithm under the mild assumption that the average power of the source of interest outweighs that of the interfering sources. Moreover, a one-unit scaling operation is designed to solve the scaling ambiguity for source extraction. Experiments validate the efficacy of the proposed method in extracting the dominant source.

Semi-blind source separation using convolutive transfer function for nonlinear acoustic echo cancellation

Cheng, Liao, Chen et al. 2023

The recently proposed semi-blind source separation (SBSS) method for nonlinear acoustic echo cancellation (NAEC) outperforms adaptive NAEC in attenuating the nonlinear acoustic echo. However, the multiplicative transfer function (MTF) approximation makes it unsuitable for real-time applications, especially in highly reverberant environments, and the natural gradient makes it hard to balance fast convergence against stability. In this paper, two more effective SBSS methods based on auxiliary-function-based independent vector analysis (AuxIVA) and independent low-rank matrix analysis (ILRMA) are proposed. The convolutive transfer function approximation is used instead of the MTF so that a long impulse response can be modeled with a short latency. The optimization schemes used in AuxIVA and ILRMA are carefully regularized according to the constrained demixing matrix of NAEC. The experimental results validate the significantly better echo cancellation performance of the proposed methods.
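The difference between the two approximations can be sketched in a few lines. Under the MTF model each frequency bin is scaled by a single complex gain, so the STFT window must span the whole impulse response. Under the CTF model each bin is filtered across frames by a few taps, so a short window can still cover a long response. The function name `ctf_filter` and the array shapes are assumptions for illustration; this is the generic signal model only, not the AuxIVA or ILRMA updates from the paper.

```python
import numpy as np

def ctf_filter(X, H):
    """Apply a convolutive transfer function per frequency bin.

    X: complex STFT of the reference, shape (bins, frames).
    H: complex CTF taps, shape (bins, taps).
    Returns Y with Y[k, n] = sum over tap of H[k, tap] * X[k, n - tap].
    """
    frames = X.shape[1]
    Y = np.zeros_like(X)
    for tap in range(H.shape[1]):
        # Delay the reference by `tap` frames and weight it by that tap.
        Y[:, tap:] += H[:, tap:tap + 1] * X[:, :frames - tap]
    return Y

rng = np.random.default_rng(1)
X = rng.standard_normal((4, 10)) + 1j * rng.standard_normal((4, 10))
H = rng.standard_normal((4, 3)) + 1j * rng.standard_normal((4, 3))

Y_ctf = ctf_filter(X, H)          # three taps per bin
Y_mtf = ctf_filter(X, H[:, :1])   # a single tap reduces to the MTF model
```

With one tap the result is simply `H[:, :1] * X`, which is the MTF approximation as a special case of the CTF.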
