We present new results on single-channel speech separation and propose a new separation approach to improve the speech quality of the signals separated from an observed mixture. The key idea is to derive a mixture estimator based on sinusoidal parameters. The proposed estimator finds sinusoidal parameters, in the form of codevectors from vector quantization (VQ) codebooks pre-trained per speaker, that, when combined, best fit the observed mixed signal. The selected codevectors are then used to reconstruct the recovered signals for the speakers in the mixture. Compared to the log-max mixture estimator used in binary masks and the Wiener filtering approach, the proposed method achieves acceptable perceptual speech quality with less cross-talk at different signal-to-signal ratios. Moreover, the method is independent of pitch estimates and reduces the computational complexity of the separation by replacing the high-dimensional short-time Fourier transform (STFT) feature vectors with sinusoidal feature vectors. We report separation results for the proposed method and compare them with benchmark methods. The improvements achieved by the proposed method over the others are confirmed by the perceptual evaluation of speech quality (PESQ) as an objective measure and a MUSHRA listening test as a subjective evaluation, in both speaker-dependent and gender-dependent scenarios.
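The core of the estimator described above is a codebook search: for each frame, find the pair of codevectors (one per speaker codebook) whose combination best matches the observed mixture. A minimal brute-force sketch of that idea, assuming sinusoidal amplitude codevectors that combine additively and a squared-error fit criterion (the paper's exact mixture model and distance measure may differ):

```python
import numpy as np

def separate_frame(mix_amp, codebook_a, codebook_b):
    """Toy per-frame codebook search: return the codevector pair
    (one from each speaker's pre-trained VQ codebook) whose sum
    best fits the observed mixture amplitude vector.

    mix_amp    : (B,) observed mixture amplitudes for one frame
    codebook_a : (Na, B) speaker-A codevectors
    codebook_b : (Nb, B) speaker-B codevectors
    """
    best_pair, best_err = (None, None), np.inf
    for ca in codebook_a:
        for cb in codebook_b:
            # squared-error fit of the combined codevectors to the mixture
            err = np.sum((mix_amp - (ca + cb)) ** 2)
            if err < best_err:
                best_pair, best_err = (ca, cb), err
    # the selected codevectors serve as the per-speaker reconstructions
    return best_pair
```

In practice the exhaustive Na×Nb search would be accelerated (e.g. by pruning), and the selected sinusoidal codevectors would drive a sinusoidal synthesis stage rather than being used directly as spectra.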
In conventional single-channel speech enhancement, the noisy spectral amplitude is typically modified while the noisy phase is reused to reconstruct the enhanced signal. Several recent studies have shown the effectiveness of an improved spectral phase for phase-aware speech enhancement and, consequently, its positive impact on perceived speech quality. In this paper, we present a harmonic phase estimation method relying on fundamental frequency and signal-to-noise ratio (SNR) information estimated from the noisy speech. The proposed method relies on SNR-based time-frequency smoothing of the unwrapped phase obtained from the decomposition of the noisy phase. To account for the uncertainty in the estimated phase caused by unreliable voicing decisions and SNR estimates, we propose a binary hypothesis test with speech-present and speech-absent classes representing high and low SNRs. The effectiveness of the proposed phase estimation method is evaluated both for phase-only enhancement of noisy speech and in combination with an amplitude-only enhancement scheme. We show that enhancing the noisy phase improves both perceived speech quality and speech intelligibility, as predicted by instrumental metrics and confirmed by subjective listening tests.
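The SNR-driven smoothing step can be illustrated with a small sketch: the unwrapped harmonic phase is smoothed recursively over time, with the smoothing factor controlled by a frame-wise SNR estimate so that low-SNR frames are smoothed heavily and high-SNR frames are left largely intact. The SNR-to-smoothing mapping below is a hypothetical parameterisation for illustration, not the paper's formulation:

```python
import numpy as np

def smooth_unwrapped_phase(phase, snr_db, alpha_low=0.9, alpha_high=0.1):
    """Temporal smoothing of an unwrapped harmonic phase track.

    phase  : (T,) unwrapped phase per frame for one harmonic
    snr_db : (T,) estimated frame-wise SNR in dB

    At low SNR the recursion leans on the past estimate (heavy
    smoothing, factor -> alpha_low); at high SNR it trusts the
    observation (light smoothing, factor -> alpha_high).
    """
    # soft speech-absent weight: ~1 at very low SNR, ~0 at very high SNR
    w = 1.0 / (1.0 + np.exp(snr_db / 3.0))
    alpha = alpha_high + (alpha_low - alpha_high) * w
    out = np.empty_like(phase)
    out[0] = phase[0]
    for t in range(1, len(phase)):
        # first-order recursive smoother with SNR-dependent factor
        out[t] = alpha[t] * out[t - 1] + (1.0 - alpha[t]) * phase[t]
    return out
```

The soft weight plays the role of the binary speech-present/speech-absent decision in the abstract; a hard hypothesis test would replace `w` with a threshold on the SNR estimate.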
Sum-product networks (SPNs) are a recently proposed type of probabilistic graphical model that allows complex variable interactions while still granting efficient inference. In this paper we demonstrate the suitability of SPNs for modeling log-spectra of speech signals in the application of artificial bandwidth extension, i.e. artificially recovering the high-frequency content lost in telephone signals. We use SPNs as observation models in hidden Markov models (HMMs), which model the temporal evolution of log short-time spectra. Missing frequency bins are filled in by the SPNs using most-probable-explanation inference, where the state-dependent reconstructions are weighted by the HMM state posterior. According to subjective listening and objective evaluation, our system consistently and significantly improves on the state of the art.
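The reconstruction step described above can be sketched compactly: per-state observation log-likelihoods (as an SPN observation model would provide) are turned into state posteriors, which then weight the state-dependent MPE reconstructions of the missing high-band bins. A frame-wise sketch with hypothetical shapes, omitting the forward-backward smoothing an HMM would normally contribute to the posteriors:

```python
import numpy as np

def reconstruct_highband(loglik_per_state, state_mpe_recons):
    """Posterior-weighted reconstruction of missing high-frequency bins.

    loglik_per_state : (S,) log-likelihood of the observed low-band
                       spectrum under each state's observation model
    state_mpe_recons : (S, B) per-state MPE reconstruction of the
                       B missing high-band bins
    """
    # numerically stable softmax over states -> posterior weights
    ll = loglik_per_state - np.max(loglik_per_state)
    post = np.exp(ll) / np.sum(np.exp(ll))
    # expected reconstruction under the state posterior
    return post @ state_mpe_recons
```

When one state dominates the likelihood, the output approaches that state's MPE reconstruction; ambiguous frames blend several state-dependent reconstructions instead of committing to a hard state decision.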