ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021
DOI: 10.1109/icassp39728.2021.9414620
|View full text |Cite
|
Sign up to set email alerts
|

A Novel NMF-HMM Speech Enhancement Algorithm Based on Poisson Mixture Model

Abstract: In this paper, we propose a novel non-negative matrix factorization (NMF) and hidden Markov model (NMF-HMM) based speech enhancement algorithm, which employs a Poisson mixture model (PMM). Compared to the previously proposed NMF-HMM method, the new algorithm, termed PMM-NMF-HMM, uses the Poisson mixture distribution for the state conditional likelihood function for a HMM rather than the single Poisson distribution. This means that there are the more basis matrices that can be used to model the speech and noise… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 29 publications
0
5
0
Order By: Relevance
“…Unlike SC, mixture models (such as PMMs) assume a single latent per data point for data generation, i.e., data is modeled in terms of a single compositional feature rather than a combination of multiple features. PMMs assume the observable data to be subject to Poisson distributed noise, and they have been applied in the context of image and audio signal processing applications before (e.g., [81][82][83]). The here used PMM-based denoising approach is described in more detail in S1 Text.…”
Section: Algorithmsmentioning
confidence: 99%
“…Unlike SC, mixture models (such as PMMs) assume a single latent per data point for data generation, i.e., data is modeled in terms of a single compositional feature rather than a combination of multiple features. PMMs assume the observable data to be subject to Poisson distributed noise, and they have been applied in the context of image and audio signal processing applications before (e.g., [81][82][83]). The here used PMM-based denoising approach is described in more detail in S1 Text.…”
Section: Algorithmsmentioning
confidence: 99%
“…In this paper, we propose a novel NMF-HMM speech enhancement method based on the Kullback-Leibler (KL) divergence, expanding on our preliminary work [47]. Our preliminary work has briefly verified the effectiveness of an NMF-HMM for speech enhancement [47,48], but the effect of the parameters for the model was not considered. This is very important to optimize the algorithm performance.…”
Section: Open Accessmentioning
confidence: 99%
“…To summarize, the proposed β-PVAE includes a training and an enhancement stage for the SE application, which is similar to PVAE [26]. In the training stage, C-VAE and N-VAE are separately pre-trained by self-supervision using (4) and (5). After that, we apply (8) to train NS-VAE.…”
Section: β-Vae-based Speech Enhancementmentioning
confidence: 99%
“…During the past decades, many single-channel SE algorithms have been developed, including signal subspace methods [4], non-negative matrix factorization methods [5], [6], and codebook-based methods [7]. In recent years, deep neural networks (DNN) have shown great potential for SE [2], [8]- [14] because DNNs can use a non-linear process to model complex high-dimensional signals, which is more reasonable in practical applications [15].…”
Section: Introductionmentioning
confidence: 99%