2020
DOI: 10.1007/s10772-020-09674-2
|View full text |Cite
|
Sign up to set email alerts
|

Fundamentals, present and future perspectives of speech enhancement

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
5

Citation Types

0
14
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 57 publications
(25 citation statements)
references
References 141 publications
0
14
0
1
Order By: Relevance
“…The most cited methods are the Cepstral Mean Subtraction (CMS) [18], the Power-Normalized Cepstral Coefficients (PNCCs) [19], and the Cepstral Mean Normalization (CMN) [20] which is a popular feature compensation method dealing with convolutional noise. In this same context, the majority of the published works demonstrated that the wavelet-based feature extraction [21][22][23][24] has better performance improvement than traditional Cepstral features in noisy environments. The already presented wavelet-based techniques rely on the multiresolution PWP properties and combine the extracted MFCC features from various frequency sub bands to a unique feature vector.…”
Section: Introductionmentioning
confidence: 97%
“…The most cited methods are the Cepstral Mean Subtraction (CMS) [18], the Power-Normalized Cepstral Coefficients (PNCCs) [19], and the Cepstral Mean Normalization (CMN) [20] which is a popular feature compensation method dealing with convolutional noise. In this same context, the majority of the published works demonstrated that the wavelet-based feature extraction [21][22][23][24] has better performance improvement than traditional Cepstral features in noisy environments. The already presented wavelet-based techniques rely on the multiresolution PWP properties and combine the extracted MFCC features from various frequency sub bands to a unique feature vector.…”
Section: Introductionmentioning
confidence: 97%
“…Undoubtedly, the popularity of DL in image processing, computer vision, and natural language processing has led to significant impact in fields closely related to spatial audio, including speech enhancement or music information retrieval [15,16]. While ML algorithms have already positioned themselves at the top of the state of the art within the aforementioned fields, their use in immersive spatial audio is only emerging, as it will be illustrated throughout this review.…”
Section: Introductionmentioning
confidence: 99%
“…Nevertheless, the vast majority of these apps are not resilient when dealing with interference. As a result, speech enhancement (SE) [ 1 , 2 , 3 , 4 ], a technique that tries to enhance the intelligibility and quality of the original speech signals, has seen widespread use in the context of these applications. Over the last few years, deep learning techniques have seen an increased amount of use when it comes to the construction of SE systems.…”
Section: Introductionmentioning
confidence: 99%