In today’s scientific epoch, speech is an important means of communication. Speech enhancement is necessary for increasing the quality of speech. However, the presence of noise signals can corrupt speech signals. Thereby, this work intends to propose a new speech enhancement framework that includes (a) training phase and (b) testing phase. The input signal is first given to STFT-based noise estimate and NMF-based spectra estimate during the training phase in order to compute the noise spectra and signal spectra, respectively. The obtained signal spectra and noise spectra are then Wiener-filtered, then empirical mean decomposition (EMD) is used. Because the tuning factor of Wiener filters is so important, it should be computed for each signal by coaching in a fuzzy wavelet neural network (FW-NN). Subsequently, a bark frequency is computed from the denoised signal, which is then subjected to FW-NN to identify the suitable tuning factor for all input signals in the Wiener filter. For optimal tuning of [Formula: see text], this work deploys the fitness-oriented elephant herding optimization (FO-EHO) algorithm. Additionally, an adaptive Wiener filter is used to supply EMD with the ideal tuning factor from FW-NN, producing an improved speech signal. At last, the presented approach’s supremacy is proved with varied metrics.
Speech signal enhancement is a subject of study in which a large number of researchers are working to improve the quality and perceptibility of speech signals. In the existing Kalman Filter method, the short-time magnitude or power spectrum due to random variations of noise was a serious problem and the signal-to-noise ratio was very low. This issue severely reduced the perceived qualityand intelligibility of enhanced speech. Thus, this paper intent to develop an improved speech enhancement model and it includes“training phase and testing phase”. In the training phase, the input noise corrupted signal is initially fed as input to both STFT-based noise estimation and NMF-based spectrum estimation forestimating the noise spectrum and signal spectrum, respectively. The obtained noise spectrum and the signal spectrum are fed as input to the Wiener filter and these filtered signals are subjected to Empirical Mean Decomposition (EMD).Since, tuning factor η plays a key role in Wiener filter, it has to be determined for each signal and from the denoised signal the bark frequency is evaluated. The computed bark frequency is fed as input to the learning algorithm referred as Fuzzy Wavelet Neural Network (FW-NN)for detecting the suited tuning factor η for the entire input signal in Wiener filter.An Adaptive Randomized Grey Wolf Optimization (AR-GWO) is proposed for proper tuning of the tuning factor η referred as tuned tuning factor ( η tuned ). The proposed AR-GWO is the improved version of standard Grey wolf optimization (GWO). In the testing phase, the training is accomplished initially and from which the tuning factor is gathered for each of the relevant input signal. Then, the properly tuned tuning factor ( η tuned ) from FW-NN is fed as input to EMD via adaptive wiener filter for decomposing the spectral signal and the output of EMD is denoised enhanced speech signal. At last, the performance of the adopted approach is evaluated to the existing approaches in terms of various metrics. In particular, the computation time of the adopted AR-GWO model is 34.07%, 43.57%, 28.86%, 38.88%, and 16.03% better than the existing GA, ABC, PSO, FF, and GWO approaches respectively.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.