This project aims to align facial and vocal characteristics in a shared latent space by constructing multi-modal generative adversarial networks (GANs). We propose a visually grounded multi-modal approach that uses the Graph Cut algorithm to align feature components with the image features of their corresponding local contexts, making the model adaptive to multi-modal information. To improve both the speed and the accuracy of modeling, a regional attention strategy is integrated. Experimental results show that the proposed algorithm improves accuracy on image recognition tasks.
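To make the shared-space idea concrete, the sketch below projects face and voice features into a common embedding space and scores their alignment with a cosine-based loss. All specifics here are illustrative assumptions, not the paper's architecture: the dimensions (512-d face, 128-d voice, 64-d shared), the linear encoders, and the loss are stand-ins for the learned GAN components.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical feature dimensions (assumptions, not from the paper).
FACE_DIM, VOICE_DIM, SHARED_DIM = 512, 128, 64

# Linear "encoders" standing in for the learned modality networks.
W_face = rng.standard_normal((FACE_DIM, SHARED_DIM)) * 0.01
W_voice = rng.standard_normal((VOICE_DIM, SHARED_DIM)) * 0.01

def embed(x, W):
    """Project modality features into the shared space and L2-normalize."""
    z = x @ W
    return z / np.linalg.norm(z, axis=1, keepdims=True)

def alignment_loss(z_face, z_voice):
    """Mean (1 - cosine similarity) over paired face/voice embeddings.

    Lower values mean the two modalities of each sample point in more
    similar directions in the shared space.
    """
    return float(np.mean(1.0 - np.sum(z_face * z_voice, axis=1)))

# Toy batch of 8 paired samples (random stand-ins for extracted features).
face = rng.standard_normal((8, FACE_DIM))
voice = rng.standard_normal((8, VOICE_DIM))

z_f = embed(face, W_face)
z_v = embed(voice, W_voice)
loss = alignment_loss(z_f, z_v)
```

In a full system, the encoders would be trained adversarially so that embeddings of paired face/voice samples become indistinguishable to a discriminator, driving a loss of this kind toward zero.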