Adversarial Domain Generalized Transformer for Cross-Corpus Speech Emotion Recognition

Gao, Yuan; Wang, Longbiao; Liu, Jiaxing; Dang, Jianwu; Okada, Shogo

doi:10.1109/taffc.2023.3290795

Cited by 2 publications

(1 citation statement)

References 59 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Speech processing is focused on enabling machines to understand and interpret human speech with the ultimate objective of creating systems that facilitate natural and intuitive interaction between humans and machines (Hickok and Poeppel 2007). AEs have found numerous applications in speech processing, especially in speech denoising (Bhangale and Kothandaraman 2022;Tanveer et al 2023), speech recognition (Kumar et al 2022;Sayed et al 2023), speech representation (Alex and Mary 2023;Seki et al 2023), speech compression (Li et al 2021;Srikotr 2022), feature representation (Shixin et al 2022;Tian et al 2022), and speech emotion recognition (Dutt and Gader 2023;Gao et al 2023).…”

Section: Speech Processingmentioning

confidence: 99%

Autoencoders and their applications in machine learning: a survey

Berahmand,

Daneshfar,

Salehi

et al. 2024

Artif Intell Rev

View full text Add to dashboard Cite

Autoencoders have become a hot researched topic in unsupervised learning due to their ability to learn data features and act as a dimensionality reduction method. With rapid evolution of autoencoder methods, there has yet to be a complete study that provides a full autoencoders roadmap for both stimulating technical improvements and orienting research newbies to autoencoders. In this paper, we present a comprehensive survey of autoencoders, starting with an explanation of the principle of conventional autoencoder and their primary development process. We then provide a taxonomy of autoencoders based on their structures and principles and thoroughly analyze and discuss the related models. Furthermore, we review the applications of autoencoders in various fields, including machine vision, natural language processing, complex network, recommender system, speech process, anomaly detection, and others. Lastly, we summarize the limitations of current autoencoder algorithms and discuss the future directions of the field.

show abstract