Longxi Chen scite author profile

Longxi Chen

5 Publications

17 Citation Statements Received

464 Citation Statements Given

Affiliations

Shandong Youth University of Political Science

Publications

Multi-Modal Fusion Emotion Recognition Method of Speech Expression Based on Deep Learning

Liu, Wang, et al. 2021. Front. Neurorobot.

Redundant information and noisy data generated during single-modal feature extraction make it difficult for traditional learning algorithms to achieve ideal recognition performance. A multi-modal fusion emotion recognition method for speech expressions based on deep learning is proposed. First, a feature extraction method is set up for each single modality: speech uses a convolutional neural network-long short-term memory (CNN-LSTM) network, and facial expressions in video use the Inception-ResNet-v2 network. Then, long short-term memory (LSTM) is used to capture the correlations between and within modalities. After feature selection with the chi-square test, the single-modality features are concatenated into a unified fusion feature. Finally, the fused features output by the LSTM are fed to the LIBSVM classifier for the final emotion recognition. Experimental results show that the proposed method reaches recognition accuracies of 87.56% and 90.06% on the MOSI and MELD datasets, respectively, outperforming the comparison methods. The work lays a theoretical foundation for applying multimodal fusion to emotion recognition.
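The chi-square selection and concatenation steps mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`chi2_scores`, `select_top_k`, `fuse`), the top-k selection rule, and the toy inputs are all assumptions made for illustration, and the LSTM and LIBSVM stages are omitted. The score follows the common formulation for non-negative features, comparing each feature's per-class sum with the sum expected if the feature were independent of the class.

```python
from collections import defaultdict

def chi2_scores(X, y):
    """Chi-square score of each non-negative feature column against labels y."""
    n, d = len(X), len(X[0])
    totals = [sum(row[j] for row in X) for j in range(d)]
    observed = defaultdict(lambda: [0.0] * d)  # per-class feature sums
    counts = defaultdict(int)                  # samples per class
    for row, label in zip(X, y):
        counts[label] += 1
        for j, value in enumerate(row):
            observed[label][j] += value
    scores = []
    for j in range(d):
        score = 0.0
        for label, count in counts.items():
            expected = totals[j] * count / n
            if expected > 0:
                score += (observed[label][j] - expected) ** 2 / expected
        scores.append(score)
    return scores

def select_top_k(X, y, k):
    """Keep the k highest-scoring columns, preserving their original order."""
    scores = chi2_scores(X, y)
    keep = sorted(sorted(range(len(scores)), key=lambda j: -scores[j])[:k])
    return [[row[j] for j in keep] for row in X]

def fuse(audio, video, y, k_audio, k_video):
    """Select features per modality, then concatenate them sample by sample."""
    a = select_top_k(audio, y, k_audio)
    v = select_top_k(video, y, k_video)
    return [ra + rv for ra, rv in zip(a, v)]
```

In this sketch each modality is filtered independently before concatenation, so an uninformative column in one modality cannot crowd out informative columns in the other; whether the paper selects per modality or jointly is not stated in the abstract.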

Recognition of Audio Depression Based on Convolutional Neural Network and Generative Antagonism Network Model

et al. 2020

This paper proposes an audio depression recognition method based on a convolutional neural network (CNN) and a generative antagonism network (GAN) model. First, the dataset is preprocessed: long-term silent segments are removed and the remaining segments are spliced into a new audio file. Next, speech features such as Mel-scale Frequency Cepstral Coefficients (MFCCs), short-term energy, and spectral entropy are extracted using an audio difference normalization algorithm. The extracted feature matrices, which represent the unique attributes of each subject's voice, form the data for model training. Then, DR AudioNet, which combines a CNN and a GAN, is used to build the depression recognition model. With DR AudioNet, the former model is optimized and classification is completed using the normalized features of the two adjacent segments before and after the current audio segment. Experimental results on the AViD-Corpus and DAIC-WOZ datasets show that the proposed method effectively reduces depression recognition error compared with existing methods, with RMSE and MAE values on both datasets better than those of the comparison algorithm by more than 5%.

Index Terms: Recognition of audio depression; generative antagonism network; convolutional neural network; Mel-scale Frequency Cepstral Coefficients; entropy feature of spectrogram
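The preprocessing step in the abstract above (dropping long silent stretches and splicing the remainder) can be sketched as follows. The abstract does not say how silence is detected, so this sketch assumes an energy threshold: a frame counts as silent when its mean squared amplitude, a normalized form of short-term energy, falls below `threshold`. The function names and all parameters are hypothetical.

```python
def short_term_energy(frame):
    """Mean squared amplitude of one frame of samples."""
    return sum(s * s for s in frame) / len(frame)

def remove_long_silences(signal, frame_len, threshold, min_silent_frames):
    """Drop runs of at least min_silent_frames low-energy frames; splice the rest."""
    frames = [signal[i:i + frame_len] for i in range(0, len(signal), frame_len)]
    silent = [short_term_energy(f) < threshold for f in frames]
    kept = []
    i = 0
    while i < len(frames):
        if silent[i]:
            # Find the end of this run of silent frames.
            j = i
            while j < len(frames) and silent[j]:
                j += 1
            if j - i < min_silent_frames:
                # Short pause: keep it so natural speech rhythm survives.
                for f in frames[i:j]:
                    kept.extend(f)
            i = j
        else:
            kept.extend(frames[i])
            i += 1
    return kept
```

Keeping short pauses while removing long ones is a design choice of this sketch; it reflects the abstract's wording that only long-term silent segments are removed.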

Speech Expression Multimodal Emotion Recognition Based on Deep Belief Network

et al. 2021

Novel multi‐scale deep residual attention network for facial expression recognition

Dong, Wang, et al. 2020. J. eng.

A multi-modal emotion fusion classification method combined expression and speech based on attention mechanism

Liu, Chen, Wang, et al. 2021. Multimed Tools Appl

