2020
DOI: 10.17485/ijst/v13i31.1118
|View full text |Cite
|
Sign up to set email alerts
|

Real-time video based emotion recognition using convolutional neural network and transfer learning

Abstract: Background/Objectives: The deep learning approaches have paved their way to construct various artificial intelligence products and the proposed system uses a convolutional neural network for detecting real-time emotions of mankind. The objective of the study is to develop a real-time application for emotion recognition using convolutional neural network and transfer learning methods. Methods/Statistical analysis: The proposed system considers happy, normal and surprised categories of emotions. The system consi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
5
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 17 publications
0
5
0
Order By: Relevance
“…When comparing samples, the source domain sample γ s and the target domain sample γ t can be mapped to the corresponding subspace respectively, and then, the subspace can be aligned by the transformation matrix M ∗ , and the similarity function can be defined [ 13 , 14 ]: where A = X S X S T X t X t T represents the importance of each part of the feature vector in the original space. Sim( γ s , γ t ) compares the similarity between the source domain sample γ s and the target domain sample γ t on the aligned subspace, so the K -nearest neighbor algorithm can be used directly for classification [ 15 ]. In addition, it is also possible to map the source domain sample γ s from X a to the aligned subspace and map the target domain sample γ t to its corresponding subspace through X t and then use the SVM algorithm for classification [ 16 ].…”
Section: Recognition and Prediction Of Premature Ovarian Failure By U...mentioning
confidence: 99%
“…When comparing samples, the source domain sample γ s and the target domain sample γ t can be mapped to the corresponding subspace respectively, and then, the subspace can be aligned by the transformation matrix M ∗ , and the similarity function can be defined [ 13 , 14 ]: where A = X S X S T X t X t T represents the importance of each part of the feature vector in the original space. Sim( γ s , γ t ) compares the similarity between the source domain sample γ s and the target domain sample γ t on the aligned subspace, so the K -nearest neighbor algorithm can be used directly for classification [ 15 ]. In addition, it is also possible to map the source domain sample γ s from X a to the aligned subspace and map the target domain sample γ t to its corresponding subspace through X t and then use the SVM algorithm for classification [ 16 ].…”
Section: Recognition and Prediction Of Premature Ovarian Failure By U...mentioning
confidence: 99%
“…Convolution Neural Network (CNN) is commonly applied in areas related to analyzing visual images, such as object detection, image recognition and classification, and facial recognition. CNN contains convolution layers that extract the input's significant features while preserving the relationship between the 2D spatial domain and extracted features, and has been used for emotion recognition [53,55]. We can also convert temporal data into two-dimensions of time-frequency data, then use CNN to find the relationship between the time domain and the frequency/spatial domain to determine the corresponding emotion.…”
Section: Emotion Recognition Methodsmentioning
confidence: 99%
“…Pre-trained CNN Models are usually better in retrieving meaningful generic features, especially from images. The VGG neural network (VGG16, VGG19) [53] has been widely used in image classification and extracts the image data features for emotion recognition [54,55]. Even though the VGG neural network is pre-trained for object classification of various objects rather than human images, since the ImageNet contains vast data samples, the convolution filters have been trained to extract the key features of the images of faces.…”
Section: B Feature Extraction Techniquesmentioning
confidence: 99%
“…According to Palanivel S. and Sujanaa J. [10] a straightforward preprocessing phase where the RGB photos are changed into grayscale images is first applied to the photographs. The input frames were pre-processed using the Scikit-library.…”
Section: Literature Review a Data Preprocessingmentioning
confidence: 99%
“…As a result, the positions of the face, left and right eyes, nose, and the left and right corners of the mouth can thus be obtained. Using the cascade classifier, an algorithm devised by Paul Viola and Michael Jones that employs machine learning techniques, [10] note that these cascade classifiers are trained with samples containing facial and non-facial images. With the aid of the Haar-based cascade classifier, the mouth and face of each video frame is detected.…”
Section: B Feature Extractionmentioning
confidence: 99%