In recent years, there is a growing interest in speech emotion recognition (SER) by analyzing input speech. SER can be considered as simply pattern recognition task which includes features extraction, classifier, and speech emotion database. The objective of this paper is to provide a comprehensive review on various literature available on SER. Several audio features are available, including linear predictive coding coefficients (LPCC), Melfrequency cepstral coefficients (MFCC), and Teager energy based features. While for classifier, many algorithms are available including hidden Markov model (HMM), Gaussian mixture model (GMM), vector quantization (VQ), artificial neural networks (ANN), and deep neural networks (DNN). In this paper, we also reviewed various speech emotion database. Finally, recent related works on SER using DNN will be discussed.
This paper presents a gravity optimised particle filter (GOPF) where the magnitude of the gravitational force for every particle is proportional to its weight. GOPF attracts nearby particles and replicates new particles as if moving the particles towards the peak of the likelihood distribution, improving the sampling efficiency. GOPF is incorporated into a technique for hand features tracking. A fast approach to hand features detection and labelling using convexity defects is also presented. Experimental results show GOPF outperforms the standard particle filter and its variants, as well as state-of-the-art Camshift guided particle filter using a significantly reduced number of particles.
<span>Augmented reality (AR) applications have become increasingly ubiquitous as it integrates virtual information such as images, 3D objects, video, and more to the real world, which further enhances the real environment. Many researchers have investigated the augmentation of the 3D object on the digital screen. However, certain loopholes exist in the existing system while estimating the object’s pose, making it inaccurate for mobile augmented reality (MAR) applications. Objects augmented in the current system have much jitter due to frame illumination changes, affecting the accuracy of vision-based pose estimation. This paper proposes to estimate the pose of an object by blending both vision-based techniques and micro electrical mechanical system (MEMS) sensor (gyroscope) to minimize the jitter problem in MAR. The algorithm used for feature detection and description is oriented FAST rotated BRIEF (ORB), whereas to evaluate the homography for pose estimation, <a name="_Hlk100294718"></a>random sample consensus (RANSAC) is used. Furthermore, gyroscope sensor data is incorporated with the vision-based pose estimation. We evaluated the performance of augmenting the 3D object using the techniques, vision-based, and incorporating the sensor data using the video data. After extensive experiments, the validity of the proposed method was superior to the existing vision-based pose estimation algorithms.</span>
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.