In the last few years, deep learning has led to very good performance on a variety of problems, such as visual recognition, speech recognition and natural language processing. Among different types of deep neural networks, convolutional neural networks have been most extensively studied. Leveraging on the rapid growth in the amount of the annotated data and the great improvements in the strengths of graphics processor units, the research on convolutional neural networks has been emerged swiftly and achieved stateof-the-art results on various tasks. In this paper, we provide a broad survey of the recent advances in convolutional neural networks. We detailize the improvements of CNN on different aspects, including layer design, activation function, loss function, regularization, optimization and fast computation. Besides, we also introduce various applications of convolutional neural networks in computer vision, speech and natural language processing.
Abstract-We propose a novel approach for predicting on-going action with the assistance of a low-cost depth camera. Our approach introduces a soft regression-based early prediction framework. In this framework, we estimate soft labels for the subsequences at different progress levels, jointly learned with an action predictor. Our formulation of soft regression framework 1) overcomes a usual assumption in existing early action prediction systems that the progress level of on-going sequence is given in the testing stage; and 2) presents a theoretical framework to better resolve the ambiguity and uncertainty of subsequences at early performing stage. The proposed soft regression framework is further enhanced in order to take the relationships among subsequences and the discrepancy of soft labels over different classes into consideration, so that a Multiple Soft labels Recurrent Neural Network (MSRNN) is finally developed. For real-time performance, we also introduce a new RGB-D feature called "local accumulative frame feature (LAFF)", which can be computed efficiently by constructing an integral feature map. Our experiments on three RGB-D benchmark datasets and an unconstrained RGB action set demonstrate that the proposed regression-based early action prediction model outperforms existing models significantly and also show that the early action prediction on RGB-D sequence is more accurate than that on RGB channel.
The 4D trajectory is a multi-dimensional time series with plentiful spatial-temporal features and has a high degree of complexity and uncertainty. Aiming at these features of aircraft flight trajectory and the problem that it is difficult for existing trajectory prediction methods to extract spatial-temporal features from the trajectory data at the same time, we propose a novel 4D trajectory prediction hybrid architecture based on deep learning, which combined Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM). An 1D convolution is used to extract the spatial dimension feature of the trajectory, and LSTM is used to mine the temporal dimension feature of the trajectory. Hence the highprecision prediction of the 4D trajectory is realized based on the sufficient fusion of the above features. We use real Automatic Dependent Surveillance-Broadcast (ADS-B) historical trajectory data for experiments and compare the proposed method with a single LSTM model and BP model on the same data set. The experimental results show that the trajectory prediction accuracy of the CNN-LSTM hybrid model is superior to a single model. The prediction error is reduced by an average of 21.62% compared to the LSTM model and by an average of 52.45%compared to the BP model. It provides a certain reference for the trajectory prediction research and Air Traffic Management decision-making. INDEX TERMS 4D trajectory prediction, deep learning, CNN-LSTM model, spatial-temporal feature Ma Lan (1966-) Female; Born in Huocheng, Xinjiag Province. Professor. Her main research direction is Air Traffic Information Process, Air Traffic Management Optimization. Shan Tian, born in 1996. Her research interests include data mining and deep learning with recent applications on aircraft 4D trajectory prediction.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.