Geometry-based next frame prediction from monocular video

Mahjourian, Reza; Wicke, Martin; Angelova, Anelia

doi:10.1109/ivs.2017.7995953

Cited by 27 publications

(24 citation statements)

References 28 publications

“…In some earlier studies [23][24][25], a next frame was predicted using the current frame and previous sequential frames. A dual-motion GAN model (ConvLSTMGAN) was proposed [23], and image prediction was performed using a visible light image.…”

Section: Prediction Of Next Frame (mentioning)

confidence: 99%

“…In this method, the proposed network was trained in a hybrid way using real and synthetic videos. In [25], a method for generating the next frame using a visible light image and ConvLSTM was proposed. In this study, the depth image is predicted using a current image and camera The remainder of this study is organized as follows.…”

Section: Prediction Of Next Frame (mentioning)

confidence: 99%

Enlargement of the Field of View Based on Image Region Prediction Using Thermal Videos

2021

Various studies have been conducted on detecting humans in images. However, there are cases where part of a human body disappears from the input image as it leaves the camera field of view (FOV), and cases where a pedestrian enters the FOV with part of the body slowly appearing. In these cases, existing methods fail at human detection and tracking. Therefore, we propose a method for predicting a wider region than the FOV of a thermal camera based on the image prediction generative adversarial network version 2 (IPGAN-2). In an experiment using the marathon subdataset of the Boston University-thermal infrared video benchmark open dataset, the proposed method showed higher image prediction (structural similarity index measure (SSIM) of 0.9437) and object detection (F1 score of 0.866, accuracy of 0.914, and intersection over union (IoU) of 0.730) accuracies than state-of-the-art methods.

“…Structured random forests [28], feed-forward CNNs [29] and variational autoencoders [30] have all been used to predict dense pixel trajectories from single frames. Prediction of raw pixels, as opposed to pixel trajectories, has been attempted using ordinary feedforward networks, GANs [31], [32] and RNNs [33], [34].…”

Section: Background and Related Work (mentioning)

confidence: 99%

Human Pose Forecasting via Deep Markov Models

Toyer, Cherian, Han, et al. 2017

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

Human pose forecasting is an important problem in computer vision with applications to human-robot interaction, visual surveillance, and autonomous driving. Usually, forecasting algorithms use 3D skeleton sequences and are trained to forecast for a few milliseconds into the future. Long-range forecasting is challenging due to the difficulty of estimating how long a person continues an activity. To this end, our contributions are threefold: (i) we propose a generative framework for poses using variational autoencoders based on Deep Markov Models (DMMs); (ii) we evaluate our pose forecasts using a pose-based action classifier, which we argue better reflects the subjective quality of pose forecasts than distance in coordinate space; (iii) last, for evaluation of the new model, we introduce a 480,000-frame video dataset called Ikea Furniture Assembly (Ikea FA), which depicts humans repeatedly assembling and disassembling furniture. We demonstrate promising results for our approach on both Ikea FA and the existing NTU RGB+D dataset.

“…LSTM is a widely applicable kind of RNN which contains feedback connections for both single data points and entire data sequences in deep learning [50]. The optimization task regarding accurate future image prediction has been a highlighted problem in artificial intelligence in recent several years [51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67]. Kalchbrenner et al. have developed a video pixel network to predict the joint distribution of future image in pixel videos [60].…”

Section: Introduction (mentioning)

confidence: 99%

Intelligent Calibration of Static FEA Computations Based on Terrestrial Laser Scanning Reference

Bao, Chen, et al. 2020

Sensors

The demand for efficient and accurate finite element analysis (FEA) is becoming more prevalent with the increase in advanced calibration technologies and sensor-based monitoring methods. The current research explores a deep learning-based methodology to calibrate FEA results. The utilization of monitoring reference results from measurements, e.g., terrestrial laser scanning, can help to capture the actual features in the static loading process. We learn the deviation sequence results between the standard FEA computations with the simplified geometry and refined reference values by the long short-term memory method. The complex changing principles in different deviations are trained and captured effectively in the training process of deep learning. Hence, we generate the FEA sequence results corresponding to next adjacent loading steps. The final FEA computations are calibrated by the threshold control. The calibration reduces the mean square errors of the FEA future sequence results significantly. This strengthens the calibration depth. Consequently, the calibration of FEA computations with deep learning can play a helpful role in the prediction and monitoring problems regarding the future structural behaviors.
