Context-driven Multi-stream LSTM (M-LSTM) for Recognizing Fine-Grained Activity of Drivers

Behera, Ardhendu; Keidel, Alexander; Debnath, Bappaditya

doi:10.1007/978-3-030-12939-2_21

Cited by 18 publications

(27 citation statements)

References 35 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…These models have the ability to long-term dependencies by incorporating memory units. These memory units allow the network to learn, forget previously hidden states, and update hidden states (Behera et al 2018). Figure 7b depicts the general arrangement of an LSTM memory cell.…”

Section: Dnn With Long Short-term Memory (Lstm) Layersmentioning

confidence: 99%

“…where w x ; b x ; ; i t ; j t ; f t ; o t are weight matrices, biases, element-wise vector product, input gate contributing to Connection solar panels to the weatherboard memory, input moderation gate contributing to memory, forget gate, and output gate as a multiplier between memory gates, respectively. The c t and h t are the two types of hidden layers to allow the LSTM to make complex decisions over a short period of time (Behera et al 2018;Jozefowicz et al 2015). The i t and f t gates are switching each other to selectively consider the current inputs or forget its previous memory.…”

Section: Dnn With Long Short-term Memory (Lstm) Layersmentioning

confidence: 99%

See 1 more Smart Citation

Temporal convolutional neural (TCN) network for an effective weather forecasting using time-series data from the local weather station

et al. 2020

Self Cite

View full text Add to dashboard Cite

Non-predictive or inaccurate weather forecasting can severely impact the community of users such as farmers. Numerical weather prediction models run in major weather forecasting centers with several supercomputers to solve simultaneous complex nonlinear mathematical equations. Such models provide the medium-range weather forecasts, i.e., every 6 h up to 18 h with grid length of 10-20 km. However, farmers often depend on more detailed short-to medium-range forecasts with higher-resolution regional forecasting models. Therefore, this research aims to address this by developing and evaluating a lightweight and novel weather forecasting system, which consists of one or more local weather stations and state-of-the-art machine learning techniques for weather forecasting using time-series data from these weather stations. To this end, the system explores the state-of-the-art temporal convolutional network (TCN) and long short-term memory (LSTM) networks. Our experimental results show that the proposed model using TCN produces better forecasting compared to the LSTM and other classic machine learning approaches. The proposed model can be used as an efficient localized weather forecasting tool for the community of users, and it could be run on a stand-alone personal computer.Soft Computing https://doi.org/10.1007/s00500-020-04954-0( 0123456789().,-volV) (0123456789(). ,-volV)Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. P. Hewage et al.

show abstract

Section: Dnn With Long Short-term Memory (Lstm) Layersmentioning

confidence: 99%

Section: Dnn With Long Short-term Memory (Lstm) Layersmentioning

confidence: 99%

Temporal convolutional neural (TCN) network for an effective weather forecasting using time-series data from the local weather station

et al. 2020

Self Cite

View full text Add to dashboard Cite

show abstract

“…Martin et al [17] advocate a method to combine multiple streams involving body pose and contextual information in videos to recognise driver's activities. Similarly, Behera et al [13] describe a multi-stream LSTM for recognising driver's activities by combining high-level body pose and body-object interaction with CNN features. These models [16], [15], [17], [13] are similar to video classification methods, which require complete observation and is unsuitable for live activity recognition.…”

Section: Related Workmentioning

confidence: 99%

“…Similarly, Behera et al [13] describe a multi-stream LSTM for recognising driver's activities by combining high-level body pose and body-object interaction with CNN features. These models [16], [15], [17], [13] are similar to video classification methods, which require complete observation and is unsuitable for live activity recognition. Simialrly Alotaibi and Alotaibi [19] describe an approach that combines the inception module with a residual block and a hierarchical recurrent neural network to enhance the recognition performance of the distracted behaviours of drivers.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Deep CNN, Body Pose, and Body-Object Interaction Features for Drivers’ Activity Monitoring

Behera

Wharton

Keidel

et al. 2022

IEEE Trans. Intell. Transport. Syst.

Self Cite

View full text Add to dashboard Cite

Automatic recognition and prediction of in-vehicle human activities has a significant impact on the next generation of driver assistance and intelligent autonomous vehicles. In this paper, we present a novel single image driver action recognition algorithm inspired by human perception that often focuses selectively on parts of the images to acquire information at specific places which are distinct to a given task. Unlike existing approaches, we argue that human activity is a combination of pose and semantic contextual cues. In detail, we model this by considering the configuration of body joints, their interaction with objects being represented as a pairwise relation to capture the structural information. Our body-pose and bodyobject interaction representation is built to be semantically rich and meaningful, and is highly discriminative even though it is coupled with a basic linear SVM classifier. We also propose a Multi-stream Deep Fusion Network (MDFN) for combining highlevel semantics with CNN features. Our experimental results demonstrate that the proposed approach significantly improves the drivers' action recognition accuracy on two exacting datasets. Index Terms-Transfer learning, intelligent vehicles, in-vehicle activity monitoring, deep learning, body pose and contextual descriptor, neural network-based fusion.

show abstract

Nighttime Traffic Sign and Pedestrian Detection Using RefineDet with Time‐Series Information

Yamamoto

Sultana

Ohashi

2022

IEEJ Transactions Elec Engng

View full text Add to dashboard Cite

Object detection is one of the most important tasks in computer vision-based automation, such as advanced driver assistance systems in driving automation. It is preferable to detect traffic-related objects at a far distance that appear small in the recorded scene in order to ensure maximum road safety while driving. As drivers tend to miss more traffic-related objects at nighttime driving, this work focuses on nighttime in-vehicle camera images. Because videos were recorded using an in-vehicle camera, objects to be detected in this study, such as traffic signs and pedestrians, occupy a small size in the frame when far away from the own vehicle. Furthermore, it is necessary to take into account time-series information to detect objects in sequential frames. Therefore, this research proposes an object detection model that combines the RefineDet small object detection model and the TSSD video detection model. Experimental results confirm the effectiveness of the proposed model. Moreover, a publicly available benchmark dataset is used to confirm the performance of the proposed model regardless of daytime or nighttime images.

show abstract

Context-driven Multi-stream LSTM (M-LSTM) for Recognizing Fine-Grained Activity of Drivers

Cited by 18 publications

References 35 publications

Temporal convolutional neural (TCN) network for an effective weather forecasting using time-series data from the local weather station

Temporal convolutional neural (TCN) network for an effective weather forecasting using time-series data from the local weather station

Deep CNN, Body Pose, and Body-Object Interaction Features for Drivers’ Activity Monitoring

Nighttime Traffic Sign and Pedestrian Detection Using RefineDet with Time‐Series Information

Contact Info

Product

Resources

About