2022
DOI: 10.3390/ijerph191811819
|View full text |Cite
|
Sign up to set email alerts
|

Spatiotemporal Feature Enhancement Aids the Driving Intention Inference of Intelligent Vehicles

Abstract: In order that fully self-driving vehicles can be realized, it is believed that systems where the driver shares control and authority with the intelligent vehicle offer the most effective solution. An understanding of driving intention is the key to building a collaborative autonomous driving system. In this study, the proposed method incorporates the spatiotemporal features of driver behavior and forward-facing traffic scenes through a feature extraction module; the joint representation was input into an infer… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 46 publications
(62 reference statements)
0
8
0
Order By: Relevance
“…Some researchers have utilized 3D Conv to model the temporal information of video frame sequences. Chen et al proposed a two-stream structure based on a deep three-dimensional CNN [16], while Rong et al used 3D-ResNet to extract video spatiotemporal features [10]. Recent studies have demonstrated that CNN-based algorithms achieve state-of-the-art (SOTA) performance on datasets such as Brain4Car [1].…”
Section: Driver Intention Prediction By Cnnmentioning
confidence: 99%
See 4 more Smart Citations
“…Some researchers have utilized 3D Conv to model the temporal information of video frame sequences. Chen et al proposed a two-stream structure based on a deep three-dimensional CNN [16], while Rong et al used 3D-ResNet to extract video spatiotemporal features [10]. Recent studies have demonstrated that CNN-based algorithms achieve state-of-the-art (SOTA) performance on datasets such as Brain4Car [1].…”
Section: Driver Intention Prediction By Cnnmentioning
confidence: 99%
“…To address this, many research teams have emphasized the importance of cross-modal information interaction and designed effective multi-modal fusion methods. However, these methods typically focus on multimodal fusion in a single dimension (i.e., either feature extractor or classifier) [1,10,[16][17][18], and often overlook the potential benefits of incorporating GPS information. This is a noteworthy limitation in the field that warrants further exploration.…”
Section: Cross-modal Information Interactionmentioning
confidence: 99%
See 3 more Smart Citations