2021
DOI: 10.3390/s21196368
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic Hand Gesture Recognition in In-Vehicle Environment Based on FMCW Radar and Transformer

Abstract: Hand gesture recognition technology plays an important role in human-computer interaction and in-vehicle entertainment. Under in-vehicle conditions, it is a great challenge to design gesture recognition systems due to variable driving conditions, complex backgrounds, and diversified gestures. In this paper, we propose a gesture recognition system based on frequency-modulated continuous-wave (FMCW) radar and transformer for an in-vehicle environment. Firstly, the original range-Doppler maps (RDMs), range-azimut… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
21
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1

Relationship

0
7

Authors

Journals

citations
Cited by 22 publications
(21 citation statements)
references
References 36 publications
0
21
0
Order By: Relevance
“…Figure 4 a shows this method. One other novel result for recognizing single gesture motion is from [ 13 ], where a Transformer-encoder architecture is used by learning the temporal features from time series of the Range-Doppler Map (RDM), Range Azimuth Map (RAM), and Range Elevation Map (REM), and these temporal features are fused as a fused feature. Figure 4 b displays the method.…”
Section: Preliminariesmentioning
confidence: 99%
See 3 more Smart Citations
“…Figure 4 a shows this method. One other novel result for recognizing single gesture motion is from [ 13 ], where a Transformer-encoder architecture is used by learning the temporal features from time series of the Range-Doppler Map (RDM), Range Azimuth Map (RAM), and Range Elevation Map (REM), and these temporal features are fused as a fused feature. Figure 4 b displays the method.…”
Section: Preliminariesmentioning
confidence: 99%
“…In the following, we describe the positional embedding operation, based on our 2D/3D embedded features. Let be the position embedding function [ 13 ] for the 2D embedded features and . where and are position embedding after performing the function to the 2D embedded features , and , respectively.…”
Section: Our Proposed Chmr-hs Algorithmmentioning
confidence: 99%
See 2 more Smart Citations
“…A CNN feature extractor with an attention-based network was used in [19] for person and activity recognition. In [20], a CNN and a vision-transformer were used for HGR for in-vehicle environments. In these transformer networks, the output feature map of the CNN was directly fed as the input to the transformer.…”
Section: Introductionmentioning
confidence: 99%