2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021
DOI: 10.1109/iccv48922.2021.01438
|View full text |Cite
|
Sign up to set email alerts
|

Gait Recognition via Effective Global-Local Feature Representation and Local Temporal Aggregation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
96
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 156 publications
(127 citation statements)
references
References 25 publications
0
96
0
Order By: Relevance
“…Then, gait patterns are modeled by parameters like lengths of limbs, angles of joints, and relative positions of body parts [3,48]. The model-free methods mainly adopt the silhouettes obtained by background subtraction from video frames [5,9,11,15,16,22,32,46,57,58]. In particular, Han et al proposed to aggregate a sequence of silhouettes into a compact Gait Energy Image (GEI) [11] which was widely used by the following methods [32,46].…”
Section: Related Workmentioning
confidence: 99%
“…Then, gait patterns are modeled by parameters like lengths of limbs, angles of joints, and relative positions of body parts [3,48]. The model-free methods mainly adopt the silhouettes obtained by background subtraction from video frames [5,9,11,15,16,22,32,46,57,58]. In particular, Han et al proposed to aggregate a sequence of silhouettes into a compact Gait Energy Image (GEI) [11] which was widely used by the following methods [32,46].…”
Section: Related Workmentioning
confidence: 99%
“…The GEI-based methods [28], [29], [30], [31], [32] greatly compressed computational cost but lost discriminative expression. In contrast, the video-based approaches [8], [10], [11], [12], [13], [25], [26], [33] processed gait sequences frame by frame, which maintained the framelevel discriminative feature in a large extent, and benefited the networks to learn temporal representation. Our approach belongs to appearance-based method and takes silhouette sequences as input.…”
Section: Gait Recognitionmentioning
confidence: 99%
“…LSTM networks were applied in [10], [11] to achieve longshort temporal modeling, which fused temporal clues by temporal accumulation. With the help of stacked 3D blocks, MT3D [12] and GaitGL [13] incorporated temporal information with small and large scales, then concatenated or summed these features as outputs. 3DLocal [34] applied 3D CNN to obtain different local parts, and fused them with feature concatenation.…”
Section: Temporal Modelingmentioning
confidence: 99%
See 2 more Smart Citations