2022
DOI: 10.1109/lsp.2022.3178673
An Efficient Axial-Attention Network for Video-Based Person Re-Identification

Cited by 3 publications (5 citation statements)
References 27 publications
“…We implement DCCAL based on our previous EAAN work [49], and choose the ResNet50-3D-EAAN model trained on DukeV or MARS as the backbone network. T=6 frames are selected with the RRS strategy to represent a video sequence, and the images are resized to 256×128 and augmented by random erasing.…”
Section: B Implementation Details
Citation type: mentioning
confidence: 99%
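
The frame selection mentioned in the statement above can be sketched as follows. This is a minimal illustration, assuming "RRS" refers to restricted random sampling as commonly used in video-based ReID: the sequence is split into T contiguous chunks and one frame is drawn at random from each. The function name `restricted_random_sampling`, the `rng` parameter, and the fallback for sequences shorter than T are hypothetical choices, not taken from the cited papers.

```python
import random


def restricted_random_sampling(num_frames, T=6, rng=None):
    """Return T frame indices, one drawn at random from each of T contiguous chunks.

    Assumption: this mirrors the usual "restricted random sampling" recipe;
    the cited work may differ in details such as padding of short sequences.
    """
    if num_frames <= 0:
        raise ValueError("num_frames must be positive")
    rng = rng or random.Random()
    indices = []
    for k in range(T):
        # Integer chunk boundaries [start, end) covering [0, num_frames).
        start = (k * num_frames) // T
        end = ((k + 1) * num_frames) // T
        if end <= start:
            # Empty chunk (sequence shorter than T): reuse the nearest valid frame.
            indices.append(min(start, num_frames - 1))
        else:
            indices.append(rng.randrange(start, end))
    return indices


if __name__ == "__main__":
    # Example: pick T=6 frames from a 60-frame tracklet.
    print(restricted_random_sampling(60, T=6, rng=random.Random(0)))
```

Because each index comes from its own chunk, the sampled frames are in temporal order and spread over the whole tracklet. Resizing to 256×128 and random erasing would then be applied per frame by whatever image pipeline is in use.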
“…The experimental results are shown in Table I. We choose the ResNet50-3D pre-trained on the MARS dataset and the DukeV dataset in EAAN [49], respectively, as the backbone networks. eps is valued in four ways, respectively taking the mean value of the distance matrix D ($eps_{\bar{D}}$), about half of $eps_{\bar{D}}$ ($eps_{\bar{D}/2}$), the mean value of the best eps of each round of dynamic clustering in the whole training process ($\overline{eps_{best}}$), and the best eps of each round of dynamic clustering ($eps_{best}$).…”
Section: Analysis Of Dynamic Clustering
Citation type: mentioning
confidence: 99%
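
The first two eps settings in the statement above (the mean of the distance matrix D and about half of it) can be illustrated with a small sketch. It assumes Euclidean distances between feature vectors and a mean taken over all entries of D, including the zero diagonal; neither choice is confirmed by the quoted text. The helper names `pairwise_distances` and `eps_candidates` are hypothetical. The two "best eps" settings depend on the training loop and are not covered here.

```python
import math


def pairwise_distances(features):
    """Build the full distance matrix D (Euclidean distance is an assumption)."""
    return [[math.dist(a, b) for b in features] for a in features]


def eps_candidates(D):
    """Return the mean of D and half of that mean as candidate eps values.

    Assumption: the mean runs over every entry of D, diagonal included.
    """
    n = len(D)
    if n == 0:
        raise ValueError("distance matrix is empty")
    mean_d = sum(sum(row) for row in D) / (n * n)
    return {"eps_mean": mean_d, "eps_half_mean": mean_d / 2}


if __name__ == "__main__":
    # Toy 2-D features standing in for tracklet embeddings.
    feats = [(0.0, 0.0), (3.0, 4.0)]
    print(eps_candidates(pairwise_distances(feats)))
```

With the two toy points, D has two off-diagonal entries of 5.0 and two zeros on the diagonal, so the mean is 2.5 and the halved value is 1.25.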
“…However, cropping images using a fixed interval brings misalignments of local features, since some person images are acquired with inaccurate detection boxes, such as boxes with the person not centered or boxes with partial bodies. Therefore, the attention scheme [10][11][12] has been introduced to enforce the model to capture cardinal discriminative local features, which boosts the performance of person ReID models greatly. These methods usually focus on the existence of discriminative patterns without regard for positions and orientations.…”
Section: Introduction
Citation type: mentioning
confidence: 99%