2021
DOI: 10.1007/s42979-021-00493-z
|View full text |Cite
|
Sign up to set email alerts
|

Depthwise Separable Convolutional Neural Networks for Pedestrian Attribute Recognition

Abstract: Video surveillance is ubiquitous. In addition to understanding various scene objects, extracting human visual attributes from the scene has attracted tremendous traction over the past many years. This is a challenging problem even for human observers. This is a multi-label problem, i.e., a subject in a scene can have multiple attributes that we are hoping to recognize, such as shoes types, clothing type, wearing some accessory, or carrying some object or not, etc. Solutions have been presented over the years a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
1
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(2 citation statements)
references
References 35 publications
0
1
0
Order By: Relevance
“…The overall number of learnable parameters in our system is 6,844, as opposed to 14,362 for the very same system using traditional convolutions. We chose this particular DS-CNN because of its demonstrated versatility, training efficacy, low parameter bank, and impressive performance on smaller samples ( 25 ).…”
Section: Methodsmentioning
confidence: 99%
“…The overall number of learnable parameters in our system is 6,844, as opposed to 14,362 for the very same system using traditional convolutions. We chose this particular DS-CNN because of its demonstrated versatility, training efficacy, low parameter bank, and impressive performance on smaller samples ( 25 ).…”
Section: Methodsmentioning
confidence: 99%
“…However, the handcrafted feature was used in studies [35] and [34], which could not represent images effectively on surveillance cameras. In research [36], the Depthwise Separable Convolution method achieved a recall value of 72.07 and an F1 score of 66.60. In the study [37], multi-visual feature recognition with multi-label focal loss was carried out, producing 84.83% mA (mean accuracy), 79.37% accuracy, 87.47% precision, 86.09% recall, and 86.77% F1-score.…”
Section: Body Visual Multi-feature Recognitionmentioning
confidence: 96%