2021
DOI: 10.1007/s11042-021-10612-w
|View full text |Cite
|
Sign up to set email alerts
|

Bag-of-Visual-Words codebook generation using deep features for effective classification of imbalanced multi-class image datasets

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
2

Citation Types

0
12
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(12 citation statements)
references
References 34 publications
0
12
0
Order By: Relevance
“…Especially, there are many coding methods proposed in the era of BoVW model [5], e.g., hard voting, soft voting, sparse coding, LLC, local coordinate coding, super vector coding, fisher coding, grouping saliency coding, etc. Given the high similarity between BoVW model and BoDVW model in terms of workflow, some of them have been applied in the existing BoDVW methods such as hard voting [12] [13], sparse coding [4], and LLC [10].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Especially, there are many coding methods proposed in the era of BoVW model [5], e.g., hard voting, soft voting, sparse coding, LLC, local coordinate coding, super vector coding, fisher coding, grouping saliency coding, etc. Given the high similarity between BoVW model and BoDVW model in terms of workflow, some of them have been applied in the existing BoDVW methods such as hard voting [12] [13], sparse coding [4], and LLC [10].…”
Section: Related Workmentioning
confidence: 99%
“…In [11], the author adopted the layer "conv5" of AlexNet, the layer "inception 4(e)" of GoogleNet, and the layer "conv5-3" of VGGNet-16 to extract features, leading to 169 256dimensional features, 196 832-dimensional features, and 196 512-dimensional features for each image, respectively. In [12], the last convolutional layer of ResNet-50 is used to generate 49 2048-dimensional features for each image.…”
Section: Related Workmentioning
confidence: 99%
“…Their approach enhanced the accuracy by approximately 15% and 3% compared to [ 29 , 30 ] respectively, which were sophisticatedly designed convolutional neural networks (CNNs) introduced in 2020. Saini et al [ 12 ] utilized the BoDVW model to address the imbalance issue encountered when processing multi-class image datasets. In the realm of remote sensing classification tasks, the BoDVW model’s classification accuracy surpasses CNN-based methods by about 6% [ 23 ].…”
Section: Introductionmentioning
confidence: 99%
“…Over the past few years, detecting focal zones in images has generated sustained interest among the engineering design and computer vision community. Many vision-based methods have been proposed using features [40], models [41][42][43][44], and general image processing framework. The applied experimental approaches in engineering areas still remains a specialized task due to the certain specifics of each task.…”
Section: Introductionmentioning
confidence: 99%