2020
DOI: 10.1109/access.2020.2995805
|View full text |Cite
|
Sign up to set email alerts
|

A Deep Multi-Attention Driven Approach for Multi-Label Remote Sensing Image Classification

Abstract: Deep learning (DL) based methods have been found popular in the framework of remote sensing (RS) image scene classification. Most of the existing DL based methods assume that training images are annotated by single-labels, however RS images typically contain multiple classes and thus can simultaneously be associated with multi-labels. Despite the success of existing methods in describing the information content of very high resolution aerial images with RGB bands, any direct adaptation for highdimensional high… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
33
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 45 publications
(38 citation statements)
references
References 26 publications
0
33
0
Order By: Relevance
“…For example, class Airports essentially relates to land use, and Intertidal flats appear in RS images either with or without water depending on the image acquisition time and hence require appropriate time series for its classification. The number of labels associated with each image pair varies between 1 and 12, while 96.80% of image 3 https://bigearth.eu/BigEarthNetListofClasses.pdf pairs are not associated with more than 5 labels. Only 23 image pairs are annotated with more than 9 labels.…”
Section: A Class-nomenclature Of Bigeartnet-mmmentioning
confidence: 99%
“…For example, class Airports essentially relates to land use, and Intertidal flats appear in RS images either with or without water depending on the image acquisition time and hence require appropriate time series for its classification. The number of labels associated with each image pair varies between 1 and 12, while 96.80% of image 3 https://bigearth.eu/BigEarthNetListofClasses.pdf pairs are not associated with more than 5 labels. Only 23 image pairs are annotated with more than 9 labels.…”
Section: A Class-nomenclature Of Bigeartnet-mmmentioning
confidence: 99%
“…Hua et al [54] proposed an endto-end network comprising a CNN and a long short-term memory (LSTM) network that is responsible for modeling label dependencies through its recurrent units for multilabel object classification. Sumbul and Demir [55] exploited a bidirectional LSTM network to learn spatial relations among all patches in an image for the final prediction. Hua et al [56] proposed a relational reasoning network module to model label dependencies and gained better classification results.…”
Section: B Multilabel Object Classificationmentioning
confidence: 99%
“…Hua et al [20] proposed an end-to-end architecture that consists of three modules, one for learning high-level feature from the high-resolution aerial image, the second is an attention layer to keep only features located in the discriminative regions, and the last module is a bidirectional LSTM for utilizing the relations among labels in both directions to produce the final labels. In [17], the authors proposed a method for multi-label classification for high-dimensional varying spatial resolutions remote sensing imagery. First, a multi-branch CNN is used to describe the local area of image bands with a branch dedicated to each spatial resolution.…”
Section: Related Workmentioning
confidence: 99%
“…On the contrary, annotation during scene classification requires labels at the scene level only. For these reasons, the interest in multi-label classification, which aims to assign an image with multiple semantic labels, is increasing in the community [5][6][7][8][9][10][11][12][13][14][15][16][17][18][19][20]. It is an essential step to provide a better understanding of the scene.…”
Section: Introductionmentioning
confidence: 99%