2020
DOI: 10.1109/jstars.2020.2971763
|View full text |Cite
|
Sign up to set email alerts
|

A CNN-Transformer Hybrid Approach for Crop Classification Using Multitemporal Multisensor Images

Abstract: Multitemporal Earth observation capability plays an increasingly important role in crop monitoring. As the frequency of satellite acquisition of remote sensing images becomes higher, how to fully exploit the implicit phenological laws in dense multitemporal data is of increasing importance. In this article, we propose a CNN-transformer approach to perform the crop classification, in the model, we borrow the transformer architecture from the knowledge of NLP to dig into the pattern of multitemporal sequence. Fi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
32
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 88 publications
(33 citation statements)
references
References 27 publications
1
32
0
Order By: Relevance
“…The authors in [46] proposed a hybrid CNN and transformer architecture for crop classification on multitemporal and multispectral data. In their research, a dataset with 65 acquiring dates were collected from Sentinel-2 A/B and Landsat-8 for a region in central California.…”
Section: Deep Learning Techniques For Hyperspectral Data Analyticsmentioning
confidence: 99%
“…The authors in [46] proposed a hybrid CNN and transformer architecture for crop classification on multitemporal and multispectral data. In their research, a dataset with 65 acquiring dates were collected from Sentinel-2 A/B and Landsat-8 for a region in central California.…”
Section: Deep Learning Techniques For Hyperspectral Data Analyticsmentioning
confidence: 99%
“…In the building extraction task, the transformer-based model may be a potential choice because of its strong feature representation capability, global receptive field, and the capability of modeling long-range dependencies between pixels. However, when dealing with high-resolution image data, transformers are usually computation-inefficient and entail high memory usage [19][20][21][22]. This greatly limits its usability in remote sensing image analysis tasks since both the volume and resolution of high-resolution remote sensing image data have recently been growing exponentially.…”
Section: Introductionmentioning
confidence: 99%
“…An RNN needs to process the input time series moment by moment, which brings a lot of calculation and storage burden to training the model. With the invention of the Transformer model, researchers have applied it to extract temporal dependencies from remote sensing time series data [39], [46], [47].…”
Section: Related Workmentioning
confidence: 99%
“…• In view of the existing problems of DL processing remote-sensing time series, we use the Informer model (a new variant of Transformer) to deal with the remotesensing time-series classification task; we also explain the role of each module of the model when it is applied to this task. • Because the label of the sample is visible to the entire time series [39], we removed the mask operation in the classic Informer model. In addition, unlike the Informer, the input of the decoder module retains the data of all time steps, so that the mutual influence among all moments can be exploited.…”
Section: Introductionmentioning
confidence: 99%