“…In this section, we conduct extensive experiments on UCM, RSSCN7 and AID datasets and compare the proposed method with CaffeNet (Cheng, Han et al, 2017a;Xia et al, 2017), VGG-VD-16 (Cheng, Han et al, 2017a;Xia et al, 2017), GoogLeNet (Cheng, Han et al, 2017a;Xia et al, 2017), BoCF (Cheng, Li et al, 2017b), Two-Stage Fusion (Liu et al, 2017), MCNN (Liu et al, 2017), MDFR (Zhang et al, 2019), conv5-MSP5-FV (Zheng et al, 2019) which are the representative remote sensing scene classification methods based on CNN in the past few years. Furthermore, considering the attention mechanism used in our method, the attention-based scene classification methods, e.g., APDC-Net (Bi et al, 2019), ADFF (Zhu et al, 2019), the method proposed by Fan et al (Fan et al, 2019), HoSA (He et al, 2019), Attention GANs (Yu et al, 2020) and SAFF (Cao et al, 2021), are also used for comparison.…”