Deep learning classification is the state-of-the-art of machine learning approach. Earlier work proves that the deep convolutional neural network has successfully and brilliantly in different applications such as images or video data. Recognizing and clarifying the remote sensing aspect of the earth's surface and exploit land cover and land use (LCLU). First, this article summarized the remote sensing emerging application and challenges for deep learning methods. Second, we propose four approaches to learn efficient and effective CNNs to transfer image representation on the ImageNet dataset to recognize LCLU datasets. We use VGG16, Inception-ResNet-V2, Inception-V3, and DenseNet201 models to extract features from the EACC dataset. We use pre-trained CNNs on ImageNet to extract features. For feature selection we proposed principal component analysis (PCA) to improve accuracy and speed up the model. We train our model by multi-layer perceptron (MLP) as a classifier. Lastly, we apply the multi-granularity encoding ensemble model. We achieve an overall accuracy of 92.3% for the nine-class classification problem. This work will help remote sensing scientists understand deep learning tools and apply them in large-scale remote sensing challenges
Remote sensing scene classification is a fundamental responsibility of earth observation, aiming to identify information granular for land cover classification. The multi-granular land use for multisource remotely sensed image categories is now a principal task in remote sensing data augmentation and data selection. Understanding image representations are meaningful for the scene classification task. Training deep learning model-based approaches has to do with scene classification and brings about fantastic achievement. At the same time, these high-level approaches are computationally expensive and timeconsuming. This paper introduces multi-granularity Neural Network Encoding architecture base on InceptionV3, InceptionReseNetV2, VGG16, and DenseNet201 architecture into remote sensing scene classification. To improve performance and to solve intra-class variation for multi-class scene issues remote sensing dataset. By using pre-trained CNN, activation function and ensemble learning have been adopting. InceptionV3 and VGG16 are used to extract features. InceptionResNetV2 use for fine-tuning, which consists of unfreezing the entre model and retraining the new data with a lower learning rate. The proposed fine-tune whole pre-trained model produces better results of test set up to 97.84 % than features extracted by InceptionResNetV2. Also, we use DCNNs ensemble average and weighted average to achieve better outcomes for 97.3.6% and 99.10%. In our experiments, we attempted to fine-tune the deep convolutional neural networks (DCNNs) training method for remote sensing scene classification on two public datasets UCM, SIRI-WHU, and one dataset I collected through the google earth engine from East Africa Community Countries (EACC) within nine classes within total 2112 labeled images. The results indicate that our proposed fine-tuning of the pre-trained model with few epochs and less computational time increases accuracy.INDEX TERMS Convolutional neural networks (CNNs), fine-tuning, granularity feature extraction, machine learning, and remote sensing (RS) I.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.