Mingteng Li scite author profile

Mingteng Li

4Publications

37Citation Statements Received

111Citation Statements Given

How they've been cited

How they cite others

Affiliations

Publications

Order By: Most citations

Homo–Heterogenous Transformer Learning Framework for RS Scene Classification

Tang

et al. 2022

IEEE J. Sel. Top. Appl. Earth Observations Remote Sensing

View full text Add to dashboard Cite

Remote sensing (RS) scene classification plays an essential role in the RS community and has attracted increasing attention due to its wide applications. Recently, benefiting from the powerful feature learning capabilities of convolutional neural networks (CNNs), the accuracy of RS scene classification has significantly been improved. Although the existing CNNbased methods achieve excellent results, there is still room for improvement. First, the CNN-based methods are adept at capturing the global information from RS scenes. Still, the context relationships hidden in RS scenes cannot be thoroughly mined. Second, due to the specific structure, it is easy for normal CNNs to exploit the heterogenous information from RS scenes. Nevertheless, the homogenous information, which is also crucial to comprehensively understand complex contents within RS scenes, does not get the attention it deserves. Third, most CNNs focus on establishing the relationships between RS scenes and semantic labels. However, the similarities between them are not considered deeply, which are helpful to distinguish the intra-/inter-class samples. To overcome the limitations mentioned above, we propose a homo-heterogenous transformer learning (HHTL) framework for RS scene classification in this paper. First, a patch generation module (PGM) is designed to generate homogenous and heterogenous patches. Then, a dual-branch feature learning module (FLM) is proposed to mine homogenous and heterogenous information within RS scenes simultaneously. In FLM, based on vision transformer, not only the global information but also the local areas and their context information can be captured. Finally, we design a classification module, which consists of a fusion sub-module and a metric-learning module. It can integrate homo-heterogenous information and compact/separate samples from the same/different RS scene categories. Extensive experiments are conducted on four public RS scene data sets. The encouraging results demonstrate that our HHTL framework can outperform many state-of-the-art methods. Our source codes are available at https://github.com/TangXu-Group/Remote-Sensing-Images-Classification/tree/main/HHTL.

show abstract

EMTCAL: Efficient Multiscale Transformer and Cross-Level Attention Learning for Remote Sensing Scene Classification

Tang

et al. 2022

IEEE Trans. Geosci. Remote Sensing

View full text Add to dashboard Cite

Multi-Scale Interactive Transformer for Remote Sensing Cross-Modal Image-Text Retrieval

Wang

et al. 2022

View full text Add to dashboard Cite

Resformer: Bridging Residual Network and Transformer for Remote Sensing Scene Classification

Tang

et al. 2022

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Mingteng Li

Homo–Heterogenous Transformer Learning Framework for RS Scene Classification

EMTCAL: Efficient Multiscale Transformer and Cross-Level Attention Learning for Remote Sensing Scene Classification

Multi-Scale Interactive Transformer for Remote Sensing Cross-Modal Image-Text Retrieval

Resformer: Bridging Residual Network and Transformer for Remote Sensing Scene Classification

Contact Info

Product

Resources

About