Traditional remote sensing spatio-temporal data fusion algorithms generally fuse up-sampled low-resolution images (e.g., MODIS) with high-resolution images (e.g., Landsat). The up-sampled low-resolution images are poorly consistent spatially with the high-resolution images and contain many mixed pixels, so uncertainty errors propagate into the fusion results. To address this issue, we propose a framework that combines deep learning-based super-resolution techniques with traditional spatio-temporal fusion methods. By reconstructing the low-resolution images with super-resolution image reconstruction, we obtain low-resolution images that have more spatial detail and better spatial consistency with the high-resolution images; these reconstructed images are then fused using a spatio-temporal fusion method. In this study, we selected Flexible Spatio-Temporal Data Fusion (FSDAF) and the Residual Channel Attention Network (RCAN) for a detailed study to demonstrate the effectiveness of this framework; that is, we developed a new RCAN-FSDAF model. Testing shows that RCAN-FSDAF has the following advantages: (1) The band reflectance predicted by RCAN-FSDAF is closer to the base reflectance than that of FSDAF, DMNet, and GAN-STFM, as shown by higher correlation and smaller error. (2) RCAN-FSDAF better decomposes mixed pixels among heterogeneous features and more accurately identifies boundaries between different land cover features and changes in land cover type. (3) High spatial and temporal resolution NDVI data retrieved from the RCAN-FSDAF predictions are more accurate. The framework developed in this study can be extended to other spatio-temporal data fusion applications.
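The two-stage pipeline described above can be sketched as follows. This is a minimal illustration only: `super_resolve` stands in for the RCAN network (here a trivial pixel-replication upsample) and `fuse` stands in for the full FSDAF algorithm (here a simple temporal-change transfer); both function names and the toy image sizes are assumptions for illustration, not the paper's implementation.

```python
import numpy as np

def super_resolve(coarse: np.ndarray, scale: int) -> np.ndarray:
    """Stand-in for the RCAN super-resolution step.

    In the proposed framework a trained RCAN model would reconstruct
    spatial detail; here we simply replicate each coarse pixel so the
    pipeline shape logic is runnable.
    """
    return np.kron(coarse, np.ones((scale, scale)))

def fuse(fine_t1: np.ndarray, sr_t1: np.ndarray, sr_t2: np.ndarray) -> np.ndarray:
    """Stand-in for the FSDAF fusion step.

    Transfers the temporal change observed between the two
    super-resolved coarse images onto the fine-resolution base image.
    Real FSDAF additionally handles unmixing, residual distribution,
    and land-cover change.
    """
    return fine_t1 + (sr_t2 - sr_t1)

# Toy data: a 4x4 MODIS-like image and a 64x64 Landsat-like base image
# (scale factor 16 is illustrative, not the MODIS/Landsat ratio).
scale = 16
rng = np.random.default_rng(0)
coarse_t1 = rng.random((4, 4))          # coarse image at the base date t1
coarse_t2 = coarse_t1 + 0.1             # coarse image at the prediction date t2
fine_t1 = np.kron(coarse_t1, np.ones((scale, scale)))  # fine image at t1

# Stage 1: super-resolve both coarse images; Stage 2: fuse with the fine base.
pred_t2 = fuse(fine_t1,
               super_resolve(coarse_t1, scale),
               super_resolve(coarse_t2, scale))
print(pred_t2.shape)  # (64, 64) — a fine-resolution prediction for t2
```

The key design point of the framework is that super-resolution is applied *before* fusion, so the fusion step operates on two images with consistent spatial detail rather than on a blurry upsampled coarse image.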