Feature pyramid network with self-guided attention refinement module for crack segmentation

Ong, Jeremy C.H.; Lau, Stephen L. H.; Ismadi, Mohd Zulhilmi Paiz; Wang, Xin

doi:10.1177/14759217221089571

Cited by 13 publications

(5 citation statements)

References 73 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…More importantly, MT-GAN-CS achieves a high F 1 Score of 0.8 at CR = 128 even the used tasks are not similar to each other, such a performance level is comparable or superior to existing crack segmentation methods. [55][56][57] Furthermore, Figure 12 confirms that the reconstruction accuracy of MT-GAN-CS is basically the same as that of ST-GAN-CS, but its time cost to reconstruct one image block is only about a quarter of the cost by ST-GAN-CS (see Appendix D). Therefore, we conclude that MT-GAN-CS can speed up reconstruction without sacrificing the accuracy of recovered crack regions.…”

Section: Multitask Recovery For Image Blocks With Diverse Cracksmentioning

confidence: 58%

Robust multitask compressive sampling via deep generative models for crack detection in structural health monitoring

Zhang

Huang

et al. 2023

Structural Health Monitoring

View full text Add to dashboard Cite

In structural health monitoring (SHM), there is an increasing demand for real-time image-based damage detection. Such a technology is essential for minimizing hazard loss caused by delayed emergency response after earthquakes or other natural disasters, or service interruption during structural inspection. Compressive sampling (CS) is a promising solution to achieve such a goal by greatly reducing the power consumption on high-resolution image transmission when using wireless devices. However, conventional CS failed to achieve high enough compression ratios, while existing generative-model-based CS requires laboriously training a high-quality generator with many large-scale images. To overcome such a bottleneck that hinders the practical use of CS in SHM, we propose a multitask CS algorithm that only relies on existing generators trained by low-pixel crack images. By exploiting the new discovery that similar crack images share a similar sparsity pattern in their latent vectors mapped by the generator, our algorithm achieves higher crack detection accuracy and robustness within a much shorter time when using a high data compression ratio. We verify the effectiveness of the proposed CS algorithm using synthetic and real image data. The results demonstrate that this work has moved a step closer toward successful implementation of operational CS-based crack detection systems in real-time SHM.

show abstract

Section: Multitask Recovery For Image Blocks With Diverse Cracksmentioning

confidence: 58%

Robust multitask compressive sampling via deep generative models for crack detection in structural health monitoring

Zhang

Huang

et al. 2023

Structural Health Monitoring

View full text Add to dashboard Cite

show abstract

“…Ong et al. (2023) proposed a multiscale encoder–decoder architecture with the embedment of self‐guided attention refinement modules, effectively suppressing background noise while enhancing network representation of crack details. Wu et al.…”

Section: Related Workmentioning

confidence: 99%

“…The authors have specifically noticed that among the previously discussed seven HR representation methods, multiscale sampling enhances the representation of small targets by leveraging multiscale feature information, while physical cascading operations achieve refined representation of randomly shaped targets by incorporating physical constraints. Notably, these two methods both exhibit relatively low model complexity, thus deeply mitigating the dependence on high GPU memory for HR image inference (Cheng et al., 2020; Ong et al., 2023). Consequently, this study aims to integrate these two methods to bridge the research gap in fine‐grained representation of HR crack images.…”

Section: Introductionmentioning

confidence: 99%

Fine‐grained crack segmentation for high‐resolution images via a multiscale cascaded network

Chu,

Chun

2023

Computer aided Civil Eng

View full text Add to dashboard Cite

High‐resolution (HR) crack images offer more detailed information for assessing structural conditions compared to low‐resolution (LR) images. This wealth of detail proves indispensable in bolstering the safety of unmanned aerial vehicle (UAV)‐based inspection procedures and elevating the precision of small crack segmentation. Nonetheless, achieving a balance between segmentation accuracy and GPU memory consumption poses a substantial challenge for deep learning models when processing HR crack images. To overcome this challenge, a novel “HR crack segmentation framework” (HRCSF) is proposed, specifically designed to meticulously segment crack images with resolutions exceeding 4K. First, a multiscale crack feature extraction network (MsCFEN) was proposed with the embedment of the strip pooling operation to enhance the representation of the transverse and longitudinal crack pixels from the complex backgrounds. Subsequently, two cascaded operations were tailored to MsCFEN, enabling a comprehensive refinement process that incorporates both global and local aspects. Furthermore, to fully leverage the potential of each proposed component in the refinement process, the complete architecture was trained using a loss function with embedded boundary optimization. Conclusively, a UAV‐based case study was conducted on a real bridge in Changsha, demonstrating HRCSF's practicability in segmenting HR crack images. The implementation of HRCSF allows the UAV to perform crack inspection effectively from a distance of 3 m away from the girder, resulting in a significant 50% reduction in inspection time compared to LR segmentation methods while maintaining high detection accuracy.

show abstract

“…Inspired by NL, Wan et al (2021) designed CrackResAttentionNet for pavement crack detection based on the encoder-decoder network, where two self-attention-based attention modules were added after different encoder layers to aggregate long-range context information. Ong et al (2023) used selfattention to refine each feature pyramid network (FPN) layer so that the deep and shallow layers of the FPN could enhance crack information and reduce noise impact, respectively. However, the large computational cost of the self-attention mechanism severely limits the detection speed.…”

Section: Introductionmentioning

confidence: 99%

“…Ong et al. (2023) used self‐attention to refine each feature pyramid network (FPN) layer so that the deep and shallow layers of the FPN could enhance crack information and reduce noise impact, respectively. However, the large computational cost of the self‐attention mechanism severely limits the detection speed.…”

Section: Introductionmentioning

confidence: 99%

Encoder–decoder with pyramid region attention for pixel‐level pavement crack recognition

Yao,

Liu,

et al. 2023

Computer aided Civil Eng

View full text Add to dashboard Cite

Timely and accurate extraction of pavement crack information is crucial to maintain service conditions and structural safety for infrastructures and reduce further road maintenance costs. Currently, deep learning techniques for automated pavement crack detection are far superior to traditional manual approaches in both speed and accuracy. However, existing deep learning models may easily lose crack details when processing images containing complex background textures or other noises. Although many studies have alleviated this challenge by introducing attention mechanisms, especially the non‐local (NL) block, which has the ability to efficiently capture long‐range dependencies to facilitate crack pixel capture, the huge computational cost of NL makes the inference time of the model too long, which is not conducive to practical implementation. In this study, a new module, namely, the pyramid region attention module (PRAM), was developed by combining the pyramid pooling module in the pyramid scene parsing network and optimized NL, which can achieve global multi‐scale context integration and long‐range dependencies capture at a relatively lower computational cost. By applying PRAM to deep skip connections in the modified U‐Net, an effective crack segmentation model called CrackResU‐Net was developed. The test results on the existing CrackForest dataset showed that CrackResU‐Net not only achieved an F1 score of 0.9580 but also took only 25.89 ms to process an image with a resolution of 480 × 320, which had advantages in accuracy and speed, compared with several other state‐of‐the‐art crack segmentation approaches. It was fully demonstrated that this approach could realize automatic fast and high‐precision recognition of pavement cracks for engineering purposes.

show abstract

Feature pyramid network with self-guided attention refinement module for crack segmentation

Cited by 13 publications

References 73 publications

Robust multitask compressive sampling via deep generative models for crack detection in structural health monitoring

Robust multitask compressive sampling via deep generative models for crack detection in structural health monitoring

Fine‐grained crack segmentation for high‐resolution images via a multiscale cascaded network

Encoder–decoder with pyramid region attention for pixel‐level pavement crack recognition

Contact Info

Product

Resources

About