“…(2019) pruned and utilized VGG16 to classify corrosion and crack defects in bridge structures, achieving accuracy rates of 93.6% and 98.5%, respectively. Novel networks like Faster Region‐CNN (Faster RCNN) and You Only Look Once (YOLO) series (Chun et al., 2023; Z. Zhou et al., 2022) have emerged for object detection, while Mask RCNN and U‐Net (Yamaguchi & Mizutani, 2023) are employed for segmentation. Recent researchers proposed EfficientNet and incorporated attention mechanisms (Chen & He, 2022; Y. Pan & Zhang, 2022; L. Zhang et al., 2023), separable convolution (Zhu et al., 2023; Zou et al., 2022), deformable convolution (Lei et al., 2023), atrous convolution (Siriborvornratanakul, 2023), and other strategies (Zheng et al., 2022) to further enhance model performance.…”