“…The comparative experiments include an in‐depth analysis of multiple networks, such as ResNet101 [30], DenseNet [39], EfficientNet‐V2 [40], Vision Transformer [25], MobileVit [41], Swin Transformer [27], and NoisyViT [42]. These networks or their variants achieved superior performance in many scenarios [24, 26, 28]. Among them, ResNet101, DenseNet, and EfficientNet‐V2 are typical convolutional neural networks; Vision Transformer, Mobile ViT, and Swin Transformer are typical transformer networks; and NoisyViT is a newly proposed network with satisfactory results in classification tasks.…”