Yang Zhao scite author profile

Deep convolutional networks have better smoke recognition performance. However, a lightweight network model and high recognition accuracy cannot be balanced when deployed on hardware with limited computing resources such as edge computing. Based on this background, we propose a novel smoke recognition network that combines convolutional networks (CNN) and self-attention. The core ideas of this framework are as follows: (1) Combine the depthwise convolution and asymmetric convolution of large convolution kernels to construct a lightweight CNN model, and realize multiscale extraction of feature information with slight model complexity. (2) Combined with the self-attention in transformer, a skip-connection branch is designed, which improves the feature extraction capability of the backbone network through parallel processing and fusion of feature map information. (3) Fusion multicomponent discrete cosine transform (DCT) is used to compress channel information and expand the ability of global average pooling (GAP) to aggregate feature maps. The proposed DCT-GAP improves the accuracy of the network without adding additional computational costs. Experimental results show that the proposed CSANet achieves an average accuracy of over 98.3% with 238 M FLOPs and 5.8 M parameters on the homemade smoke dataset, outperforming state-of-the-art competitors.

show abstract

Micro-YOLO+: Searching Optimal Methods for Compressing Object Detection Model Based on Speed, Size, Cost, and Accuracy

Zhang

Zhao

et al. 2022

SN COMPUT. SCI.

View full text Add to dashboard Cite

Single Image Dehazing Based on Contrastive Learning and Transformer

Zhao

Wang²

2023

J. Phys.: Conf. Ser.

View full text Add to dashboard Cite

For the single image dehazing problem, an end-to-end multi-stage dehazing algorithm is designed. The algorithm contains two distinct parts to extract features. For shallow features, texture-level information is mined by stacking pixel and channel attention mechanisms. The proposed method uses multi-head self-attention (MHSA) to capture high-level features. MHSA improves dehazing performance by mining the dependencies of a wide range of abstract information. The superiority of the transformer architecture is extended with cascaded attention mechanisms and convolutions to improve feature extraction capabilities. Multilayer perceptron (MLP) is used in the decoding stage to equalize the context information. Furthermore, a contrastive loss function that introduces multiple negative samples and correction terms is proposed. The correction term is generated according to the difference between the precise and blurred images, which can enhance the training effect when dealing with different concentrations of dehaze. The training result of this loss function assists the model in approximating clear images and staying away from blurry images. According to the experimental results compared with other methods under the same conditions, the proposed method achieves good results in both subjective visual effects and objective evaluation indicators. The proposed contrast loss function also improves the dehazing performance of the algorithm.

show abstract

From Sparse to Dense: Semantic Graph Evolutionary Hashing for Unsupervised Cross-Modal Retrieval

Zhao

Liao

et al. 2023

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Yang Zhao

Class Concentration with Twin Variational Autoencoders for Unsupervised Cross-Modal Hashing

Lightweight Smoke Recognition Based on Deep Convolution and Self-Attention

Micro-YOLO+: Searching Optimal Methods for Compressing Object Detection Model Based on Speed, Size, Cost, and Accuracy

Single Image Dehazing Based on Contrastive Learning and Transformer

From Sparse to Dense: Semantic Graph Evolutionary Hashing for Unsupervised Cross-Modal Retrieval

Contact Info

Product

Resources

About