“…The initial design multiplies the two features to obtain an interaction matrix, sum-pools that matrix into a feature vector, and then uses this vector for classification, but it usually suffers from high computational complexity. In recent years, effective attention‐based fusion methods have been developed by extending the Transformer [22–27]. The self‐attention mechanism in the Transformer can be regarded as information fusion on a fully‐connected graph, which offers a more general way to model the input data.…”
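The two fusion strategies described above can be sketched in a few lines of NumPy. This is a minimal illustration, not any specific method from the cited works: the dimensions, the equal per-modality feature size, and the single-head attention without learned projections are all simplifying assumptions. The first part shows why bilinear (multiplicative) fusion is expensive: the fused vector grows quadratically with the feature dimension. The second part shows self-attention mixing all tokens from both modalities, i.e. fusion over a fully connected graph.

```python
import numpy as np

rng = np.random.default_rng(0)
L, d = 16, 32  # locations/tokens per modality and feature dim (assumed values)

X = rng.standard_normal((L, d))  # features from modality A (hypothetical)
Y = rng.standard_normal((L, d))  # features from modality B (hypothetical)

# --- Bilinear fusion sketch ---
# Outer product of the two features at each location, sum-pooled over
# locations; equivalent to X.T @ Y, giving a d x d interaction matrix.
M = X.T @ Y

# Flattening the matrix yields the fused feature vector fed to a classifier.
# Its length is d * d, so cost and memory are quadratic in the feature
# dimension: the "high computational complexity" noted in the text.
fused = M.reshape(-1)

# --- Attention-based fusion sketch (single head, no learned projections) ---
# Concatenate tokens from both modalities; every token attends to every
# other token, i.e. information fusion on a fully connected graph.
tokens = np.concatenate([X, Y], axis=0)           # (2L, d)
scores = tokens @ tokens.T / np.sqrt(d)           # pairwise affinities
weights = np.exp(scores - scores.max(axis=1, keepdims=True))
weights /= weights.sum(axis=1, keepdims=True)     # row-wise softmax
fused_tokens = weights @ tokens                   # each token mixes all others
```

Note that attention is still quadratic in the number of tokens (the `2L x 2L` score matrix), but unlike the bilinear design it keeps the output dimension fixed at `d` per token and learns which cross-modal interactions matter.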