Haotian Yan scite author profile

Haotian Yan

5Publications

23Citation Statements Received

122Citation Statements Given

How they've been cited

How they cite others

119

Affiliations

Beijing University of Posts and Telecommunications, Guangdong University of Technology

Publications

Order By: Most citations

ConTNet: Why not use convolution and transformer at the same time?

Yan¹,

Li²,

Li³

et al. 2021

Preprint

View full text Add to dashboard Cite

Although convolutional networks (ConvNets) have enjoyed great success in computer vision (CV), it suffers from capturing global information crucial to dense prediction tasks such as object detection and segmentation. In this work, we innovatively propose ConTNet (Convolution-Transformer Network), combining transformer with Con-vNet architectures to provide large receptive fields. Unlike the recently-proposed transformer-based models (e.g., ViT, DeiT) that are sensitive to hyper-parameters and extremely dependent on a pile of data augmentations when trained from scratch on a midsize dataset (e.g., ImageNet1k), Con-TNet can be optimized like normal ConvNets (e.g., ResNet) and preserve an outstanding robustness. It is also worth pointing that, given identical strong data augmentations, the performance improvement of ConTNet is more remarkable than that of ResNet. We present its superiority and effectiveness on image classification and downstream tasks. For example, our ConTNet achieves 81.8% top-1 accuracy on ImageNet which is the same as DeiT-B with less than 40% computational complexity. ConTNet-M also outperforms ResNet50 as the backbone of both Faster-RCNN (by 2.6%) and Mask-RCNN (by 3.2%) on COCO2017 dataset. We hope that ConTNet could serve as a useful backbone for CV tasks and bring new ideas for model design. The code will be released at https://github.com/yanhao-tian/ConTNet.

show abstract

Did-Linknet: Polishing D-Block with Dense Connection and Iterative Fusion for Road Extraction

Yan

Zhang

Yang

et al. 2021

View full text Add to dashboard Cite

Computer Vision Applied in Medical Technology: The Comparison of Image Classification and Object Detection on Medical Images

Yan¹

2018

View full text Add to dashboard Cite

Image classification and object detection are two computer vision techniques that are currently commonly used. In this paper, convolutional neural network (CNN) and region-based CNN (RCNN) are used as examples to analyze and compare image classification and object detection. This paper will analyze the architectural characteristics and application scenarios of these two algorithms and analyzes the different characteristics of these two technologies in medical technology applications. CNN is an infrastructure classification algorithm, and image classification tasks are more common in medical image processing. RCNN is the development of CNN. Object detection technology can directly detect the presence and location of the lesion in medical images with RCNN. Combining the algorithms of the two techniques can also achieve some more complex image processing goals.

show abstract

The Advisable Technology of Key-Point Detection and Expression Recognition for an Intelligent Class System

Zhao

Yan

Wang

2019

J. Phys.: Conf. Ser.

View full text Add to dashboard Cite

Infrared image super-resolution with dual mechanism and residual triplet attention

Yan

Cheng

et al. 2021

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Haotian Yan

ConTNet: Why not use convolution and transformer at the same time?

Did-Linknet: Polishing D-Block with Dense Connection and Iterative Fusion for Road Extraction

Computer Vision Applied in Medical Technology: The Comparison of Image Classification and Object Detection on Medical Images

The Advisable Technology of Key-Point Detection and Expression Recognition for an Intelligent Class System

Infrared image super-resolution with dual mechanism and residual triplet attention

Contact Info

Product

Resources

About