A Simple Fix for Convolutional Neural Network via Coordinate Embedding

Ren, Liliang; Hao, Zhuonan

doi:10.48550/arxiv.2003.10589

Cited by 2 publications

(7 citation statements)

References 5 publications

(6 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Incorporation of coordinate information is an alternative approach that improves performance by providing more information for training. Recently, the incorporation of coordinate information has been proposed to improve object detection in 2D images ( Liu et al, 2018 ; Ren & Hao, 2020 ). Incorporation of coordinate information has the advantage of being able to work with existing CNN models without modification to their architecture.…”

Section: Discussionmentioning

confidence: 99%

“…However, the performance of these two approaches has nearly similar results ( Vaswani et al, 2017 ). In the recent implementations of image processing, researchers still used a linear function to generate a simple index sequence of coordinate information ( Liu et al, 2018 ; Ren & Hao, 2020 ). The coordinate information tends to be of more benefit on the NCCT dataset than the CECT dataset because the NCCT dataset has less contrast resolution.…”

Section: Discussionmentioning

confidence: 99%

“…The positional encoding was implemented by a linear function ( Gehring et al, 2017 ) and sinusoidal function ( Vaswani et al, 2017 ). In the image processing domain, the coordinate information encoding was also proposed in recent literature ( Liu et al, 2018 ; Ren & Hao, 2020 ) by adding extra channels in input images. Liu et al (2018) proposed incorporating coordinate information in two extra channels for the x, y axes of 2D images using a continuous sequence of integers starting with zero in row and column.…”

Section: Related Workmentioning

confidence: 99%

“…They demonstrated an improvement of CNNs in image classification, objection detection and generative models. Ren & Hao (2020) proposed a similar coordinate information embedding in the extra channels of images as the input of a downstream CNN. They demonstrated an improvement of object detection in traffic sign images.…”

Section: Related Workmentioning

confidence: 99%

“…Recent work on use of CNNs for medical image segmentation has explored various network architectures to improve performance ( Shen, Wu & Suk, 2017 ; Kim et al, 2019 ). With this line of work seeming to have reached a plateau, a promising approach to achieve further improvement is to incorporate additional data, such as coordinate information ( Liu et al, 2018 ; Ren & Hao, 2020 ). Segmentation of AAA is a good candidate for this approach since the AAA is a tubular structure almost always oriented from head to toe.…”

Section: Introductionmentioning

confidence: 99%

See 4 more Smart Citations

A retrospective study of 3D deep learning approach incorporating coordinate information to improve the segmentation of pre- and post-operative abdominal aortic aneurysm

Siriapisith

Kusakunniran

Haddawy

2022

PeerJ Computer Science

View full text Add to dashboard Cite

Abdominal aortic aneurysm (AAA) is one of the most common diseases worldwide. 3D segmentation of AAA provides useful information for surgical decisions and follow-up treatment. However, existing segmentation methods are time consuming and not practical in routine use. In this article, the segmentation task will be addressed automatically using a deep learning based approach which has been proved to successfully solve several medical imaging problems with excellent performances. This article therefore proposes a new solution of AAA segmentation using deep learning in a type of 3D convolutional neural network (CNN) architecture that also incorporates coordinate information. The tested CNNs are UNet, AG-DSV-UNet, VNet, ResNetMed and DenseVoxNet. The 3D-CNNs are trained with a dataset of high resolution (256 × 256) non-contrast and post-contrast CT images containing 64 slices from each of 200 patients. The dataset consists of contiguous CT slices without augmentation and no post-processing step. The experiments show that incorporation of coordinate information improves the segmentation results. The best accuracies on non-contrast and contrast-enhanced images have average dice scores of 97.13% and 96.74%, respectively. Transfer learning from a pre-trained network of a pre-operative dataset to post-operative endovascular aneurysm repair (EVAR) was also performed. The segmentation accuracy of post-operative EVAR using transfer learning on non-contrast and contrast-enhanced CT datasets achieved the best dice scores of 94.90% and 95.66%, respectively.

show abstract

Section: Discussionmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

A retrospective study of 3D deep learning approach incorporating coordinate information to improve the segmentation of pre- and post-operative abdominal aortic aneurysm

Siriapisith

Kusakunniran

Haddawy

2022

PeerJ Computer Science

View full text Add to dashboard Cite

show abstract

Robust Inference of Multi-Task Convolutional Neural Network for Advanced Driving Assistance by Embedding Coordinates

Miyama

2022

World Congress on Electrical Engineering and Computer Systems and Science

View full text Add to dashboard Cite

In this study, we develop a multitasking CNN (Convolutional Neural Network) for advanced driving assistance. The network simultaneously performs three tasks: object detection, semantic segmentation, and disparity estimation. Edge computing requires low computation and low storage capacity, so the three tasks share not only one encoder, but also one decoder that employs a combination of depth-wise point-wise convolution and bilinear interpolation instead of the usual transpose convolution. This reduces the number of multiply-accumulate operations to 44.0% and the number of convolution weight parameters to 38.2%. In multitasking CNN training, the loss weights for each task were automatically adjusted by backpropagation, and the three tasks were learned in a balanced manner. Reducing the complexity of the decoder did not degrade the recognition accuracy, but rather improved it. Moreover, we found that entering pixel coordinates in this CNN significantly reduced misestimations for images that differed significantly from those during training.

show abstract

A Simple Fix for Convolutional Neural Network via Coordinate Embedding

Cited by 2 publications

References 5 publications

A retrospective study of 3D deep learning approach incorporating coordinate information to improve the segmentation of pre- and post-operative abdominal aortic aneurysm

A retrospective study of 3D deep learning approach incorporating coordinate information to improve the segmentation of pre- and post-operative abdominal aortic aneurysm

Robust Inference of Multi-Task Convolutional Neural Network for Advanced Driving Assistance by Embedding Coordinates

Contact Info

Product

Resources

About