2020
DOI: 10.48550/arxiv.2003.10589
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

A Simple Fix for Convolutional Neural Network via Coordinate Embedding

Abstract: Convolutional Neural Networks (CNN) has been widely applied in the realm of computer vision. However, given the fact that CNN models are translation invariant, they are not aware of the coordinate information of each pixel. Thus the generalization ability of CNN will be limited since the coordinate information is crucial for a model to learn affine transformations which directly operate on the coordinate of each pixel. In this project, we proposed a simple approach to incorporate the coordinate information to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(7 citation statements)
references
References 5 publications
(6 reference statements)
0
7
0
Order By: Relevance
“…Incorporation of coordinate information is an alternative approach that improves performance by providing more information for training. Recently, the incorporation of coordinate information has been proposed to improve object detection in 2D images ( Liu et al, 2018 ; Ren & Hao, 2020 ). Incorporation of coordinate information has the advantage of being able to work with existing CNN models without modification to their architecture.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…Incorporation of coordinate information is an alternative approach that improves performance by providing more information for training. Recently, the incorporation of coordinate information has been proposed to improve object detection in 2D images ( Liu et al, 2018 ; Ren & Hao, 2020 ). Incorporation of coordinate information has the advantage of being able to work with existing CNN models without modification to their architecture.…”
Section: Discussionmentioning
confidence: 99%
“…However, the performance of these two approaches has nearly similar results ( Vaswani et al, 2017 ). In the recent implementations of image processing, researchers still used a linear function to generate a simple index sequence of coordinate information ( Liu et al, 2018 ; Ren & Hao, 2020 ). The coordinate information tends to be of more benefit on the NCCT dataset than the CECT dataset because the NCCT dataset has less contrast resolution.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations