2016
DOI: 10.3389/frobt.2015.00036
|View full text |Cite
|
Sign up to set email alerts
|

A Taxonomy of Deep Convolutional Neural Nets for Computer Vision

Abstract: Traditional architectures for solving computer vision problems and the degree of success they enjoyed have been heavily reliant on hand-crafted features. However, of late, deep learning techniques have offered a compelling alternative -that of automatically learning problem-specific features. With this new paradigm, every problem in computer vision is now being re-examined from a deep learning perspective. Therefore, it has become important to understand what kind of deep networks are suitable for a given prob… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
95
0
1

Year Published

2018
2018
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 179 publications
(96 citation statements)
references
References 99 publications
0
95
0
1
Order By: Relevance
“…The survey reported in [41] reviewed the famous architectures from 2012-2015 along with their components. Similarly, there are prominent surveys that discuss different algorithms and applications of CNN [14], [16], [17], 5 [42], [43]. Likewise, the survey presented in [44] discusses taxonomy of CNNs based on acceleration techniques.…”
Section: Introductionmentioning
confidence: 99%
“…The survey reported in [41] reviewed the famous architectures from 2012-2015 along with their components. Similarly, there are prominent surveys that discuss different algorithms and applications of CNN [14], [16], [17], 5 [42], [43]. Likewise, the survey presented in [44] discusses taxonomy of CNNs based on acceleration techniques.…”
Section: Introductionmentioning
confidence: 99%
“…A widely used classifier is the SoftMax function. Compared to the rest of the network, its computational complexity is usually small [4] [30]. The first layer connects the network to the input volume which can be an image, a video frame, or a signal, depending on the application (a 3-channel R,G,B image for instance).…”
Section: A Convolutional Neural Networkmentioning
confidence: 99%
“…For the sake of generality, STREAM SUM, STREAM SCALE, STREAM SHIFT, and STREAM MIN are implemented, as well. Another widely used operation in Conv-Nets is pooling [4]. NST supports max-pooling [53] through the STREAM MAXPL command.…”
Section: A Inference With Nstsmentioning
confidence: 99%
See 1 more Smart Citation
“…Previous to the resurgence of CNN models [27], commonly followed computer vision approaches in VPR employed handcrafted robust features such as SIFT [28], SURF [28], ORB [29], etc. to represent images, encoding them into BoW-like models by using pre-trained dictionaries of visual words [4,11,13,30].…”
Section: Related Workmentioning
confidence: 99%