2016
DOI: 10.1016/j.patcog.2016.01.007
|View full text |Cite
|
Sign up to set email alerts
|

String representations and distances in deep Convolutional Neural Networks for image classification

Abstract: International audienceRecent advances in image classification mostly rely on the use of powerful local features combined with an adapted image representation. Although Convolutional Neural Network (CNN) features learned from ImageNet were shown to be generic and very efficient, they still lack of flexibility to take into account variations in the spatial layout of visual elements. In this paper, we investigate the use of structural representations on top of pre-trained CNN features to improve image classificat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
12
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 55 publications
(12 citation statements)
references
References 32 publications
0
12
0
Order By: Relevance
“…The transformation types include translation, reflection, and light and color transformation. In contrast to previous network models, AlexNet expands storage capacity by increasing the number of GPUs [22]. It uses at least two GPUs to save kernels on average, and the GPUs communicate in the specified layers.…”
Section: B Alexnet Convolutional Neural Networkmentioning
confidence: 99%
“…The transformation types include translation, reflection, and light and color transformation. In contrast to previous network models, AlexNet expands storage capacity by increasing the number of GPUs [22]. It uses at least two GPUs to save kernels on average, and the GPUs communicate in the specified layers.…”
Section: B Alexnet Convolutional Neural Networkmentioning
confidence: 99%
“…For example, deep CNNs have been widely used for various computer vision tasks, such as large-scale detection and classification of object categories [2][3][4][5][6], deep CNN-based transfer learning [7,8] and speech recognition [9]. RNNs, another important branch of the deep neural networks family, were mainly designed for sequence modeling.…”
Section: Introductionmentioning
confidence: 99%
“…With the continuous development of deep learning techniques, convolutional neural networks (CNNs) have demonstrated significant advantages in the field of object recognition, such as string recognition (Barat and Ducottet, ), script identification (Shi et al., ), 3D object retrieval (Leng et al., ), and human‐action recognition (Xu et al., ). First proposed by Lecun et al.…”
Section: Introductionmentioning
confidence: 99%
“…With the continuous development of deep learning techniques, convolutional neural networks (CNNs) have demonstrated significant advantages in the field of object recognition, such as string recognition (Barat and Ducottet, 2016), script identification (Shi et al, 2016), 3D object retrieval (Leng et al, 2015), and human-action recognition (Xu et al, 2016b). First proposed by Lecun et al (LeCun and Bengio, 1995;LeCun, 1989), CNNs can be considered as a type of supervised highly nonlinear mapping that output targetfeatures in a specified format based on the input data.…”
Section: Introductionmentioning
confidence: 99%