2018 24th International Conference on Pattern Recognition (ICPR) 2018
DOI: 10.1109/icpr.2018.8545630
|View full text |Cite
|
Sign up to set email alerts
|

Document Image Classification with Intra-Domain Transfer Learning and Stacked Generalization of Deep Convolutional Neural Networks

Abstract: In this article, a region-based Deep Convolutional

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
41
0
1

Year Published

2020
2020
2021
2021

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 76 publications
(43 citation statements)
references
References 34 publications
1
41
0
1
Order By: Relevance
“…Multiple networks were trained on specific sections of the documents [21] to learn region-based high dimensional features later compressed via Principal Component Analysis (PCA). The use of multiple Deep Learning models was also exploited by Das et al by using an ensemble as a meta-classifier [16]. A VGG-16 [41] stack of networks using 5 different classifiers has been proposed, one of them trained on the full document and the others specifically over the header, footer, left body and right body.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Multiple networks were trained on specific sections of the documents [21] to learn region-based high dimensional features later compressed via Principal Component Analysis (PCA). The use of multiple Deep Learning models was also exploited by Das et al by using an ensemble as a meta-classifier [16]. A VGG-16 [41] stack of networks using 5 different classifiers has been proposed, one of them trained on the full document and the others specifically over the header, footer, left body and right body.…”
Section: Related Workmentioning
confidence: 99%
“…Then, transfer learning was demonstrated to work effectively [1,21] by using a network pre-trained on ImageNet [17]. And latest models have become increasingly heavier (greater number of parameters) [2,16,46] as shown in Table 1, with the speed and computational resources drawback this entails.…”
Section: Introductionmentioning
confidence: 99%
“…Deep Learning models and applications have been used in tasks such as image classification, [21][22][23] document analysis and text recognition, [24][25][26] natural language processing, [27][28][29] and video analysis [30][31][32] in industries ranging from automated driving to medical devices as shown in Figure 3. In References 35-37, the authors investigated the use of visual information to detect and interpret road signs using hierarchical classifier structures that combine Support Vector Machines (SVM) for image verification and Convolutional Neural Networks (CNN) for final recognition.…”
Section: Applications Of Deep Learningmentioning
confidence: 99%
“…Each layerwise style loss is multiplied by the predefined loss coefficient; if the coefficient is different from 0, we refer to the corresponding layer as an active layer: There are in total five blocks, the first two blocks have two Conv layers, each followed by ReLU and MaxPool layers, the last three have three Conv layers, each followed by ReLU and MaxPool layers. Image taken from (Das et al, 2018).…”
Section: Network Of Steelmentioning
confidence: 99%