2020 International Joint Conference on Neural Networks (IJCNN) 2020
DOI: 10.1109/ijcnn48605.2020.9206711
|View full text |Cite
|
Sign up to set email alerts
|

A Fast Fully Octave Convolutional Neural Network for Document Image Segmentation

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
16
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 15 publications
(16 citation statements)
references
References 12 publications
0
16
0
Order By: Relevance
“…The main drawback of this method is the small dataset used, without enough variability, compared to a real life operational scenario. In [33], a method based on UNet was proposed to detect document edges and text regions in Brazilian ID Card images, with a Fully Octave Convolutional Neural Network, which replaces the Convolutional Layers by Octave Convolutional Layers, reducing the redundancy of feature maps and obtaining a lighter model. In the datasets developed, the first one is named CDPhotoDataset with 20,000 images, obtaining an IoU of 0.9916; the second one is named DTDDataset, with 800 real Brazilian documents and after data augmentation a total of 10,000 images, obtaining an IoU of 0.9599.…”
Section: Related Workmentioning
confidence: 99%
“…The main drawback of this method is the small dataset used, without enough variability, compared to a real life operational scenario. In [33], a method based on UNet was proposed to detect document edges and text regions in Brazilian ID Card images, with a Fully Octave Convolutional Neural Network, which replaces the Convolutional Layers by Octave Convolutional Layers, reducing the redundancy of feature maps and obtaining a lighter model. In the datasets developed, the first one is named CDPhotoDataset with 20,000 images, obtaining an IoU of 0.9916; the second one is named DTDDataset, with 800 real Brazilian documents and after data augmentation a total of 10,000 images, obtaining an IoU of 0.9599.…”
Section: Related Workmentioning
confidence: 99%
“…1st Challenge -Document Boundary Segmentation: The objective of this challenge is to develop boundary detection algorithms for different kinds of documents [21]. The entrants should develop an algorithm that takes as input an image containing a document, and return a new image of the same size with the background in black pixels and the region occupied by the document in white pixels.…”
Section: Challenge Tasksmentioning
confidence: 99%
“…2nd Challenge -Zone Text Segmentation: This challenge encourages the development of algorithms for automatic text detection in ID documents [21]. The entrants have to develop an algorithm capable of detecting text patterns in the provided set of images; that is, to process an image of a document (without background), and return a new image of the same size with non-interest regions in black pixels and regions of interest (text regions) in white pixels.…”
Section: Challenge Tasksmentioning
confidence: 99%
“…The processing of images of identification documents has received much attention in the literature. Researchers have presented approaches for identification documents classification [4], automatic handwritten signature segmentation [5], document boundary detection and document text detection [3]. As shown in Figure 1, the proposed algorithm is divided into six main steps, which are detailed below:…”
Section: Related Workmentioning
confidence: 99%
“…Once the organizations have the images of identification documents of their customers, they can execute some algorithms for the automation of the text field extraction tasks [3], document classification [4], signature extraction [5], in addition to other properties and patterns present in the identification documents images.…”
Section: Introductionmentioning
confidence: 99%