BID Dataset: a challenge dataset for document processing tasks

Soares., Álysson de Sá; Neves, Ricardo Batista das; Bezerra, Byron Leite Dantas

doi:10.5753/sibgrapi.est.2020.12997

Cited by 18 publications

(8 citation statements)

References 14 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…These documents are usually issued by governments, have strict design and security features, and their main goal is to define, verify and prove the holder's identity. The scope of usage of automatic system for identity document analysis include simplification and automatization of data entry when filling official forms [1], remote person identification [2], remote age checking [3], Know Your Customer / Anti Money Laundering (KYC / AML) procedures [4], and provision of governmental, financial, and other services.…”

Section: Introductionmentioning

confidence: 99%

“…As was mentioned in the previous sections, since identity documents by their nature contain sensitive information, there are very few publicly available datasets of identity document images, and those which exist contain either partial information, or contain synthetic examples of ungenuine documents. Existing datasets dedicated specifically to identity document images include LRDE Identity Document Image Database (LRDE IDID) [7], the recently published Brazilian Identity Document Dataset (BID Dataset) [4], and the Mobile Identity Document Video dataset family (MIDV) [8,9], to which the dataset presented in this paper also belongs. Some larger datasets, dedicated to address the issues of a broader document analysis problem, such as the ones from SmartDoc family [10], also contain identity document images.…”

Section: Introductionmentioning

confidence: 99%

“…The LRDE IDID [7] and the identity documents subset of SmartDoc [10] comprise a small amount of document samples, which allows them only to be used as reference benchmarks, without deeper analysis of identity processing methods. BID Dataset [4] addresses that issue, featuring 28800 synthetically generated document images with 8 different document types. At the same time, the images of BID Dataset were generated with artificial inscription of text field values over the automatically masked document regions, which might lead to the field presentation widely different from the one on actual documents, and the document owner faces were blurred in each document image, which make the dataset impossible to use for evaluation of face detection and location methods.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

MIDV-2020: a comprehensive benchmark dataset for identity document analysis

et al. 2022

View full text Add to dashboard Cite

Identity documents recognition is an important sub-field of document analysis, which deals with tasks of robust document detection, type identification, text fields recognition, as well as identity fraud prevention and document authenticity validation given photos, scans, or video frames of an identity document capture. Significant amount of research has been published on this topic in recent years, however a chief difficulty for such research is scarcity of datasets, due to the subject matter being protected by security requirements. A few datasets of identity documents which are available lack diversity of document types, capturing conditions, or variability of document field values. In this paper, we present a dataset MIDV-2020 which consists of 1000 video clips, 2000 scanned images, and 1000 photos of 1000 unique mock identity documents, each with unique text field values and unique artificially generated faces, with rich annotation. The dataset contains 72409 annotated images in total, making it the largest publicly available identity document dataset to the date of publication. We describe the structure of the dataset, its content and annotations, and present baseline experimental results to serve as a basis for future research. For the task of document location and identification content-independent, feature-based, and semantic segmentation-based methods were evaluated. For the task of document text field recognition, the Tesseract system was evaluated on field and character levels with grouping by field alphabets and document types. For the task of face detection, the performance of Multi Task Cascaded Convolutional Neural Networks-based method was evaluated separately for different types of image input modes. The baseline evaluations show that the existing methods of identity document analysis have a lot of room for improvement given modern challenges. We believe that the proposed dataset will prove invaluable for advancement of the field of document analysis and recognition.

show abstract

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

MIDV-2020: a comprehensive benchmark dataset for identity document analysis

et al. 2022

View full text Add to dashboard Cite

show abstract

“…Magee et al [33] explored the potential application of the Meijering filter [34] to the domain of recaptured identity document detection. The authors create a new dataset of [25] Spanish national ✗ Berenguel et al [26] Spanish national ✗ Gonzalez et al [1] Chilean national ✗ Polevoy et al [8] Various national ✓ Mudgalgundurao et al [29] German national and residence permits ✗ Chen et al [31] University student ✓ Benalcazar et al [9] Chilean national ✗ Magee et al [33] Brazilian national ✗ recaptured images based on the publicly available BID [35] dataset and use it to train an SVM classifier on the raw histogram data obtained by using the filter. Although their system does not compare well with approaches that utilize neural networks, it remains an attractive alternative due to being transparent and explainable.…”

Section: B Fake Id Detectionmentioning

confidence: 99%

Open-Set: ID Card Presentation Attack Detection Using Neural Style Transfer

Markham,

López,

Nieto-Hidalgo

et al. 2024

IEEE Access

View full text Add to dashboard Cite

The accurate detection of ID card Presentation Attacks (PA) is becoming increasingly important due to the rising number of online/remote services that require the presentation of digital photographs of ID cards for digital onboarding or authentication. Furthermore, cybercriminals are continuously searching for innovative ways to fool authentication systems to gain unauthorized access to these services. Although advances in neural network design and training have pushed image classification to the state of the art, one of the main challenges faced by the development of fraud detection systems is the curation of representative datasets for training and evaluation. The handcrafted creation of representative presentation attack samples often requires expertise and is very time-consuming, thus an automatic process of obtaining high-quality data is highly desirable. This work explores ID card Presentation Attack Instruments (PAI) in order to improve the generation of samples with four Generative Adversarial Networks (GANs) based image translation models and analyses the effectiveness of the generated data for training fraud detection systems. Using open-source data, we show that synthetic attack presentations are an adequate complement for additional real attack presentations, where we obtain an EER performance increase of 0.63 % points for print attacks and a loss of 0.29 % for screen capture attacks.

show abstract

“…To evaluate the newly published algorithms, the traditional open datasets such as PRImA [80] (document structure analysis), COCO-text [81] (text detection and recognition from natural images), and the datasets of the international project for the development of document analysis systems MAURDOR [82] are used. In addition to them, international teams create new datasets reflecting the specifics and characteristics of individual document types, for example, the BID [83] and MIDV-500 [1] datasets for the analysis of identity documents.…”

Section: Document Structure Analysis Algorithmsmentioning

confidence: 99%

Document image analysis and recognition: a survey

et al. 2022

View full text Add to dashboard Cite

This paper analyzes the problems of document image recognition and the existing solutions. Document recognition algorithms have been studied for quite a long time, but despite this, currently, the topic is relevant and research continues, as evidenced by a large number of associated publications and reviews. However, most of these works and reviews are devoted to individual recognition tasks. In this review, the entire set of methods, approaches, and algorithms necessary for document recognition is considered. A preliminary systematization allowed us to distinguish groups of methods for extracting information from documents of different types: single-page and multi-page, with text and handwritten contents, with a fixed template and flexible structure, and digitalized via different ways: scanning, photographing, video recording. Here, we consider methods of document recognition and analysis applied to a wide range of tasks: identification and verification of identity, due diligence, machine learning algorithms, questionnaires, and audits. The groups of methods necessary for the recognition of a single page image are examined: the classical computer vision algorithms, i.e., keypoints, local feature descriptors, Fast Hough Transforms, image binarization, and modern neural network models for document boundary detection, document classification, document structure analysis, i.e., text blocks and tables localization, extraction and recognition of the details, post-processing of recognition results. The review provides a description of publicly available experimental data packages for training and testing recognition algorithms. Methods for optimizing the performance of document image analysis and recognition methods are described.

show abstract

BID Dataset: a challenge dataset for document processing tasks

Cited by 18 publications

References 14 publications

MIDV-2020: a comprehensive benchmark dataset for identity document analysis

MIDV-2020: a comprehensive benchmark dataset for identity document analysis

Open-Set: ID Card Presentation Attack Detection Using Neural Style Transfer

Document image analysis and recognition: a survey

Contact Info

Product

Resources

About