2020
DOI: 10.1007/s42979-020-00176-1
|View full text |Cite
|
Sign up to set email alerts
|

Historical Document Image Binarization: A Review

Abstract: This review provides a comprehensive view of the field of historical document image binarization with a focus on the contributions made in the last decade. After the introduction of a standard benchmark dataset with the 2009 Document Image Binarization Contest, research in the field accelerated. Besides the standard methods for image thresholding, preprocessing, and post-processing, we review the literature on methods such as statistical models, pixel classification with learning algorithms, and parameter tuni… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
12
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
4
1

Relationship

0
10

Authors

Journals

citations
Cited by 45 publications
(14 citation statements)
references
References 162 publications
(314 reference statements)
0
12
0
Order By: Relevance
“…To measure the quality of the binarized content, we followed the same evaluation protocol used by Kota et al [32] where the metrics and tools from H-DIBCO [49] were used. These metrics try to approximate human perception of quality on binary image [51]. First, we consider the peak signal to noise ratio (PSNR) which measures how close the binary image is to the ground truth (higher is better).…”
Section: A Handwritten Content Extractionmentioning
confidence: 99%
“…To measure the quality of the binarized content, we followed the same evaluation protocol used by Kota et al [32] where the metrics and tools from H-DIBCO [49] were used. These metrics try to approximate human perception of quality on binary image [51]. First, we consider the peak signal to noise ratio (PSNR) which measures how close the binary image is to the ground truth (higher is better).…”
Section: A Handwritten Content Extractionmentioning
confidence: 99%
“…This has been shown to be effective in situations where a colour band shows a cluster of contrast level (Fujita and Aya, 2004). Image binarisation typically uses threshold algorithms to split histograms into two, sometimes using several sub-processes such as background detection and edge detection (Malepati, 2010) to ensure foreground textures stand out as much as possible against their backgrounds (Tensmeyer and Martinez, 2020). This can be especially useful when, despite all previous efforts, tracers are still not prevalent to their background and surroundings.…”
Section: Image Manipulationmentioning
confidence: 99%
“…In this review, we begin by presenting the two categories and subsequently discuss competitions in document binarization that have been held in conjunction with the ICFHR and ICDAR conferences. In addition, we refer readers to further details in two current and comprehensive research efforts [10], [11] in which a review of important works in the field of the document binarization has been explored and possible future research directions in this subject have been presented.…”
Section: Related Workmentioning
confidence: 99%