Text baseline detection, a single page trained system

Pastor, María Luisa Carrió

doi:10.1016/j.patcog.2019.05.031

Cited by 10 publications

(5 citation statements)

References 34 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Algorithms in the second category try to handle more complex layouts, e.g., arbitrary oriented text lines, and they can be sub-categorized as clustering-based methods (e.g., [3], [16]- [18]) and function analysis methods (e.g., [22]- [27]). Clustering-based methods first extract basic elements (interest points, connected components, etc.)…”

Section: Related Work a Text Baseline Detection In Historical Documentsmentioning

confidence: 99%

“…Gruuening et al [18] first extracted super pixels with the FAST algorithm [48], and then applied a standard clustering method based on some text line characteristics, e.g., curvilinearity, inter-line spacing and local homogeneity. Pastor [3] first filtered noisy local minima points with Extremely Randomized Trees (ERT) and then performed a modified DBScan [49] algorithm. Function analysis methods try to segment text lines by finding an optimal path across the document image.…”

Section: Related Work a Text Baseline Detection In Historical Documentsmentioning

confidence: 99%

See 1 more Smart Citation

Detecting Text Baselines in Historical Documents With Baseline Primitives

Jia

Sun

et al. 2021

IEEE Access

View full text Add to dashboard Cite

Previous deep learning-based approaches to text baseline detection in historical documents usually take it as a semantic segmentation task. These methods adopt a fully convolutional neural network (FCN) to predict baseline pixels first and then group them into lines by heuristic post-processing steps, which tend to suffer from the wrongly merged or wrongly split problem owing to the limited context information provided by pixels. To address these issues, we introduce the concept of a baseline primitive, which is defined as a virtual bounding box centered at each baseline pixel. After baseline primitive detection, a relation network is used to predict the link relationships between them. In this way, text baselines are generated by detecting baseline primitives and grouping them with these link relationships. Owing to the design of baseline primitives, wider context information can be leveraged to improve the link prediction accuracy. Therefore, our approach can effectively detect text baselines with small inter-line or large inter-word spacing. Quantitative experimental results demonstrate the effectiveness of our proposed baseline primitive design. Consequently, our approach achieves state-of-the-art performance on two public benchmarks, namely cBAD 2017 and cBAD 2019.

show abstract

Section: Related Work a Text Baseline Detection In Historical Documentsmentioning

confidence: 99%

Section: Related Work a Text Baseline Detection In Historical Documentsmentioning

confidence: 99%

Detecting Text Baselines in Historical Documents With Baseline Primitives

Jia

Sun

et al. 2021

IEEE Access

View full text Add to dashboard Cite

show abstract

“…One of these tasks is baseline detection which is one of the most critical tasks in text preprocessing stage. This process is critical because its results will affect the following tasks such as skew and slant correction, segmentation and features extraction processes [17][18][19][20][21]. The baseline is a pseudo horizontal line which connects the ascender strokes and descender strokes.…”

Section: Introductionmentioning

confidence: 99%

Novel Algorithm for Baseline Detection of Offline Arabic Handwritten Text Recognition

Ahmad Mustafa Ali Al Masri,

Muhammad Suzuri Hitam,

Wan Nural Jawahir Hj Wan Yussof

et al. 2024

ARASET

View full text Add to dashboard Cite

The baseline detection is one of the challenging tasks in the pre-processing stage of an online Arabic handwritten text recognition. The challenges include text skewness, touching letters or words, short words, sub-words, ligatures, isolated characters, small descenders, and diacritics. This paper presents a novel automated algorithm for baseline detection that is able to overcome the previously mentioned challenges. The proposed baseline algorithm mainly used a skeletonizing algorithm to estimate the baseline for each sub-word that appear in the image separately. The proposed algorithm had been tested using the benchmark handwritten Arabic database, i.e., IFN/ENIT database. The experimental results showed that it is able to reduce the average error between the actual baseline and the resulting baseline to 3.39 pixels for all sub-words and achieved an average of 3.6% in the ratio between the error pixels and the image height. In addition, the proposed algorithm shows that it is superior as compared to four others benchmarked baseline detection algorithms. It is anticipated that this algorithm will be able to be applied for many Arabic offline handwritten applications in the future.

show abstract

“…Although DLA is a very broad and complex field, most works on DLA focus only on automatic page segmentation and classification, either at text-line level (e.g., baseline detection) [7,15] or including region level segmentation [1,18]. For these subproblems, satisfactory results are currently achieved in most cases.…”

Section: Introductionmentioning

confidence: 99%

“…Modern techniques to obtain these elements from raw handwritten text images can yield very accurate results and are fairly robust to image degradation and other typical difficulties of handwritten documents[1,7,15,18].…”

mentioning

confidence: 99%

Reading order detection on handwritten documents

Quirós

Vidal

2022

Neural Comput & Applic

View full text Add to dashboard Cite

Recent advances in Handwritten Text Recognition and Document Layout Analysis have made it possible to convert digital images of manuscripts into electronic text. However, providing this text with the correct structure and context is still an open problem that needs to be solved to actually enable extracting the relevant information conveyed by the text. The most important structure needed for a set of text elements is their reading order. Most of the studies on the reading order problem are rule-based approaches and focus on printed documents. Much less attention has been paid so far to handwritten text documents, where the problem becomes particularly important—and challenging. In this work, we propose a new approach to automatically determine the reading order of text regions and lines in handwritten text documents. The task is approached as a sorting problem where the order-relation operator is automatically learned from examples. We experimentally demonstrate the effectiveness of our method on three different datasets at different hierarchical levels.

show abstract

Text baseline detection, a single page trained system

Cited by 10 publications

References 34 publications

Detecting Text Baselines in Historical Documents With Baseline Primitives

Detecting Text Baselines in Historical Documents With Baseline Primitives

Novel Algorithm for Baseline Detection of Offline Arabic Handwritten Text Recognition

Reading order detection on handwritten documents

Contact Info

Product

Resources

About