2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR)
DOI: 10.1109/icfhr-2018.2018.00024
Fully Convolutional Networks for Handwriting Recognition

Abstract: Handwritten text recognition is challenging because of the virtually infinite ways a human can write the same message. Our fully convolutional handwriting model takes in a handwriting sample of unknown length and outputs an arbitrary stream of symbols. Our dual stream architecture uses both local and global context and mitigates the need for heavy preprocessing steps such as symbol alignment correction as well as complex post processing steps such as connectionist temporal classification, dictionary matching o…
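The abstract's core claim, that a fully convolutional model accepts a sample of any length and emits a correspondingly long symbol stream, follows from the absence of fixed-size dense layers. A minimal NumPy sketch (an illustrative toy with invented layer sizes, not the paper's actual architecture) shows the output length tracking the input length:

```python
import numpy as np

def conv1d(x, w, stride=1):
    # x: (in_ch, T), w: (out_ch, in_ch, k) -> (out_ch, T_out)
    out_ch, in_ch, k = w.shape
    t_out = (x.shape[1] - k) // stride + 1
    y = np.zeros((out_ch, t_out))
    for t in range(t_out):
        seg = x[:, t * stride : t * stride + k]
        y[:, t] = np.tensordot(w, seg, axes=([1, 2], [0, 1]))
    return y

def fully_conv_recognizer(x, layers):
    # No dense layer anywhere, so any input width is accepted;
    # the output is one score vector per remaining time step.
    for i, (w, stride) in enumerate(layers):
        x = conv1d(x, w, stride)
        if i < len(layers) - 1:
            x = np.maximum(x, 0.0)  # ReLU on hidden layers only
    return x

rng = np.random.default_rng(0)
layers = [(rng.standard_normal((8, 1, 5)), 2),
          (rng.standard_normal((16, 8, 5)), 2),
          (rng.standard_normal((10, 16, 3)), 1)]  # 10 symbol classes (invented)

short = fully_conv_recognizer(rng.standard_normal((1, 64)), layers)
longer = fully_conv_recognizer(rng.standard_normal((1, 128)), layers)
print(short.shape, longer.shape)  # (10, 11) (10, 27)
```

A longer input simply yields more output time steps, which is why no length normalization is needed at inference time.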

Cited by 21 publications (10 citation statements)
References 28 publications (46 reference statements)
“…In most cases, the number of training samples is several hundred or at most a few thousand for machine-learning applications in petroleum engineering [17][18][19][20][21][22][23][24][25][26][27][28], as was also noted in a previous paper [16]. By contrast, the number of training samples exceeds ten thousand, and can reach one million, in computer-science-centered applications [29]. Even so, a large number of training samples does not guarantee reliable training performance when inappropriate data is mixed into the data pool.…”
Section: Introduction
confidence: 78%
“…This task was first tackled using LeNet [8], and character-level recognition remains the standard approach for ideogrammatic languages such as Chinese [9] and Japanese [10]. For alphabetic languages, HTR can also be performed at word level [11], [12], [13], i.e., decoding single words detected in the image. This task is performed both on digitized documents and on scene images [14].…”
Section: Related Work
confidence: 99%
“…A fully convolutional handwriting model proposed by Petroski [28] takes a handwriting sample of unknown length and generates an arbitrary stream of symbols. The dual-stream architecture uses both local and global context and removes the need for heavy pre-processing steps such as symbol alignment correction, as well as complex post-processing steps such as connectionist temporal classification, dictionary matching, or language models.…”
Section: Related Work
confidence: 99%
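For context, the post-processing step the cited model avoids is connectionist temporal classification (CTC, sometimes mis-rendered in translation as "link-time classification"). Its simplest decoding rule, best-path decoding, takes the per-frame argmax, merges consecutive repeats, and drops blanks. A small NumPy sketch (illustrative only, with a made-up three-symbol alphabet where index 0 is the blank):

```python
import numpy as np

def ctc_greedy_decode(logits, blank=0):
    """Best-path CTC decoding: argmax per frame, merge repeats, drop blanks."""
    best = logits.argmax(axis=1)
    out, prev = [], None
    for s in best:
        if s != prev and s != blank:
            out.append(int(s))
        prev = s
    return out

# Frame-wise one-hot scores following the path [1, 1, blank, 1, 2, 2, blank]:
# repeats merge, blanks separate genuine repetitions of the same symbol.
path = np.eye(3)[[1, 1, 0, 1, 2, 2, 0]]
print(ctc_greedy_decode(path))  # [1, 1, 2]
```

A fully convolutional model that emits an already-aligned symbol stream can skip this collapse step entirely, which is the simplification the citing authors highlight.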
“…Weldegebriel [27] proposes a hybrid classification model combining two classifiers: a CNN and Extreme Gradient Boosting (XGBoost). In this integrated model, the CNN serves as an automatic feature extractor for raw images, and XGBoost uses the extracted features as input for recognition and classification.…”
Section: Related Work
confidence: 99%
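The hybrid pipeline described above can be sketched minimally: a convolution-plus-pooling stage turns an image into a flat feature vector, which a boosted-tree classifier then consumes. This is a hand-rolled NumPy toy with invented filter counts and input size, not the model from [27]; the XGBoost step is indicated only in a comment.

```python
import numpy as np

def cnn_features(img, filters, pool=2):
    # img: (H, W) grayscale; filters: (n_f, k, k).
    # Valid 2-D convolution + ReLU + max pooling, then flatten:
    # the flattened activations are what the boosted trees would consume.
    n_f, k, _ = filters.shape
    h, w = img.shape
    out = np.zeros((n_f, h - k + 1, w - k + 1))
    for f in range(n_f):
        for i in range(h - k + 1):
            for j in range(w - k + 1):
                out[f, i, j] = np.sum(img[i:i + k, j:j + k] * filters[f])
    out = np.maximum(out, 0.0)  # ReLU
    hp, wp = out.shape[1] // pool, out.shape[2] // pool
    pooled = out[:, :hp * pool, :wp * pool]
    pooled = pooled.reshape(n_f, hp, pool, wp, pool).max(axis=(2, 4))
    return pooled.ravel()

rng = np.random.default_rng(1)
filters = rng.standard_normal((4, 3, 3))       # 4 invented 3x3 filters
feats = cnn_features(rng.standard_normal((28, 28)), filters)
print(feats.shape)  # (676,) -- 4 maps of 13x13 pooled activations
# The classification stage would then be, e.g.:
#   xgboost.XGBClassifier().fit(np.stack(all_feature_vectors), labels)
```

In the real model the filters are learned by training the CNN end to end first; here they are random, which suffices to show the feature-vector interface between the two classifiers.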