Handwritten Chinese/Japanese Text Recognition Using Semi-Markov Conditional Random Fields

Zhou, Xiangdong; Wang, Da‐Han; Tian, Feng; Liu, Cheng‐Lin; Nakagawa, Masaki

doi:10.1109/tpami.2013.49

Cited by 70 publications

(8 citation statements)

References 45 publications

“…We used a trigram table extracted from the year 1993 volume of the Asahi newspaper and the year 2002 volume of the Nikkei newspaper to model linguistic context. From the TUAT-Kondate database collected from 100 people [12], we separated the text lines into 4 sets by writers and then used 3 sets (10,174 text lines written by 75 people) for training the weighting parameters and 1 set (3511 text lines written by 25 people) for testing as in [27,31]. We changed the role four times and took the average.…”

Section: Methods (mentioning)

confidence: 99%
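
The writer-disjoint rotation described in this statement can be sketched as follows. This is a minimal illustration, not the authors' code: `split_writers`, `rotate_and_average`, and the `evaluate` callback are hypothetical names, and assigning writers to sets round-robin is an assumption (the statement only says the lines were separated into 4 sets by writer).

```python
from statistics import mean


def split_writers(writer_ids, n_folds=4):
    """Partition writers (not text lines) into n_folds disjoint sets.

    Round-robin assignment is an assumption made for this sketch.
    """
    writers = sorted(set(writer_ids))
    return [set(writers[i::n_folds]) for i in range(n_folds)]


def rotate_and_average(lines, evaluate, n_folds=4):
    """lines: list of (writer_id, text_line) pairs.

    evaluate(train, test) is a hypothetical callback that trains the
    weighting parameters on `train` and returns a score on `test`.
    Each writer set serves as the test set once; scores are averaged.
    """
    folds = split_writers([writer for writer, _ in lines], n_folds)
    scores = []
    for held_out in folds:
        train = [item for item in lines if item[0] not in held_out]
        test = [item for item in lines if item[0] in held_out]
        scores.append(evaluate(train, test))
    return mean(scores)
```

Splitting by writer rather than by line keeps every test writer unseen during training, which is what makes the averaged score a writer-independent estimate.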

“…Therefore, recognition time grows exponentially as the length of input sequence increases. To reduce recognition time for handwritten Chinese and Japanese text, candidate character patterns formed by multiple primitive segments have been restricted in length [27,31]. The length restriction, however, is not applicable for handwritten English text due to a large variance in the lengths of candidate word patterns.…”

Section: Fixation of SPs from UPs (mentioning)

confidence: 99%
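
To make the effect of the length restriction concrete, the sketch below enumerates candidate character patterns as spans of consecutive primitive segments and counts the resulting segmentations. The function names and the `max_len` parameter are illustrative assumptions, not taken from the cited papers.

```python
def candidate_patterns(n_segments, max_len):
    """Spans (start, end) of consecutive primitive segments, where each
    candidate pattern covers at most max_len segments."""
    return [
        (start, end)
        for start in range(n_segments)
        for end in range(start + 1, min(start + max_len, n_segments) + 1)
    ]


def count_segmentations(n_segments, max_len):
    """Number of ways to cut the whole sequence into candidate patterns.

    ways[k] is the number of segmentations of the first k segments;
    spans are visited in order of increasing start, so ways[start] is
    final before it is used.
    """
    ways = [1] + [0] * n_segments
    for start, end in candidate_patterns(n_segments, max_len):
        ways[end] += ways[start]
    return ways[n_segments]
```

With 5 primitive segments, restricting candidates to at most 2 segments leaves 9 candidate patterns and 8 segmentations, against 15 candidates and 16 segmentations without the restriction. The number of candidates to classify grows roughly as `n_segments * max_len` rather than quadratically.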

“…Owing to the progress in deep neural network technology, one can consider deploying it for practical systems, but there are some obstacles such as speed and memory space for the large category size to be used in stand-alone systems, especially for hand-held mobile phones and tablets. On the other hand, the explicit segmentation technique also provides reliable performance in recognition of online handwritten Japanese text [31], and online handwritten Chinese text [27]. This approach is also applied for online handwritten English text recognition [19,23].…”

Section: Introduction (mentioning)

confidence: 99%

A unified method for augmented incremental recognition of online handwritten Japanese and English text

Nguyen

Indurkhya

Nakagawa

2019

IJDAR

We present a unified method to augmented incremental recognition for online handwritten Japanese and English text, which is used for busy or on-the-fly recognition while writing, and lazy or delayed recognition after writing, without incurring long waiting times. It extends the local context for segmentation and recognition to a range of recent strokes called "segmentation scope" and "recognition scope," respectively. The recognition scope is inside of the segmentation scope. The augmented incremental recognition triggers recognition at every several recent strokes, updates the segmentation and recognition candidate lattice, and searches over the lattice for the best result incrementally. It also incorporates three techniques. The first is to reuse the segmentation and recognition candidate lattice in the previous recognition scope for the current recognition scope. The second is to fix undecided segmentation points if they are stable between character/word patterns. The third is to skip recognition of partial candidate character/word patterns. The augmented incremental method includes the case of triggering recognition at every new stroke with the above-mentioned techniques. Experiments conducted on TUAT-Kondate and IAM online database show its superiority to batch recognition (recognizing text at one time) and pure incremental recognition (recognizing text at every input stroke) in processing time, waiting time, and recognition accuracy.
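
A rough sketch of the triggering schedule and the two scopes named in the abstract follows. Everything here is an assumption for illustration: the names `trigger_points` and `scopes`, the parameters `every`, `seg_scope`, and `rec_scope`, and in particular the choice to end both scopes at the latest stroke. The abstract states only that the recognition scope lies inside the segmentation scope.

```python
def trigger_points(n_strokes, every):
    """Stroke counts at which recognition is triggered.

    every=1 corresponds to pure incremental recognition, and
    every=n_strokes to batch recognition. A final pass is added so
    that trailing strokes are not left unrecognized (an assumption).
    """
    points = list(range(every, n_strokes + 1, every))
    if not points or points[-1] != n_strokes:
        points.append(n_strokes)
    return points


def scopes(n_strokes, seg_scope, rec_scope):
    """Half-open stroke ranges for segmentation and recognition.

    Both ranges are assumed to end at the latest stroke; the
    recognition scope is kept inside the segmentation scope.
    """
    if rec_scope > seg_scope:
        raise ValueError("recognition scope must fit inside segmentation scope")
    seg = (max(0, n_strokes - seg_scope), n_strokes)
    rec = (max(0, n_strokes - rec_scope), n_strokes)
    return seg, rec
```

For example, `trigger_points(10, 3)` gives `[3, 6, 9, 10]`, so recognition runs four times over ten strokes instead of ten times as in the pure incremental case.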

“…The recommended method was tested on dual handwriting datasets and reportedly outperformed methods that adopt shallow CRF. Zhou et al. [17] proposed a method for detecting Japanese and Chinese text that accorded with semi-Markov CRF. The researchers began with descriptions of semi-CRF on lattices comprised of every possible segmentation-recognition hypothesis of strings, in order to directly approximate the a posteriori probabilities for each.…”

Section: Previous Work (mentioning)

confidence: 99%
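
The idea of a posterior over segmentation-recognition hypotheses on a lattice can be illustrated with a forward pass that sums over all paths. This is a simplified sketch, not the model from the paper: each edge carries a single precomputed score, dependencies between adjacent labels are ignored, and all names (`log_partition`, `path_posterior`, the edge tuple layout) are assumptions.

```python
import math
from collections import defaultdict


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    top = max(values)
    return top + math.log(sum(math.exp(v - top) for v in values))


def log_partition(n_nodes, edges):
    """Log of the summed exponentiated scores of all lattice paths.

    Nodes 0..n_nodes-1 are candidate segmentation points; each edge
    (start, end, label, score) with start < end is one candidate
    character pattern together with its hypothesized label.
    """
    incoming = defaultdict(list)
    for start, end, _label, score in edges:
        incoming[end].append((start, score))
    alpha = {0: 0.0}  # log-sum of path scores reaching each node
    for node in range(1, n_nodes):
        terms = [alpha[s] + sc for s, sc in incoming[node] if s in alpha]
        if terms:
            alpha[node] = log_sum_exp(terms)
    return alpha[n_nodes - 1]


def path_posterior(path, n_nodes, edges):
    """Posterior probability of one full path (a list of edges)."""
    path_score = sum(score for *_, score in path)
    return math.exp(path_score - log_partition(n_nodes, edges))
```

On a three-node lattice with edges `(0, 1, "A", 0.0)`, `(1, 2, "B", 0.0)`, and `(0, 2, "C", 0.0)`, there are two complete paths with equal scores, so each receives posterior 0.5.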

“…Due to the fact that HMM, CRF and HCRF are especially powerful for the task of sequential feature classification, features extracted from a letter or a word image should be sequential or can be easily converted into a sequence that eventually is passed to the appropriate classifier for recognition. The most widely-adopted sequential features for offline handwriting recognition are those extracted using the principle of the so-called sliding window [6, 10, 14, 17]. Typically, these types of features are sequences of observations extracted by shifting a window along the image of the word from right to left or vice versa.…”

Section: Shape Descriptions Features For Arabic Handwriting Recogn… (mentioning)

confidence: 99%
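
The sliding-window principle mentioned in this statement can be sketched in a few lines. The feature used here (ink density inside the window) is only a stand-in chosen for brevity, and the function name and parameters are assumptions, not the feature set of any cited paper.

```python
def sliding_window_features(image, width=3, step=1, right_to_left=True):
    """Turn a binary word image into a sequence of observations.

    image: list of rows, each a list of 0/1 pixels (1 = ink).
    A window of `width` columns is shifted by `step` columns; each
    position yields one feature vector. Ink density is an illustrative
    placeholder feature.
    """
    n_rows, n_cols = len(image), len(image[0])
    starts = list(range(0, n_cols - width + 1, step))
    if right_to_left:  # e.g. for scripts written right to left
        starts.reverse()
    features = []
    for x in starts:
        ink = sum(row[c] for row in image for c in range(x, x + width))
        features.append([ink / (width * n_rows)])
    return features
```

For the 2x4 image `[[1, 0, 0, 0], [1, 1, 0, 0]]` with `width=2`, the right-to-left sequence is `[[0.0], [0.25], [0.75]]`. The resulting list of vectors is the kind of observation sequence that an HMM or CRF classifier consumes.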

Generative vs. Discriminative Recognition Models for Off-Line Arabic Handwriting

Elzobi

Al-Hamadi

2018

Sensors

The majority of handwritten word recognition strategies are constructed on learning-based generative frameworks from letter or word training samples. Theoretically, constructing recognition models through discriminative learning should be the more effective alternative. The primary goal of this research is to compare the performances of discriminative and generative recognition strategies, which are described by generatively-trained hidden Markov modeling (HMM), discriminatively-trained conditional random fields (CRF) and discriminatively-trained hidden-state CRF (HCRF). With learning samples obtained from two dissimilar databases, we initially trained and applied an HMM classification scheme. To enable HMM classifiers to effectively reject incorrect and out-of-vocabulary segmentation, we enhance the models with adaptive threshold schemes. Aside from proposing such schemes for HMM classifiers, this research introduces CRF and HCRF classifiers in the recognition of offline Arabic handwritten words. Furthermore, the efficiencies of all three strategies are fully assessed using two dissimilar databases. Recognition outcomes for both words and letters are presented, with the pros and cons of each strategy emphasized.

Deep RNN Architecture: Design and Evaluation

Sun

Wang

et al. 2019

Cognitive Computation Trends
