2013
DOI: 10.1109/tpami.2013.49
|View full text |Cite
|
Sign up to set email alerts
|

Handwritten Chinese/Japanese Text Recognition Using Semi-Markov Conditional Random Fields

Abstract: This paper proposes a method for handwritten Chinese/Japanese text (character string) recognition based on semi-Markov conditional random fields (semi-CRFs). The high-order semi-CRF model is defined on a lattice containing all possible segmentation-recognition hypotheses of a string to elegantly fuse the scores of candidate character recognition and the compatibilities of geometric and linguistic contexts by representing them in the feature functions. Based on given models of character recognition and compatib… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
6
0

Year Published

2018
2018
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 70 publications
(8 citation statements)
references
References 45 publications
1
6
0
Order By: Relevance
“…We used a trigram table extracted from the year 1993 volume of the Asahi newspaper and the year 2002 volume of the Nikkei newspaper to model linguistic context. From the TUAT-Kondate database collected from 100 people [12], we separated the text lines into 4 sets by writers and then used 3 sets (10,174 text lines written by 75 people) for training the weighting parameters and 1 set (3511 text lines written by 25 people) for testing as in [27,31]. We changed the role four times and took the average.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…We used a trigram table extracted from the year 1993 volume of the Asahi newspaper and the year 2002 volume of the Nikkei newspaper to model linguistic context. From the TUAT-Kondate database collected from 100 people [12], we separated the text lines into 4 sets by writers and then used 3 sets (10,174 text lines written by 75 people) for training the weighting parameters and 1 set (3511 text lines written by 25 people) for testing as in [27,31]. We changed the role four times and took the average.…”
Section: Methodsmentioning
confidence: 99%
“…Therefore, recognition time grows exponentially as the length of input sequence increases. To reduce recognition time for handwritten Chinese and Japanese text, candidate character patterns formed by multiple primitive segments have been restricted in length [27,31]. The length restriction, however, is not applicable for handwritten English text due to a large variance in the lengths of candidate word patterns.…”
Section: Fixation Of Sps From Upsmentioning
confidence: 99%
See 1 more Smart Citation
“…The recommended method was tested on dual handwriting datasets and reportedly outperformed methods that adopt shallow CRF. Zhou et al [ 17 ] proposed a method for detecting Japanese and Chinese text that accorded with semi-Markov CRF. The researchers began with descriptions of semi-CRF on lattices comprised of every possible segmentation-recognition hypothesis of strings, in order to directly approximate the a posteriori probabilities for each.…”
Section: Previous Workmentioning
confidence: 99%
“…Due to the fact that HMM, CRF and HCRF are especially powerful for the task of sequential feature classification, features extracted from a letter or a word image should be sequential or can be easily converted into a sequence that eventually is passed to the appropriate classifier for recognition. The most widely-adopted sequential features for offline handwriting recognition are those extracted using the principle of the so-called sliding window [ 6 , 10 , 14 , 17 ]. Typically, these types of features are sequences of observations extracted by shifting a window along the image of the word from right to left or vice versa.…”
Section: Shape Descriptions Features For Arabic Handwriting Recognmentioning
confidence: 99%