Document Recognition and Retrieval XVII 2010
DOI: 10.1117/12.840251
|View full text |Cite
|
Sign up to set email alerts
|

Touching character segmentation method for Chinese historical documents

Abstract: The OCR technology for Chinese historical documents is still an open problem. As these documents are hand-written or hand-carved in various styles, overlapped and touching characters bring great difficulty for character segmentation module. This paper presents an over-segmentation-based method to handle the overlapped and touching Chinese characters in historic documents. The whole segmentation process includes two parts: over-segmented and segmenting path optimization. In the former part, touching strokes wil… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 3 publications
0
1
0
Order By: Relevance
“…At present, traditional text recognition technology and document analysis technology have made some progress, and the recognition rate of optical character recognition (OCR) can currently reach 99.99%, but high quality must be required for the picture and it is by scanning software and scanning process. Existing Chinese character recognition methods are mostly targeted at Chinese character recognition of small character set, and recognition rate is affected by font and character set [11] .…”
Section: Introductionmentioning
confidence: 99%
“…At present, traditional text recognition technology and document analysis technology have made some progress, and the recognition rate of optical character recognition (OCR) can currently reach 99.99%, but high quality must be required for the picture and it is by scanning software and scanning process. Existing Chinese character recognition methods are mostly targeted at Chinese character recognition of small character set, and recognition rate is affected by font and character set [11] .…”
Section: Introductionmentioning
confidence: 99%