2002
DOI: 10.1007/3-540-45869-7_24
|View full text |Cite
|
Sign up to set email alerts
|

Text/Graphics Separation Revisited

Abstract: Abstract. Text/graphics separation aims at segmenting the document into two layers: a layer assumed to contain text and a layer containing graphical objects. In this paper, we present a consolidation of a method proposed by Fletcher and Kasturi, with a number of improvements to make it more suitable for graphics-rich documents. We discuss the right choice of thresholds for this method, and their stability. We also propose a post-processing step for retrieving text components touching the graphics, through loca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
87
0
1

Year Published

2006
2006
2017
2017

Publication Types

Select...
4
4
1

Relationship

1
8

Authors

Journals

citations
Cited by 112 publications
(88 citation statements)
references
References 16 publications
0
87
0
1
Order By: Relevance
“…R (%) P (%) (Neumann and Matas, 2012) 12 Because comics are really specific documents, real-scene text detection (Neumann and Matas, 2012) detect very few text lines. The best results we reached with a method from literature is a text/graphic separation method design for documents (Tombre et al, 2002) based on our adaptive segmentation (MCCT).…”
Section: Segmentmentioning
confidence: 97%
See 1 more Smart Citation
“…R (%) P (%) (Neumann and Matas, 2012) 12 Because comics are really specific documents, real-scene text detection (Neumann and Matas, 2012) detect very few text lines. The best results we reached with a method from literature is a text/graphic separation method design for documents (Tombre et al, 2002) based on our adaptive segmentation (MCCT).…”
Section: Segmentmentioning
confidence: 97%
“…In these results we compare our results to other text localization methods, which were either designed for unstructured documents (Tombre et al, 2002), complex backgrounds (Neumann and Matas, 2012).…”
Section: Text Localization In Unstructured Documentsmentioning
confidence: 99%
“…Following the convention of text/graphics separation in document analysis (Tombre et al 2002), we distinguish two types of primitive elements as textual and graphic. Both types of elements are further recognised at two sub-levels, namely grapheme and morpheme.…”
Section: Representation Primitivesmentioning
confidence: 99%
“…Many of the suggested algorithms concern specific type of document form [4][5], while others can fail in the presence of broken or skewed lines or in the case that text and ruling lines are extremely overlapped [6]. In the last case, a decision has to be made if a black pixel belongs to a line or a character, in order to turn it off or keep it on, respectively.…”
Section: Introductionmentioning
confidence: 99%