2017 IEEE International Conference on Computer Vision (ICCV) 2017
DOI: 10.1109/iccv.2017.475
|View full text |Cite
|
Sign up to set email alerts
|

Neural Ctrl-F: Segmentation-Free Query-by-String Word Spotting in Handwritten Manuscript Collections

Abstract: In this paper, we approach the problem of segmentationfree query-by-string word spotting for handwritten documents. In other words, we use methods inspired from computer vision and machine learning to search for words in large collections of digitized manuscripts. In particular, we are interested in historical handwritten texts, which are often far more challenging than modern printed documents. This task is important, as it provides people with a way to quickly find what they are looking for in large collecti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0

Year Published

2019
2019
2021
2021

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 39 publications
(40 citation statements)
references
References 30 publications
(75 reference statements)
0
39
0
Order By: Relevance
“…There are many successful applications of metric learning [11,3,27,33], such as ranking, image retrieval, face verification, speaker verification and so on. By far, applications of metric learning on document analysis or text reading were limited to the problem of word spotting and verification [1,26,34]. In this work, we verify the effectiveness of deep metric learning in text detection task.…”
Section: Related Workmentioning
confidence: 65%
“…There are many successful applications of metric learning [11,3,27,33], such as ranking, image retrieval, face verification, speaker verification and so on. By far, applications of metric learning on document analysis or text reading were limited to the problem of word spotting and verification [1,26,34]. In this work, we verify the effectiveness of deep metric learning in text detection task.…”
Section: Related Workmentioning
confidence: 65%
“…In [64], authors propose a hybrid approach where the document image is first subjected to dense text detection using sliding windows and later the word hypothesises are computed using the set of extremal regions. The third category of methods [6,84,85] in the segmentation-free setting is inspired by the recent success of region proposal based object detection techniques such as Faster R-CNN [57]. The Ctrl-F-Net [84] model proposes an end to end trainable detection and embedding network.…”
Section: Segmentation-free Approachesmentioning
confidence: 99%
“…The EoIs assigned to the decoders are Passport Number, Name, (Gender, Birth Date), Birth Place, (Issue Place, Expiry Date). Nine decoders are set to cover ten EoIs for business card, and decoding steps of each decoder are 21,13,21,21,21,21,32,10,21. The EoIs of each decoder are Telephone, Postcode, Mobile, URL, Email, FAX, Address, (Name, Title), Company.…”
Section: B Experiments Settingmentioning
confidence: 99%