DOI: 10.4995/thesis/10251/116834
|View full text |Cite
|
Sign up to set email alerts
|

A Probabilistic Formulation of Keyword Spotting

Abstract: Keyword Spotting, applied to handwritten text documents, aims to retrieve the documents, or parts of them, that are relevant for a query, given by the user, within a large collection of documents. The topic has gained a large interest in the last 20 years among Pattern Recognition researchers, as well as digital libraries and archives. vi RESUMEN mos para construir índices de palabras a partir de modelos probabilísticos, basados tanto en un léxico cerrado como abierto. Estos índices son muy similares a los uti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(10 citation statements)
references
References 123 publications
0
9
0
Order By: Relevance
“…A possible direction would consist of the use of linguistic statistics [9]. A recent method for using language information is a dual-state word-beam search [10] for decoding the connectionist temporal classification (CTC [11]) layer of neural networks, which has been shown to be effective [10].…”
Section: Introductionmentioning
confidence: 99%
“…A possible direction would consist of the use of linguistic statistics [9]. A recent method for using language information is a dual-state word-beam search [10] for decoding the connectionist temporal classification (CTC [11]) layer of neural networks, which has been shown to be effective [10].…”
Section: Introductionmentioning
confidence: 99%
“…Nevertheless, of course, our results do still leave significant room for improvement, and we do think that in many cases it might actually come from the use of textual features which, in future works we plan extract using a recent methodology known as ''probabilistic indexing'' [17,23].…”
Section: Methodsmentioning
confidence: 97%
“…This de facto standard evaluation measures the text line segmentation as per its extraction polygon which incorrectly diminished the importance of the detection subtask. Moreover, the line extraction accuracy results obtained with this measure present little correlation with the transcription accuracy results of the systems using the extracted lines [21].…”
Section: Introductionmentioning
confidence: 84%
“…Usage of Conditional Random Fields (CRF) was tried out in different articles but it did not fare well in comparison to Stochastic Context Free Grammars [2,6]. Markov Random Fields have seen minimal use to differentiate between printed text and handwritten text [21].…”
Section: State Of the Artmentioning
confidence: 99%
See 1 more Smart Citation