2010
DOI: 10.1007/s10032-010-0132-6
|View full text |Cite
|
Sign up to set email alerts
|

Towards information retrieval on historical document collections: the role of matching procedures and special lexica

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
21
0

Year Published

2010
2010
2018
2018

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 25 publications
(22 citation statements)
references
References 8 publications
1
21
0
Order By: Relevance
“…Studies on HDR have generally focused on the differences between historical and modern languages. OCR errors have been omitted from the experimental settings by using manually created or manually corrected test data (e.g., Braun et al., ; Gotscharek, Reffle, Ringsletter, Schulz, & Neumann, ; Hauser, Heller, Leiss, Schulz, & Wanzeck, ; Kempken et al., , Koolen et al., ; O'Rourke et al., ). An exception is Pilz, Luther, Fuhr, and Ammon (), who created rules for handling OCR errors both manually and automatically based on edit costs between character replacements.…”
Section: Related Researchmentioning
confidence: 99%
See 2 more Smart Citations
“…Studies on HDR have generally focused on the differences between historical and modern languages. OCR errors have been omitted from the experimental settings by using manually created or manually corrected test data (e.g., Braun et al., ; Gotscharek, Reffle, Ringsletter, Schulz, & Neumann, ; Hauser, Heller, Leiss, Schulz, & Wanzeck, ; Kempken et al., , Koolen et al., ; O'Rourke et al., ). An exception is Pilz, Luther, Fuhr, and Ammon (), who created rules for handling OCR errors both manually and automatically based on edit costs between character replacements.…”
Section: Related Researchmentioning
confidence: 99%
“…Gotscharek et al. () described a corpus‐based approach to efficient construction of historical lexica with a focus on reducing the manual workload of lexicon construction. However, most studies have focused on the string level variation, ignoring the more complex issues related to conceptual, vocabulary, and syntactic change.…”
Section: Related Researchmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition to the use of old fonts, incunabula show a significant variance of spelling for many words. Words' variants have been used for several centuries in many countries [7] bringing to a progressive standardization of Fig. 1 Fragment of one Genesis page [40] with the corresponding text transcription at the bottom spelling.…”
Section: Early Printed Booksmentioning
confidence: 99%
“…This alleviates the well-known domain and genre effects that seem to be currently primarily attracting this group's attention, cf. [14]. We are not sure whether incorporating the corpus vocabulary at correction time is possible in the fsa paradigm.…”
Section: Related Workmentioning
confidence: 99%