2012
DOI: 10.1007/978-3-642-33290-6_11
|View full text |Cite
|
Sign up to set email alerts
|

Increasing Recall for Text Re-use in Historical Documents to Support Research in the Humanities

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
4
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(4 citation statements)
references
References 6 publications
0
4
0
Order By: Relevance
“…In the text mining literature, there has been a huge volume of work on detecting similarity between or among texts (e.g. [SS95, CGPW97, FEC05, BCMB12]). Detection techniques often have to address particular requirements related to a specific context, such as languages, periods of articles, genres of articles, and so on.…”
Section: Introductionmentioning
confidence: 99%
“…In the text mining literature, there has been a huge volume of work on detecting similarity between or among texts (e.g. [SS95, CGPW97, FEC05, BCMB12]). Detection techniques often have to address particular requirements related to a specific context, such as languages, periods of articles, genres of articles, and so on.…”
Section: Introductionmentioning
confidence: 99%
“…Cf. https://www.etrap.eu/research/tracer/ (eTRAP Project, University of Göttingen) andBüchler (2013) andBüchler et al (2012).3 Cf. https://artfl-project.uchicago.edu/text-pair (ARTFL Project, University of Chicago) andAllen et al (2010) andHorton et al (2010).…”
mentioning
confidence: 99%
“…Specifically, he uses a fingerprinting approach by selecting certain ngrams from an upfront presegmentized corpus. Furthermore, focusing on high recall, the detection of Homeric quotations in Athenaeus' Deipnosophistai is investigated by Büchler et al (2012), searching for distinctive words within reuse. Efforts to automatically process ancient texts are also made around the Perseus Digital Library project (Crane, 1985).…”
Section: Text Reuse Detection In Historical Textmentioning
confidence: 99%
“…The computational detection of such passed, reused text in the form of historical text reuseincluding (verbatim) quotations, allusions, the unintended reuse of a saying, or even cases of cross-linguistic reuse in the form of translations-can be applied in many respects. It can help tracing down historical content (a.k.a., lines of transmission), which is essential to the field of textual criticism (Büchler et al, 2012), or it can help assigning a text to an author (Gupta & Lehal, 2009;Steyvers et al, 2004) if the original author is not clear. In the context of massive digitization projects, text reuse detection can identify relationships between text excerpts referring to the same source.…”
Section: Introductionmentioning
confidence: 99%