Abstract:The Cervantes Project is creating an Electronic Variorum Edition (EVE) of Cervantes' well-known Don Quixote de la Mancha, published beginning in 1605. In this paper, we report on visualizations of features of a text collection that help us validate our text transcriptions and understand the relationships among the different printings of an edition.
“…Our examination of the visualization pointed out examples of transcription errors (due to unclear patches in the original microfilms), one "unexpected" action taken by the collation algorithm (represented by the unusually long length of one of the variances), and differences of the form identified by others previously (including those in the first two pages). See [11] for further explanation of this display. Our static visualization only depicted the results in the screen, but did not provide further information such as the content of the variant, its length in characters, and the offset in the text; nor does allow interaction.…”
Section: Visualizations For Validating the Results Of The Collationmentioning
confidence: 99%
“…Furthermore, we recognize a practical need in our project to validate the accuracy of the transcriptions of the texts, since those are created manually from an electronic template. Previously, we have described briefly [11] our use of visualizations to depict the variants, and its utility in presenting anomalies that require further investigation. In this paper, we report further on our application of visualizations in detecting patterns of interest within collections of copies or editions of a text.…”
Abstract. As part of the Cervantes Project digital library, we are developing an Electronic Variorum Edition (EVE) of Don Quixote de la Mancha. Multiple editors can create an EVE with our Multi Variant Editor for Documents (MVED), which allows collation of one base text against several comparison texts to identify, link and edit all existing variants among them. In this context we are investigating the use of visualizations to depict graphically variants in order to validate the accuracy of the textual transcriptions and to understand the similarities and differences among different printings and editions. Our broader goal is to enable users to analyze the collation's results and to discover facts about the evolution of the Quixote textual history, and to provide evidence to eliminate printing and compositor's errors and thus to produce a more correct edition closer to Cervantes' original manuscript. This paper describes the visualization tool, and presents the initial results of its use.
“…Our examination of the visualization pointed out examples of transcription errors (due to unclear patches in the original microfilms), one "unexpected" action taken by the collation algorithm (represented by the unusually long length of one of the variances), and differences of the form identified by others previously (including those in the first two pages). See [11] for further explanation of this display. Our static visualization only depicted the results in the screen, but did not provide further information such as the content of the variant, its length in characters, and the offset in the text; nor does allow interaction.…”
Section: Visualizations For Validating the Results Of The Collationmentioning
confidence: 99%
“…Furthermore, we recognize a practical need in our project to validate the accuracy of the transcriptions of the texts, since those are created manually from an electronic template. Previously, we have described briefly [11] our use of visualizations to depict the variants, and its utility in presenting anomalies that require further investigation. In this paper, we report further on our application of visualizations in detecting patterns of interest within collections of copies or editions of a text.…”
Abstract. As part of the Cervantes Project digital library, we are developing an Electronic Variorum Edition (EVE) of Don Quixote de la Mancha. Multiple editors can create an EVE with our Multi Variant Editor for Documents (MVED), which allows collation of one base text against several comparison texts to identify, link and edit all existing variants among them. In this context we are investigating the use of visualizations to depict graphically variants in order to validate the accuracy of the textual transcriptions and to understand the similarities and differences among different printings and editions. Our broader goal is to enable users to analyze the collation's results and to discover facts about the evolution of the Quixote textual history, and to provide evidence to eliminate printing and compositor's errors and thus to produce a more correct edition closer to Cervantes' original manuscript. This paper describes the visualization tool, and presents the initial results of its use.
“…We are in the process of creating an Electronic Variorum Edition (EVE) of Miguel de Cervantes y Saavedra's (Spain, 1547-1616) Don Quixote [4,5,6]. The EVE will be included in the Cervantes Digital Library (CDL), part of the ongoing Cervantes Project [13].…”
Abstract. In this paper we describe ItLv (Interactive Timeline Viewer), a visualization tool currently used to depict the variants obtained in a textual collation. A textual collation is a process in which a base text is compared against several comparison texts to identify differences (variants) among them. The interactive options of ItLv provide different abstractions of a dataset by enabling the presentation and exploration of the relationships that exist within the dataset. Applying ItLv to the dataset resulting from a collation therefore helps understand the relationships among the texts. The example dataset used in this paper is a collation of six early editions of Cervantes' Don Quixote.
“…The Web has clearly allowed new intellectual communities to form, and many intellectual resources formerly available only onsite in special collections are now receiving substantial electronic use (e.g. [20][21][22][23]). The 315 million page-accesses we have tracked since 1996 are a useful diachronic dataset that we are beginning to use to track the evolution of user behavior, while the thousands of email messages to webmaster@perseus.tufts.edu trace the reactions of a patron group (albeit a self-selecting one).…”
Section: Audiencesmentioning
confidence: 99%
“…The efforts of the Perseus Project are based on a strong and somewhat polemical premise: namely, that digital libraries promise new methods by means of which new audiences can ask new questions about new ideas they would never otherwise have been able to explore. While we are based in a university and we are products of US higher education, we see the peer-to-peer interactions between professional colleagues [1][2][3] and indeed the formal instruction of 18-22 year old students [4,5] as instruments for a broader purpose. In one recent survey of 1,500 people, "two fifths … reported that they pursue a hobby or collection related to the past, and they spoke of those pursuits with words like 'love' and 'passion'" [6].…”
This paper surveys research areas relevant to cultural heritage digital libraries. The emerging National Science Digital Library promises to establish the foundation on which those of us beyond the scientific and engineering community will likely build. This paper thus articulates the particular issues that we have encountered in developing cultural heritage collections. We provide a broad overview of audiences, collections, and services.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.