IEEE/ACM Joint Conference on Digital Libraries 2014
DOI: 10.1109/jcdl.2014.6970163
|View full text |Cite
|
Sign up to set email alerts
|

The anatomy of a search and mining system for digital humanities

Abstract: Samtla (Search And Mining Tools with Linguistic Analysis) is an online integrated research environment designed in collaboration with historians and linguists to facilitate the study of digitised texts written in any language. It currently supports the research of two corpora: the Genizah collection held by the Taylor-Schechter Genizah Research Unit in Cambridge University, and a collection of Aramaic incantation texts from late antiquity. In contrast to standard search engines and text mining systems that rel… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
7
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(7 citation statements)
references
References 42 publications
0
7
0
Order By: Relevance
“…Their research result shows acceptable performance, which is on par with previously reported studies. Although there have been a lot of digital tools to be successfully developed to support digital humanities research, Harris et al . (2014) argued that there is still a huge gap between what researchers want and the available digital tools in terms of their functionality and usability based on assessing the experience of historians using digital tools for digital humanities research.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Their research result shows acceptable performance, which is on par with previously reported studies. Although there have been a lot of digital tools to be successfully developed to support digital humanities research, Harris et al . (2014) argued that there is still a huge gap between what researchers want and the available digital tools in terms of their functionality and usability based on assessing the experience of historians using digital tools for digital humanities research.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Also, Brooke et al (2015) introduced a software tool, GutenTag, which is aimed at giving literary researchers direct access to NLP techniques for the analysis of texts in the Project Gutenberg corpus. Harris et al (2014) presented the anatomy of Samtla, which is an online integrated research environment designed in collaboration with historians and linguists, to facilitate the study of digitized texts written in any language. Samtla is fundamentally different from standard text search/mining systems which rely on the bag-of-words representation of text.…”
Section: Literature Reviewmentioning
confidence: 99%
“…The result of their study shows acceptable performance, which is on par with previously reported studies. Although there have been a lot of digital tools to be successfully developed to support DH research, Harris et al (2014) argued that there is still a huge gap between what humanists actually want and what digital tools can do in terms of functionality and usability.…”
Section: Literature Reviewmentioning
confidence: 99%
“…Here the notion of "finding" is divided into search (i.e., locating exact and approximate parallel passages with respect to a given text segment, across multiple documents), and the other refers to discovery (i.e., identifying potentially related text segments). The domain-independent and language-independent digital infrastructure introduced in this paper is at the core of the Samtla (Search And Mining Tools for Language Archives) system 1 [21,22] which has been designed in collaboration with historians, and linguists. Samtla is a web application , based on the a Model-View-Controller (MVC) design pattern [33], and developed in Python 2 using the Django 3 web framework.…”
Section: Introductionmentioning
confidence: 99%