DOI: 10.4018/978-1-4666-4699-5.ch011
Big Data at Scale for Digital Humanities

Abstract: Big Data in the humanities is a new phenomenon that is expected to revolutionize the process of humanities research. The HathiTrust Research Center (HTRC) is a cyberinfrastructure to support humanities research on big humanities data. The HTRC has been designed to make the technology serve the researcher: to make the content easy to find, to make the research tools efficient and effective, to allow researchers to customize their environment, to allow researchers to combine their own data w…

Cited by 2 publications (2 citation statements)
References 23 publications
“…In order to comply with non‐consumptive use, we created a dataset from random sentences across random pages in our texts. We recognize that this data is far from representative of the entirety of the whole volumes and publish this dataset in order to comply with legal precedent of non‐consumptivity established in the Authors Guild v. Google legal settlement (Kowalczyk, et al, 2014).…”
Section: Limitations
mentioning
confidence: 99%
“…The HTRC was created as a research arm of the HathiTrust consortium to consider tools for research for scholars who hope to analyse and interpret text at large scale (Unsworth 2011, Kowalczyk 2012, Kowalczyk et al 2013). The services that the HTRC provides include, to date, support for scholar-created custom research collections ('worksets'), statistical text analysis tools and online interfaces to them, an application programming interface (API) for metadata, and a data API for public domain materials.…”
mentioning
confidence: 99%