Abstract:This article is an attempt to represent Big Data research in digital humanities as a structured research field. A division in three concentric areas of study is presented. Challenges in the first circlefocusing on the processing and interpretations of large cultural datasets -can be organized linearly following the data processing pipeline. Challenges in the second circle -concerning digital culture at large -can be structured around the different relations linking massive datasets, large communities, collecti… Show more
“…Karpf, 2012;Edwards et al, 2013) and Digital Humanities (e.g. : Manovich, 2012;Kaplan, 2015;Terras et al, 2017) perhaps represents the greater focus of Big Data Analysis in library-related domains.…”
Section: Working Definitions Of Business Analytics and Big Datamentioning
Proposing a step-change in preservation system architectures
Structured AbstractPurpose To consider how Digital Preservation system architectures will support Business Analysis of large-scale collections of preserved resources, and the use of Big Data analyses by future researchers.
Design / methodology / approachArchitectural reviews of existing systems. Experimental surveys of large digital collections using existing Digital Preservation tools at Big Data scales. Design of a proposed new architecture to work with Big Data volumes of preserved digital resources -also based upon experience of managing a collection of 30 million digital images.
FindingsModern visualisation tools enable Business Analyses based on file-related metadata, but most currentlyavailable systems need more of this functionality 'out-of-the-box'. Scalability of preservation architecture to Big Data volumes depends upon the ability to run preservation processes in parallel, so indexes that enable effective sub-division of collections are vital. Not all processes scale easily: those that don't require complex management.
“…Karpf, 2012;Edwards et al, 2013) and Digital Humanities (e.g. : Manovich, 2012;Kaplan, 2015;Terras et al, 2017) perhaps represents the greater focus of Big Data Analysis in library-related domains.…”
Section: Working Definitions Of Business Analytics and Big Datamentioning
Proposing a step-change in preservation system architectures
Structured AbstractPurpose To consider how Digital Preservation system architectures will support Business Analysis of large-scale collections of preserved resources, and the use of Big Data analyses by future researchers.
Design / methodology / approachArchitectural reviews of existing systems. Experimental surveys of large digital collections using existing Digital Preservation tools at Big Data scales. Design of a proposed new architecture to work with Big Data volumes of preserved digital resources -also based upon experience of managing a collection of 30 million digital images.
FindingsModern visualisation tools enable Business Analyses based on file-related metadata, but most currentlyavailable systems need more of this functionality 'out-of-the-box'. Scalability of preservation architecture to Big Data volumes depends upon the ability to run preservation processes in parallel, so indexes that enable effective sub-division of collections are vital. Not all processes scale easily: those that don't require complex management.
“…Their vision is based on Manovich's ideology [38], which is focused on bringing the potential of social or cultural data into humanities and social sciences. Thus, Jean Burgess and Axel Bruns present the BSD concept by mentioning the shift of Big Data towards media, communication, cultural and computational social science, which has led to the wave of research on digital humanities [39][40][41]. According to Burgess and Bruns, such changes "...provoked in large part by the dramatic quantitative growth and apparently increased cultural importance of social mediahence, "big social data".…”
Section: Purpose-driven Approaches: Big Social Data For Societymentioning
“…Bronson and Knezevic 2016;Lazer et al 2014;Oboler, Welsh, and Cruz 2012) or big data-driven research in universities and non-profit institutions (see e.g. Borgman 2015;Gold and Klein 2016;Kaplan 2015;Franke et al 2016, Wyatt et al 2013, Kitchin 2013. For example, due to the dominance of media corporations in retrieving usergenerated big data, research institutions are increasingly dependent on access conditions defined by these companies.…”
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.