2021
DOI: 10.1002/asi.24544
|View full text |Cite
|
Sign up to set email alerts
|

Text analysis using deep neural networks in digital humanities and information science

Abstract: Combining computational technologies and humanities is an ongoing effort aimed at making resources such as texts, images, audio, video, and other artifacts digitally available, searchable, and analyzable. In recent years, deep neural networks (DNN) dominate the field of automatic text analysis and natural language processing (NLP), in some cases presenting a super-human performance. DNNs are the state-of-the-art machine learning algorithms solving many NLP tasks that are relevant for Digital Humanities (DH) re… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
11
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
6
2

Relationship

2
6

Authors

Journals

citations
Cited by 30 publications
(13 citation statements)
references
References 79 publications
(82 reference statements)
0
11
0
Order By: Relevance
“…Moreover, these transformations can be particularly tricky with data in the humanities. The domains of humanities research are highly specialized, so ML algorithms likewise require specialized training data or adaptations to work effectively in them (Suissa et al, 2022). Yet the benefits of doing so are considerable.…”
Section: Humanities and Language Artsmentioning
confidence: 99%
“…Moreover, these transformations can be particularly tricky with data in the humanities. The domains of humanities research are highly specialized, so ML algorithms likewise require specialized training data or adaptations to work effectively in them (Suissa et al, 2022). Yet the benefits of doing so are considerable.…”
Section: Humanities and Language Artsmentioning
confidence: 99%
“…Determining what is important, and how to process the important reports quickly, is the most challenging aspect of the analyst's task. A possible solution to enhance paragraph processing is to use innovative capabilities of text analysis (see, e.g., Chen et al, 2012 ; Suissa et al, 2022 ) or through semantic processing instead of simple standalone keywords, by exploiting open‐source software such as UK Defence Science and Technology Laboratory's Baleen (which is freely available from: https://github.com/dstl/baleen3 ). The latter is an extensible text processing capability that allows entity‐related information to be extracted from unstructured and semi‐structured data sources.…”
Section: Discussionmentioning
confidence: 99%
“…In recent years, DNNs have become the state-of-the-art method for text analysis in the cultural heritage space [111], and natural language question-answering systems based on DNN have become the state-of-the-art method for solving the question answering task [62]. The underlying task of question answering is Machine Reading Comprehension (MRC), which allows machines to read and comprehend a specified context passage for answering a question, similarly to language proficiency exams.…”
Section: Question Answering Using Dnnmentioning
confidence: 99%