Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conferen 2019
DOI: 10.18653/v1/d19-1668
|View full text |Cite
|
Sign up to set email alerts
|

Restoring ancient text using deep learning: a case study on Greek epigraphy

Abstract: Ancient History relies on disciplines such as Epigraphy, the study of ancient inscribed texts, for evidence of the recorded past. However, these texts, "inscriptions", are often damaged over the centuries, and illegible parts of the text must be restored by specialists, known as epigraphists. This work presents PYTHIA, the first ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. Its architecture is carefully designed to handle longterm context … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 43 publications
(41 citation statements)
references
References 21 publications
0
34
0
Order By: Relevance
“…We test BLM's capacity to rewrite specified portions of text on three tasks: text infilling (Zhu et al, 2019), ancient text restoration (Assael et al, 2019) and style transfer (Shen et al, 2017). Fig.…”
Section: Methodsmentioning
confidence: 99%
“…We test BLM's capacity to rewrite specified portions of text on three tasks: text infilling (Zhu et al, 2019), ancient text restoration (Assael et al, 2019) and style transfer (Shen et al, 2017). Fig.…”
Section: Methodsmentioning
confidence: 99%
“…The best results in solving the restoration task, so far, have been achieved in studies of machine-reading comprehension, specifically of the cloze-style, in which both the level of character and word are identified (45). In the field of ancient languages, we know of only one other study that used an algorithm with neural network architecture to recover missing letters, in the context of epigraphic inscriptions in ancient Greek (46). Their model, named PYTHIA, uses a sequence-to-sequence NN with LSTM and was trained on the Packard Humanities Institute (PHI) database, the largest digital dataset of ancient Greek inscriptions.…”
Section: Related Workmentioning
confidence: 99%
“…It supports detoxification or automatic conversion of the text to Chinese characters or Hiragana, providing an opportunity to assist the interpretation of the vast amount of historical text that has remained unconverted. Oxford and DeepMind conducted an initial study to restore ancient Greek letters using deep learning [1]. They proposed PYTHIA, the ancient text restoration model that recovers missing characters from a damaged input text using deep learning.…”
Section: Related Work and Background A Artificial Intelligence-bmentioning
confidence: 99%
“…Currently, there are no existing studies on ancient Korean machine translation. However, machine translation studies for other ancient languages are starting to appear, such as KuroNET [4,5] for ancient Japanese and research on ancient Greek epigraphies [1]. Hence, this study is the first to investigate ancient Korean machine translation and will serve as a starting point for further ancient Korean machine translation research.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation