2023
DOI: 10.1162/coli_a_00481
|View full text |Cite
|
Sign up to set email alerts
|

Machine Learning for Ancient Languages: A Survey

Thea Sommerschield,
Yannis Assael,
John Pavlopoulos
et al.

Abstract: Ancient languages preserve the cultures and histories of the past. However, their study is fraught with difficulties, and experts must tackle a range of challenging text-based tasks, from deciphering lost languages to restoring damaged inscriptions, to determining the authorship of works of literature. Technological aids have long supported the study of ancient texts, but in recent years advances in Artificial Intelligence and Machine Learning have enabled analyses on a scale and in a detail that are reshaping… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 11 publications
(4 citation statements)
references
References 179 publications
0
4
0
Order By: Relevance
“…Some efforts have expanded resources for specific ancient languages. The PROIEL offers 187,000 syntactic annotations for Ancient Greek [2], while the Index Thomisticus Treebank contains over 60 million words from Medieval Latin [2]. However, diversity issues persist, and most ancient languages remain under-resourced.…”
Section: Training Datasets For Ancient Scriptsmentioning
confidence: 99%
See 1 more Smart Citation
“…Some efforts have expanded resources for specific ancient languages. The PROIEL offers 187,000 syntactic annotations for Ancient Greek [2], while the Index Thomisticus Treebank contains over 60 million words from Medieval Latin [2]. However, diversity issues persist, and most ancient languages remain under-resourced.…”
Section: Training Datasets For Ancient Scriptsmentioning
confidence: 99%
“…Deciphering ancient scripts through the study of inscriptions, known as epigraphy, provides valuable insights into historical languages and cultures. However, the automated recognition and interpretation of ancient writing systems pose considerable challenges for ML techniques [1,2]. In particular, Old Aramaic scripts present a highly complex task for algorithmic analysis.…”
Section: Introductionmentioning
confidence: 99%
“…Another area is the study of calligraphic ductus , in which computerised palaeography enables a stylometric analysis to be carried out using algorithms. This is very useful when attempting to identify handwriting in fields such as palaeography, epigraphy and diplomacy (Azmi et al ., 2011; Cuéllar, 2023; Sommerschield et al ., 2023; Wolf et al ., 2011). It can even transcribe documents, examining and comparing them to recognise their authorship (Kang, 2021; Meza-Lovn, 2012; Tuzzi and Cortelazzo, 2018).…”
Section: Introductionmentioning
confidence: 99%
“…While numerous approaches for OCR/HTR and text reconstruction tackle different languages (e.g., Akkadian (Lazar et al, 2021;Fetaya et al, 2020), hieroglyphs (Barucci et al, 2021), etc. ), the analysis of historical texts is heavily biased towards Ancient Greek and Latin (Sommerschield et al, 2023).…”
Section: Introductionmentioning
confidence: 99%