Artificial Intelligence and Applications 2022
DOI: 10.5121/csit.2022.120908
|View full text |Cite
|
Sign up to set email alerts
|

Learning to Pronounce as Measuring Cross-Lingual Joint Orthography-Phonology Complexity

Abstract: Machine learning models allow us to compare languages by showing how hard a task in each language might be to learn and perform well on. Following this line of investigation, we explore what makes a language “hard to pronounce” by modelling the task of grapheme-to-phoneme (g2p) transliteration. By training a character-level transformer model on this task across 22 languages and measuring the model’s proficiency against its grapheme and phoneme inventories, we show that certain characteristics emerge that separ… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 0 publications
0
1
0
Order By: Relevance
“…The reading and writing of Korean characters strictly adhere to the fundamental prin-ciple of left-to-right and top-to-bottom orientations. It is worth mentioning that the syllable structures in Korean can be classified as CVC, VC, CV, or V [24][25][26][27]. To provide a visual example of Korean text, Figure 1 illustrates a representative sample.…”
Section: Attributes Extraction Of Korean Language Based On Prior Know...mentioning
confidence: 99%
“…The reading and writing of Korean characters strictly adhere to the fundamental prin-ciple of left-to-right and top-to-bottom orientations. It is worth mentioning that the syllable structures in Korean can be classified as CVC, VC, CV, or V [24][25][26][27]. To provide a visual example of Korean text, Figure 1 illustrates a representative sample.…”
Section: Attributes Extraction Of Korean Language Based On Prior Know...mentioning
confidence: 99%