Proceedings of the 17th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology 2020
DOI: 10.18653/v1/2020.sigmorphon-1.19
|View full text |Cite
|
Sign up to set email alerts
|

CLUZH at SIGMORPHON 2020 Shared Task on Multilingual Grapheme-to-Phoneme Conversion

Abstract: This paper describes the submission by the team from the Institute of Computational Linguistics, Zurich University, to the Multilingual Grapheme-to-Phoneme Conversion (G2P) Task of the SIGMORPHON 2020 challenge. The submission adapts our system from the 2018 edition of the SIGMORPHON shared task. Our system is a neural transducer that operates over explicit edit actions and is trained with imitation learning. It is well-suited for morphological string transduction partly because it exploits the fact that the i… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
11
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(11 citation statements)
references
References 11 publications
0
11
0
Order By: Relevance
“…The CLUZH team adapted their earlier model, character-level neural transducer, to work on large datasets (Makarov and Clematide, 2020). The model has previously shown superior performance, especially in low-resource scenarios.…”
Section: Baseline Systemsmentioning
confidence: 99%
See 1 more Smart Citation
“…The CLUZH team adapted their earlier model, character-level neural transducer, to work on large datasets (Makarov and Clematide, 2020). The model has previously shown superior performance, especially in low-resource scenarios.…”
Section: Baseline Systemsmentioning
confidence: 99%
“…As a basis for all our submissions, we use a neural character-level transducer that edits the input string into the output string by a sequence of traditional edit actions: substitutions, insertions, deletion, and copy. The specific version of this approach was developed for grapheme-to-phoneme conversion (Makarov and Clematide, 2020a). Such neural transducers have typically performed well in morphological and related character-level transduction tasks in low to medium training data settings.…”
Section: Model Descriptionmentioning
confidence: 99%
“…During training, an aggressive exploration schedule p sampling (i) = 1 1+exp(i) where i is the training epoch number, exposes the model to configurations sampled by executing edit actions from the model. For an extended description of the SED policy and IL training, we refer the reader to the last year's system description paper (Makarov and Clematide, 2020).…”
Section: Model Descriptionmentioning
confidence: 99%
“…Two forms of error analysis were employed here. First, after Makarov and Clematide (2020), the most frequent error types in each language are shown in Table 7. From this table it is clear that many errors can be attributed either to the ambiguity of a language's writing system.…”
Section: Error Analysismentioning
confidence: 99%