Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics 2020
DOI: 10.18653/v1/2020.acl-main.598

Unsupervised Morphological Paradigm Completion

Abstract: We propose the task of unsupervised morphological paradigm completion. Given only raw text and a lemma list, the task consists of generating the morphological paradigms, i.e., all inflected forms, of the lemmas. From a natural language processing (NLP) perspective, this is a challenging unsupervised task, and high-performing systems have the potential to improve tools for low-resource languages or to assist linguistic annotators. From a cognitive science perspective, this can shed light on how children acquire…

Cited by 18 publications (34 citation statements)
References 35 publications
“…Finally, introduced neural graphical models which completed paradigms based on principal parts. The unsupervised version of the paradigm completion task (Jin et al, 2020) has been the subject of a recent shared task (Kann et al, 2020b), with the conclusion that it is extremely challenging for current state-of-the-art systems. Here, we propose to, instead of generating paradigms from raw text, generate them from IGT, a resource available for many under-studied languages.…”
Section: Related Work (mentioning)
confidence: 99%
“…The objective is to generate the complete paradigms for all lemmas. Our systems for this task consist of a combination of the official baseline system (Jin et al, 2020) and our systems for Task 0. The baseline system finds inflected forms in the text, decides on the number of inflected forms per lemma, and produces pseudo training files for morphological inflection.…”
Section: Lemma Features (mentioning)
confidence: 99%
“…Our systems for Task 2 consist of a combination of the official baseline system (Jin et al, 2020) and our inflection systems for Task 0. The system is given raw text and a source file with lemmas, and generates the complete paradigm of each lemma.…”
Section: Task 2: Model Description (mentioning)
confidence: 99%
“…This is an intriguing research area that could give us the chance of recovering dead languages that have only limited written resources. Several researchers have attempted to solve this task, such as Goldsmith et al (2017), Jin et al (2020), and Erdmann et al (2020).…”
Section: Introduction (mentioning)
confidence: 99%