Proceedings of the 2nd Workshop on the Use of Computational Methods in the Study of Endangered Languages 2017
DOI: 10.18653/v1/w17-0118
|View full text |Cite
|
Sign up to set email alerts
|

Computational Support for Finding Word Classes: A Case Study of Abui

Abstract: We present a system that automatically groups verb stems into inflection classes, performing a case study of Abui verbs. Starting from a relatively small number of fully glossed Abui sentences, we train a morphological precision grammar and use it to automatically analyze and gloss words from the unglossed portion of our corpus. Then we group stems into classes based on their cooccurrence patterns with several prefix series of interest. We compare our results to a curated collection of elicited examples and il… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
1
1

Relationship

2
0

Authors

Journals

citations
Cited by 2 publications
(9 citation statements)
references
References 9 publications
0
9
0
Order By: Relevance
“…This section focuses on our approach to inferring the grammar specifications illustrated in the previous section. We take as our starting point the system of Zamaraeva et al (2019a) which integrates the morphological inference module (called MOM; Wax, 2014;Zamaraeva, 2016;Zamaraeva et al, 2017) and a module for inference of a few syntactic properties (Bender et al, 2014;Howell et al, 2017). To this integrated system we add extended inference for morphologically marked syntactic and semantic features, additional lexical classes and further syntactic properties to create , Building Analyses from Syntactic Inference in Local languages.…”
Section: Methodology: Inferring Grammar Specificationsmentioning
confidence: 99%
See 1 more Smart Citation
“…This section focuses on our approach to inferring the grammar specifications illustrated in the previous section. We take as our starting point the system of Zamaraeva et al (2019a) which integrates the morphological inference module (called MOM; Wax, 2014;Zamaraeva, 2016;Zamaraeva et al, 2017) and a module for inference of a few syntactic properties (Bender et al, 2014;Howell et al, 2017). To this integrated system we add extended inference for morphologically marked syntactic and semantic features, additional lexical classes and further syntactic properties to create , Building Analyses from Syntactic Inference in Local languages.…”
Section: Methodology: Inferring Grammar Specificationsmentioning
confidence: 99%
“…The AGGREGATION project (Bender et al, 2013(Bender et al, , 2014Howell et al, 2017;Zamaraeva et al, 2017Zamaraeva et al, , 2019a, describes its primary goal as providing the benefits of implemented, formal grammars to documentary linguists, without their having to invest time in develop-ing those grammars by hand. Such grammars are useful for testing linguistic hypotheses against data (Bierwisch, 1963;Müller, 1999;Bender, 2008b;Fokkens, 2014;Müller, 2015) as well as building treebanks which are useful for discovering examples of phenomena in a language (Bender et al, 2012;Letcher and Baldwin, 2013;Bouma et al, 2015).…”
Section: The Aggregation Projectmentioning
confidence: 99%
“…(1 Glossed examples like this one present an analysis of morphology on one particular sentence; the linguist will want to generalize to a set of hypotheses about the general morphological system in the language. For example, a question might be: Which prefixes occur with which verb stems (Zamaraeva et al, 2017) and is there any semantic coherence to the verb inflectional classes identified this way (Kratochvíl and Delpada, 2015)? In order to answer this kind of question more fully, linguists may find it helpful not only to find all possible cooccurrences of, say, prefixes and verbs, but also to visualize them as a graph.…”
Section: Case Studymentioning
confidence: 99%
“…At the same time, building language technology for morphologically complex lowresource languages requires a rule-based morphological analyzer when datasets are not large enough for ML approaches (see Garrette et al 2013;Erdmann and Habash 2018, inter alia). Our contribution is within the context of the AGGRE-GATION project, which aims to automatically infer broad typological characteristics and morphological patterns for understudied languages (Bender et al, 2013;Zamaraeva et al, 2017). We developed this visualization tool to help linguists to understand the morphological system implicit in large datasets and to refine automatically generated grammar specifications which model that morphological system.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation