Proceedings of the 14th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology 2016
DOI: 10.18653/v1/w16-2021
|View full text |Cite
|
Sign up to set email alerts
|

Inferring Morphotactics from Interlinear Glossed Text: Combining Clustering and Precision Grammars

Abstract: In this paper I present a k-means clustering approach to inferring morphological position classes (morphotactics) from Interlinear Glossed Text (IGT), data collections available for some endangered and low-resource languages. While the experiment is not restricted to low-resource languages, they are meant to be the targeted domain. Specifically my approach is meant to be for field linguists who do not necessarily know how many position classes there are in the language they work with and what the position clas… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
13
0

Year Published

2017
2017
2020
2020

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 7 publications
(13 citation statements)
references
References 17 publications
0
13
0
Order By: Relevance
“…The morphological graph that our system visualizes comes from the MOM morphological inference software (Wax, 2014;Zamaraeva, 2016). MOM outputs a directed acyclic graph specified in DOT (Gansner et al, 1993) where nodes are inflectional classes of words and position classes of affixes, and edges reflect the ordering possibilities of those affixes.…”
Section: System Overview 21 Back-endmentioning
confidence: 99%
See 1 more Smart Citation
“…The morphological graph that our system visualizes comes from the MOM morphological inference software (Wax, 2014;Zamaraeva, 2016). MOM outputs a directed acyclic graph specified in DOT (Gansner et al, 1993) where nodes are inflectional classes of words and position classes of affixes, and edges reflect the ordering possibilities of those affixes.…”
Section: System Overview 21 Back-endmentioning
confidence: 99%
“…The grammar can be used to produce annotations for additional unglossed data, as in Zamaraeva et al 2017, because the inferred system generalizes beyond the specific combinations of morphemes observed. 1 2 System overview 2.1 Back-end The morphological graph that our system visualizes comes from the MOM morphological inference software (Wax, 2014;Zamaraeva, 2016). MOM outputs a directed acyclic graph specified in DOT (Gansner et al, 1993) where nodes are inflectional classes of words and position classes of affixes, and edges reflect the ordering possibilities of those affixes.…”
Section: Introductionmentioning
confidence: 99%
“…The goal of the Matrix-ODIN Morphology or 'MOM' system (Wax, 2014;Zamaraeva, 2016) is to extract, from a corpus of IGT, the information required to create a computational morphological grammar of the regularized forms found in the IGT. Specifically, this information includes: (i) a set of affixes, grouped into position classes; (ii) for each affix, the form of the affix (and eventually, the associated morphosyntactic or morphosemantic features, as indicated by the glosses); (iii) for each position class, the inputs it can take (i.e.…”
Section: Matrix-odin Morphologymentioning
confidence: 99%
“…It annotates each morpheme with a label, or gloss tag. When the amount of data is insufficient, the role of such linguistic knowledge in making generalizations becomes more prominent; see (Wax 2014;Zamaraeva 2016) for approaches to extraction of morphological rules that take this path.…”
Section: Introductionmentioning
confidence: 99%