Proceedings of the Fourth International Workshop on Computatinal Linguistics of Uralic Languages 2018
DOI: 10.18653/v1/w18-0209
|View full text |Cite
|
Sign up to set email alerts
|

Initial Experiments in Data-Driven Morphological Analysis for Finnish

Abstract: This paper presents initial experiments in data-driven morphological analysis for Finnish using deep learning methods. Our system uses a character based bidirectional LSTM and pretrained word embeddings to predict a set of morphological analyses for an input word form. We present experiments on morphological analysis for Finnish. We learn to mimic the output of the OMorFi analyzer on the Finnish portion of the Universal Dependency treebank collection. The results of the experiments are encouraging and show tha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
23
1

Year Published

2019
2019
2020
2020

Publication Types

Select...
4
2

Relationship

2
4

Authors

Journals

citations
Cited by 9 publications
(26 citation statements)
references
References 8 publications
0
23
1
Order By: Relevance
“…Classically, rule-based analyzers have been augmented with statistical guessers which provide analyses for out-of-lexicon word forms (Lindén, 2009). Recently, purely data-driven morphological analysis has received increasing attention (Nicolai and Kondrak, 2017;Silfverberg and Hulden, 2018;Moeller et al, 2018;Silfverberg and Tyers, 2019). Purely data-driven systems learn an analysis model from a data set of morphologically analyzed word forms and can then be applied to unseen word forms.…”
Section: Discussionmentioning
confidence: 99%
“…Classically, rule-based analyzers have been augmented with statistical guessers which provide analyses for out-of-lexicon word forms (Lindén, 2009). Recently, purely data-driven morphological analysis has received increasing attention (Nicolai and Kondrak, 2017;Silfverberg and Hulden, 2018;Moeller et al, 2018;Silfverberg and Tyers, 2019). Purely data-driven systems learn an analysis model from a data set of morphologically analyzed word forms and can then be applied to unseen word forms.…”
Section: Discussionmentioning
confidence: 99%
“…Conversion systems for transforming sound signals of arbitrary sound events into their corresponding onomatopoeic representations have been developed using deep learning techniques . Moreover, a totally data‐driven approach called representation learning has also been investigated for training networks using embedding vectors corresponding to sound signals .…”
Section: Recent Research Trends In Environmental Sound Processingmentioning
confidence: 99%
“…The second one is created using data from the Turku Dependency Treebank. This dataset was originally presented by Silfverberg and Hulden (2018). We explicitly do not use any data from the Unimorph project.…”
Section: Datamentioning
confidence: 99%
“…Our second dataset was presented by Silfverberg and Hulden (2018). It is the Finnish part of the Universal Dependencies treebank v1 (Pyysalo et al, 2015) which has been analyzed using the OMorFi morphological analyzer (Pirinen et al, 2017).…”
Section: Finnish Treebank Datamentioning
confidence: 99%
See 1 more Smart Citation