2015
DOI: 10.1007/s10579-015-9326-3
|View full text |Cite
|
Sign up to set email alerts
|

FinnPos: an open-source morphological tagging and lemmatization toolkit for Finnish

Abstract: This paper describes FinnPos, an open-source morphological tagging and lemmatization toolkit for Finnish. The morphological tagging model is based on the averaged structured perceptron classifier. Given training data, new taggers are estimated in a computationally efficient manner using a combination of beam search and model cascade. The lemmatization is performed employing a combination of a rule-based morphological analyzer, OMorFi, and a data-driven lemmatization model. The toolkit is readily applicable for… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
25
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
3
2
1

Relationship

3
3

Authors

Journals

citations
Cited by 25 publications
(27 citation statements)
references
References 14 publications
0
25
0
Order By: Relevance
“…The FinnPos tagger toolkit (Silfverberg et al, 2015) is used to train the models for the structured system and the HFST Python interface (Lindén et al, 2011) is used for constructing and operating finite-state machines. When training FinnPos models, we used default settings for most hyperparameters.…”
Section: Methodsmentioning
confidence: 99%
“…The FinnPos tagger toolkit (Silfverberg et al, 2015) is used to train the models for the structured system and the HFST Python interface (Lindén et al, 2011) is used for constructing and operating finite-state machines. When training FinnPos models, we used default settings for most hyperparameters.…”
Section: Methodsmentioning
confidence: 99%
“…FinnPos [20] is a data driven morphological tagging toolkit distributed with the HFST interface. The term morphological tagging [6] refers to assigning one full morphological label, including for example part-of-speech, tense, case and number, to each word in a text.…”
Section: Morphological Tagging Using Hfst-finnposmentioning
confidence: 99%
“…In contrast, FinnPos is especially geared toward morphologically rich languages with large label sets, that cause data sparsity and slow down estimation when using standard solutions. FinnPos gives state-of-the-art results for the morphologically rich language Finnish [20] both with regard to runtime and accuracy. In addition to morphological tagging, FinnPos also performs data driven lemmatization.…”
Section: Morphological Tagging Using Hfst-finnposmentioning
confidence: 99%
See 2 more Smart Citations