2013
DOI: 10.1007/s10579-013-9244-1

Building the essential resources for Finnish: the Turku Dependency Treebank

Abstract: In this paper, we present the final version of a publicly available treebank of Finnish, the Turku Dependency Treebank. The treebank contains 204,399 tokens (15,126 sentences) from 10 different text sources and has been manually annotated in a Finnish-specific version of the well-known Stanford Dependency scheme. The morphological analyses of the treebank have been assigned using a novel machine learning method to disambiguate readings given by an existing tool. As the second main contribution, we present the f…

Cited by 83 publications (66 citation statements)
References 22 publications (23 reference statements)
“…The PropBank presented in this paper is built on top of the Turku Dependency Treebank (TDT) (Haverinen et al 2013b). This treebank consists of 204,399 tokens (15,126 sentences) of text from 10 different genres of general Finnish, such as the Finnish Wikipedia, financial news and amateur fiction.…”
Section: Corpus: the Turku Dependency Treebank (mentioning)
confidence: 99%
“…1. For further details on the treebank, we refer the reader to the paper by Haverinen et al (2013b) and the annotation manual by Haverinen (2012).…”
Section: Syntactic Functions Of Relativizers (mentioning)
confidence: 99%
“…We therefore parsed our data using the Turku Finnish Dependency Parser (Haverinen et al, 2014) which is now available 1 . This parser works efficiently and we were able to process raw input text.…”
Section: Morphological Stemming (mentioning)
confidence: 99%