Simon Krek scite author profile

Universal dependencies (UD) is a framework for morphosyntactic annotation of human language, which to date has been used to create treebanks for more than 100 languages. In this article, we outline the linguistic theory of the UD framework, which draws on a long tradition of typologically oriented grammatical theories. Grammatical relations between words are centrally used to explain how predicate–argument structures are encoded morphosyntactically in different languages while morphological features and part-of-speech classes give the properties of words. We argue that this theory is a good basis for cross-linguistically consistent annotation of typologically diverse languages in a way that supports computational natural language understanding as well as broader linguistic studies.

show abstract

The Universal Dependencies Treebank for Slovenian

Dobrovoljc¹,

Erjavec²,

Krek³

2017

View full text Add to dashboard Cite

This paper introduces the Universal Dependencies Treebank for Slovenian. We overview the existing dependency treebanks for Slovenian and then detail the conversion of the ssj200k treebank to the framework of Universal Dependencies version 2. We explain the mapping of part-of-speech categories, morphosyntactic features, and the dependency relations, focusing on the more problematic language-specific issues. We conclude with a quantitative overview of the treebank and directions for further work.

show abstract

Cross-lingual Dependency Parsing of Related Languages with Rich Morphosyntactic Tagsets

Agić¹,

Tiedemann²,

Merkler³

et al. 2014

View full text Add to dashboard Cite

This article describes MetaRomance, a rule-based cross-lingual parser for Romance languages submitted to CoNLL 2017 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies. The system is an almost delexi-calized parser which does not need training data to analyze Romance languages. It contains linguistically motivated rules based on PoS-tag patterns. The rules included in MetaRomance were developed in about 12 hours by one expert with no prior knowledge in Universal Dependencies , and can be easily extended using a transparent formalism. In this paper we compare the performance of MetaRo-mance with other supervised systems participating in the competition, paying special attention to the parsing of different treebanks of the same language. We also compare our system with a delexicalized parser for Romance languages, and take advantage of the harmonized annotation of Universal Dependencies to propose a language ranking based on the syntactic distance each variety has from Romance languages .

show abstract

Discovering Automated Lexicography: The Case of the Slovene Lexical Database

2016

View full text Add to dashboard Cite

Compilation, transcription and usage of a reference speech corpus: the case of the Slovene corpus GOS

Verdonik

Kosem

Vitez

et al. 2013

Lang Resources & Evaluation

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Simon Krek

Universal Dependencies

The Universal Dependencies Treebank for Slovenian

Cross-lingual Dependency Parsing of Related Languages with Rich Morphosyntactic Tagsets

Discovering Automated Lexicography: The Case of the Slovene Lexical Database

Compilation, transcription and usage of a reference speech corpus: the case of the Slovene corpus GOS

Contact Info

Product

Resources

About