The YARN (Yet Another RussNet) project, started in 2013, aims to create a large open thesaurus for Russian using crowdsourcing. This paper describes the synset assembly interface developed within the project: the motivation behind it, its design, usage scenarios, implementation details, and first experimental results.
The article discusses the principles of data collection and organization for the new Universal Ideographic Dictionary-Thesaurus of Russian Vocabulary, created at the Ural Federal University (Yekaterinburg, Russia) under the guidance of Professor L. G. Babenko. The specificity of previous lexicographic sources is reviewed. The ideographic databases with which the staff of the Ural Semantic School worked earlier are characterized. Examples are given of identifying the denotative spheres and denotative-ideographic groups whose structure makes up the synopsis of the thesaurus. The number of these classes and the set of entries in the original database, which is the basis of the future dictionary, are discussed. In addition, examples of forms for working with dictionaries and the data structure (a set of fields) of the Universal Ideographic Dictionary-Thesaurus are shown. The reasons for the quantitative asymmetry between the new thesaurus and the original database are considered: statistical, logical-semantic and lexical-grammatical factors. Conclusions are drawn about the prospects of the new project as uniting and developing the ideas of the Ural Semantic School.