Application of multilingual corpus in contrastive studies (on the example of the Bulgarian-Polish-Lithuanian parallel corpus)In this paper we present applications of a trilingual corpus in language research. Comparative and contrastive studies of Polish and Bulgarian as well as Polish and Lithuanian have been already conducted, but up to the best of our knowledge no such studies exist for Bulgarian and Lithuanian. On the one hand, it is interesting to note that two Slavic languages are compared to a Baltic language (Lithuanian). On the other hand, the three languages are marginally present in the EU because of the later ascension of the three countries to the EU. The paper shortly describes the first electronic Bulgarian–Polish–Lithuanian experimental corpus, currently under development only for research. We also focus our attention on the morphosyntactic annotation of the parallel trilingual corpus according to the Corpus Encoding Standard: we present a review of the Part-of-Speech (POS) classification of the participle in the three languages – Bulgarian, Polish, and Lithuanian in comparison to another POS, the adjective. We briefly discuss tagsets for corpus annotation from the point of view of possible unification in the future with some examples.
Experimental Polish-Lithuanian Corpus with the Semantic Annotation ElementsIn the article the authors present the experimental Polish-Lithuanian corpus (ECorpPL-LT) formed for the idea of Polish-Lithuanian theoretical contrastive studies, a Polish-Lithuanian electronic dictionary, and as help for a sworn translator. The semantic annotation being brought into ECorpPL-LT is extremely useful in Polish-Lithuanian contrastive studies, and also proves helpful in translation work.
Experimental Corpus of the Lithuanian Local Dialect of Punsk in Poland. Examples of the Lexical and Semantic AnnotationIn the article the author describes the experimental corpus of the Lithuanian local dialect of Puńsk in Poland (ECorp-of-Punsk). It is the first corpus of this type for the Lithuanian local dialect. The corpus consists of three subcorpora. The first one (referred to as fundamental) contains utterances given by Lithuanians in the local dialect, the second one – utterances given by Lithuanians in Polish, the third one – aligned Polish-dialectal texts. The texts recorded in the years 1986–2012 have been included in the Ecorp-of-Punsk resources.
Semantic contrastive linguistics theory and dialectological studiesTheoretical contrastive studies (hereinafter referred to as TCS) emerged with a view to compare and contrast natural languages on the basis of a logical interlanguage. The idea of making the TCS guidelines available to science resulted in discontinuing the division into the original language and the target language when comparing and contrasting two (or more languages), and at the same time, terminating the dependence of the resulting material (i.e. form indexes in the target language) on the formal structures in the original language. The TCS essence is included in the interlanguage, which is used as tertium comparationis in the studies. To get more on this topic see Koseska, Korytkowska, R. Roszko (2007). Till now, TCS have not been applied in dialectal studies. There are a lot of reasons for this conjuncture. First of all, dialectal studies usually concentrate on one code (i.e. only a single local dialect is being specified), whilst in TCS, a comparison and contrast between (at least two) languages is provided. Moreover, research on the dialectal differentiation of a specific language (i.e. at least two dialects (/ local dialects) are being specified together) is based on demonstrating the features shared and differentiated on the level of (a) lexis, (b) morphology (most often narrowed to demonstrate differential morphological features) and (c) syntactic (relatively most rarely). Thus, dialectal studies are essentially a description of the formal conjuncture, whereas semantic aspects are out of the area of researchers interest. With this article, I am going to break the current patterns and prove that dialectal studies can be conducted in accordance with the TCS guidelines. The advantage of such dialectal studies is not only a different/new look at a specific local dialect, but also a possibility of an instant comparison and contrast between the local dialect and the standardized language or other local dialects (of one language or another) on the semantic level providing the highest standard of the relevances demonstrated (i.e. similarities and differences).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.