Elektroniczny korpus tekstów polskich z XVII i XVIII w. – problemy teoretyczne i warsztatowe

Gruszczyński, Włodzimierz; Adamiec, Dorota; Bronikowska, Renata; Wieczorek, Aleksandra

doi:10.33896/porj.2020.8.3

Cited by 12 publications

(6 citation statements)

References 1 publication

“…The study aimed at determining whether a word acquired a metaphorical meaning over time. We used two datasets: a subset of Elektroniczny korpus tekstów polskich z XVII i XVIII w. (Gruszczyński et al. 2020; henceforth KorBa) and a subset of Korpus Polszczyzny 1830-1918 (Bilińska et al. 2016; henceforth F19). Both corpora feature circa 500 thousand word tokens.…”

Section: Results From Computational Approaches and Natural Language P... (mentioning)

confidence: 99%

Integrating Approaches to the Role of Metaphor in the Evolutionary Dynamics of Language

Pleyer, Kuleshova, Placiński

2024

Preprint


Metaphor occupies a central role not only in language use, but also in language change and evolution. Specifically, semantic extension motivated by metaphor plays an important role in extending the lexicon of languages. It is this process that enables the emergence of one of the key properties of modern languages, namely that they are open-ended, systematic, polysemous, structured semiotic systems. Here, we review results from three approaches whose integration presents an important cornerstone for an interdisciplinary account of the role of metaphor in the evolutionary dynamics of language: (1) Historical linguistics and diachronic semantics (2) Computational approaches and natural language processing, and (3) Experimental semiotics. Research in historical linguistics has shown that metaphor is a major mechanism of semantic change. Diachronic semantic analyses have not only mapped detailed historical trajectories of semantic extension motivated by metaphor, but also identified common metaphoric pathways of change as well as shared cognitive principles underlying them. Computational approaches and natural language processing have used findings and data from historical linguistics in attempts to automate the detection of metaphoric semantic change and to build data-driven models of the principles underlying it. Experimental semiotics is a paradigm in which participants have to create novel communication systems in the absence of language. It represents a paradigm that can investigate cultural linguistic evolution and the emergence of metaphors and metaphorical extensions under controlled laboratory settings to shed light on the interactional and cognitive principles involved in it. Combining results from these approaches represents an important first step towards an interdisciplinary, integrative account of the role of metaphor, and processes of polysemous meaning extension more generally, in the evolutionary dynamics of language.

“…Early corpus linguistics, which treated textual evidence the way one treats a native speaker, resembled the philological method. Soon, however, the researcher's eye began to register above all what is serial in a corpus, and the burden of proof shifted to primarily quantitative argumentation. It is not the single peculiar construction or form but precisely what is typical that becomes the object of researchers' interest. Linguists interested in the past of the language are increasingly following this path as well. Wider use of advanced statistical techniques requires larger corpora. In historical linguistics the shortage of texts has always been a bottleneck; it is worth noting, however, that this was less acute as long as the philologist worked with index card and pen, since the greatest limitation was the pace of the philologist's own work. Today, when searching collections of millions or even billions of words poses no problem, it is precisely the insufficient number of old texts that becomes the researcher's greatest obstacle. This does not mean, of course, that awareness of the role of quantitative data was absent from historical linguistics in the era preceding electronic corpora. An excellent work in which the course of qualitative changes is traced through precise quantitative description is the study by Irena Bajerowa (1964), as is the text by Anna Wierzbicka (1966), to mention only two older works. Let us return, then, to historical corpora themselves. With little risk of error one can say that the first such corpora documented the English language from its beginnings to the 18th century (Rissanen 1992). The corpus of early Polish that was created first, and that also documents the oldest layer of the language, is the Korpus tekstów staropolskich (created by the team of the Słownik staropolski at IJP PAN and described in Twardzik, Górski 2003). This corpus covers essentially all known continuous Polish texts up to the year 1500. The 16th century is represented by the corpus being built by the Pracownia Słownika Polszczyzny XVI Wieku at IBL PAN. Neither of these corpora is lemmatized or provided with inflectional (morphosyntactic) annotation. The period 1600-1772 is covered by the Elektroniczny Korpus Tekstów Polskich z XVII i XVIII w. (do 1772 r.) (KorBa, cf. Gruszczyński, Adamiec, Ogrodniczuk 2013; Gruszczyński, Adamiec, Bronikowska, Kieraś, Modrzejewski, Wieczorek, Woliński 2022). Of course, the boundary between a historical and a diachronic corpus can be fluid. For example, KorBa is essentially not constructed as a diachronic corpus. It nevertheless covers a period of 172 years (and thus not much less than CLMET), a period in which many changes took place, including systemic ones, so it can serve very well for studying their course. Thanks to the metadata, the user can create arbitrary chronologically ordered subcorpora, though the user must bear in mind that these will probably differ in both size and composition. The problem of the varied composition of the subcorpora of a diachronic corpus cannot, in any case, be eliminated. Note that in the case of Polish the 15th century is represented almost exclusively by religious and legal texts, which today are far from the most important text types. New text types appear gradually, and the press, fundamental today, emerges on a wider scale only in the 19th century. We write about this in the context of a planned undertaking, the creation of the Narodowy Diachroniczny Korpus Polszczyzny (National Diachronic Corpus of Polish), which would merge the existing historical corpora so that, representing all periods, they would constitute a diachronic corpus (Król et al. 2019), but also in order to emphasize very clearly that the corpus described here is a historical corpus, but also a synchronic one…”

Section: Michał Woźniak (unclassified)

Korpus XIX w. Uniwersytetu Warszawskiego i IJP PAN

2023


Corpus of the 19th Century of the Warsaw University and IJP PAN. The article describes a historical corpus which documents the 19th and early 20th century. The corpus was created as part of a research grant whose objective was to investigate the development of the aspectual system of Polish in the last 250 years against the background of Czech and Russian. An important resource for this investigation was a database of aspectual triplets, which, in turn, was based on materials such as text corpora. Since there was no large corpus of the 19th and early 20th century available, there was a need to bridge this gap. In the course of the project, such a corpus was created and it is now publicly accessible with no restrictions. This comprehensive corpus contains over 12 million contemporary words. Its texts originate from major Polish virtual libraries. It is POS-tagged with a tagger dedicated to 19th century texts. A web-based concordancer, an adjusted version of ParaVoz, allows for querying the corpus. The queries may be constrained by metadata.


“…Examples from the 17th and 18th c. have been extracted from the Corpus of Polish Texts of the 17th and 18th c. with aid of the search engine Korba (https://korba.edu.pl/) (Gruszczyński, Adamiec & Ogrodniczuk 2013). As in the case of the previous analysis, to ensure relative commensuration of the obtained results, we used an annotation system and searching procedures that were compatible with the annotations and searches made in the National Corpus of the Polish Language.…”

Section: Empirical Research (mentioning)

confidence: 99%

The rise of the WZIĄĆ (TAKE) Serial Verb Construction in Polish

Andrason, Gębka-Wolak, Moroz

2022

SPILPLUS


The present study is dedicated to the emergence of an asymmetrical serial verb construction (SVC) with the verb wziąć in Polish. By making use of a dynamic prototype-driven approach to linguistic categorization and by reviewing the historical corpora that range from the first Old Polish texts in the 14th c. until the end of the New Polish period in 1939, the authors conclude that the wziąć SVC has resulted from the fusion of the original conjunctively coordinated (CC) clauses. Although two types of clause-fusion mechanisms have operated during the grammaticalization of the wziąć SVCs, their contribution to this process has been dissimilar. The evolution from the syndetic CC with the coordinator i to the wziąć SVC via a pseudo-coordinated (PC) stage (i.e., the wziąć-i PC) has constituted a faster and stronger drift, while the more direct evolution originating in the asyndetic CC with wziąć has been slower and less pervasive.
