2019
DOI: 10.2478/jazcas-2019-0061

From the National Corpus of Polish to the Polish Corpus Infrastructure

Abstract: The National Corpus of Polish emerged as a cumulative result of many years of work on large reference corpora by computer scientists and linguists in Poland. While its impact on research in linguistics, humanities and language technology is unquestionable and highly significant, the construction of the national corpus was halted in 2011. In the paper we call for activating the research community and funding institutions around the construction of a corpus infrastructure with the national corpus at its heart. I…

Citations: cited by 2 publications (2 citation statements)
References: 12 publications
“…Several written corpora of contemporary Polish have been created, which could be used for automatic topic detection. Still, due to copyright issues, they are not freely available for download (Ogrodniczuk et al, 2022).…”
Section: Literature Review (mentioning)
confidence: 99%
“…In the long term perspective, the EPB has a chance of becoming a part of the National Corpus of Polish (NCP), as long as the research community and funding institutions will take action and focus on prolonging the national corpus project (Ogrodniczuk et al, 2019). Researchers have already been signalling the need to develop the NCP, not merely in terms of timeliness, that is adding new material to reflect the current use of the language, but also with regard to parallel subcorpora which are already part of the biggest and most respected national corpora worldwide.…”
Section: Gender Discourse In Belarusian Translational Data and In Con... (mentioning)
confidence: 99%