Janusz S. Bień scite author profile

The paper describes a system for dealing with nestings of belief in terms of the mechanism of computational environment. A method is offered for computing the beliefs of A about B (and so on) in terms of the systems existing knowledge structures about A and B separately. A proposal for belief percolation is put forward: percolation being a side effect of the process of the computation of nested beliefs, but one which could explain the acquisition of unsupported beliefs. It is argued that the mechanism proposed is compatible with a general least effort hypothesis concerning human mental functioning.

show abstract

Representing Parkosz's alphabet in the Junicode font

Bień¹

2022

View full text Add to dashboard Cite

The IMPACT project Polish Ground-Truth texts as a Djvu corpus

Bień

2014

View full text Add to dashboard Cite

The purpose of the paper is twofold. First, to describe the already implemented idea of DjVu corpora, i.e. corpora which consist of both scanned images and a transcription of the texts with the words associated with their occurrences in the scans. Secondly, to present a case study of a corpus consisting of almost 5 000 pages of Polish historical texts dating from 1570 to 1756 (it is practically the very first corpus of historical Polish). The tools described have universal character and are freely available under the GNU GPL license, hence they can be used also for other purposes.

show abstract

Efficient Search in Hidden Text of Large DjVu Documents

Bień

2011

View full text Add to dashboard Cite

The paper describes an open-source tool which allows to present endusers with results of advanced language technologies. It relies on the DjVu format, which for some applications is still superior to other modern formats including PDF/A. The DjVu GPLed tools are not limited just to the DjVuLibre library, but are being supplemented by various new programs, such as pdf2djvu developed by Jakub Wilk. It allows in particular to convert to DjVu the PDF output of popular OCR programs like FineReader preserving the hidden text layer and some other features.The tool in question has been conceived by the present author and consist of a modification of the Poliqarp corpus query tool, used for National Corpus of Polish; his ideas have been very succesfully implemented by Jakub Wilk. The new system, called here simply Poliqarp for DjVu, inherits from its origin not only the powerfull search facilities based on two-level regular expressions, but also the ability to represent low-level ambiguities and other linguistic phenomena. Although at present the tool is used mainly to facilitate access to the results of dirty OCR, it is ready to handle also more sophisticated output of linguistic technologies.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Janusz S. Bień

Beliefs, points of view, and multiple environments

Beliefs, Points of View, and Multiple Environments*

Representing Parkosz's alphabet in the Junicode font

The IMPACT project Polish Ground-Truth texts as a Djvu corpus

Efficient Search in Hidden Text of Large DjVu Documents

Contact Info

Product

Resources

About