1991
DOI: 10.1080/02572117.1991.10586891

Towards computer-assisted word frequency studies in Northern Sotho

citations
Cited by 10 publications
(8 citation statements)
references
References 0 publications
“…The corpus era introduced for African languages and especially for Sesotho sa Leboa lexicography by Prinsloo (1991), see also Chapter 3, opened new doors for the lemmatisation of nouns and verbs namely lemmatisation based on frequency of use. Using corpus data the lexicographer can ensure that frequently used words are not accidentally omitted and, on the other hand, that precious dictionary space is not taken up by articles of which the lemma is unlikely to be looked-up by the target users.…”
Section: Frequency-based Approach (mentioning)
confidence: 99%
“…pocket size or medium size. This is exactly what word frequency studies according to Prinsloo (1991) are all about: Selecting just the right corpus of words (reflexives) for a specific dictionary and secondly preventing the omission of essential words (reflexives). The relevance of this statement is clearly underlined in the introduction to the Setswana English Afrikaans Dictionary (1990), where Snyman and Shole honestly admit:…”
Section: Results (mentioning)
confidence: 93%
“…Table 1 it is clear that 'only' and 'time' are highly used words over a broad spectrum in contrast to 'Kennan' and 'two-day'. (See Prinsloo (1991) and Johansson and Hofland (1989) for detailed discussions on the Brown and LOB Corpora as well as word frequency studies for Northern Sotho.) Thus a specific reflexive will only be regarded as relatively highly used if it occurs frequently in (a) the corpus as a whole and (b) every book or magazine (and not for example with a high frequency in one book, but not at all in the next five).…”
Section: Morphological and Semantic Realities Facing The Lexicographer (mentioning)
confidence: 99%
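
The two-part criterion in the statement above (frequent in the corpus as a whole, and present in every book or magazine) can be illustrated with a minimal Python sketch. The function name, the thresholds `min_total` and `min_per_source`, and the toy data are all assumptions made for illustration; they are not taken from Prinsloo (1991) or from the citing publication, and neither source specifies cut-off values.

```python
from collections import Counter


def widely_used_words(sources, min_total, min_per_source):
    """Return words that are frequent overall AND occur in every source.

    sources: dict mapping a source name (book, magazine) to a list of tokens.
    min_total, min_per_source: hypothetical thresholds, chosen for illustration.
    """
    # Count tokens separately for each source.
    per_source = {name: Counter(tokens) for name, tokens in sources.items()}

    # Sum the per-source counts to get whole-corpus frequencies.
    total = Counter()
    for counts in per_source.values():
        total.update(counts)

    # Keep a word only if it passes both the overall and the per-source test.
    return sorted(
        word
        for word, n in total.items()
        if n >= min_total
        and all(counts[word] >= min_per_source for counts in per_source.values())
    )


# Toy corpus: 'Kennan' is frequent in one book but absent from the others.
sources = {
    "book_a": "only time only time Kennan Kennan Kennan Kennan".split(),
    "book_b": "only time time only".split(),
    "magazine_c": "time only only".split(),
}
result = widely_used_words(sources, min_total=4, min_per_source=1)
print(result)
```

In this toy data 'Kennan' reaches the overall threshold but is excluded because it appears in only one source, which mirrors the contrast the statement draws between 'only' and 'time' on the one hand and 'Kennan' on the other.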
“…Trainable software solved, among other things, the recognition problem regarding s versus š. The corpus became known as the Pretoria Sesotho sa Leboa Corpus (PSC) and gradually grew from 156 000 running words or 'tokens' in 1990 (Prinsloo, 1991) to 5.8 million words a decade later. The availability of a corpus opened doors to a variety of new research possibilities for and insights into lexicography, linguistics, translation studies, etc.…”
Section: Lexicographic Difficulties Faced By the Pioneers (mentioning)
confidence: 99%