2012
DOI: 10.1007/978-3-642-25261-7_21
|View full text |Cite
|
Sign up to set email alerts
|

Wikipedia-Based Document Categorization

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
8
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
6
1
1

Relationship

2
6

Authors

Journals

citations
Cited by 8 publications
(8 citation statements)
references
References 4 publications
0
8
0
Order By: Relevance
“…Several methods of automatic Polish text categorization and clustering have been implemented and examined over the past years. Ciesielski et al [2] presented a novel method of text categorization based on the Polish Wikipedia resources. Kuta and Kitowski [3] use clustering algorithms applied to two different corpora of the Polish language.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Several methods of automatic Polish text categorization and clustering have been implemented and examined over the past years. Ciesielski et al [2] presented a novel method of text categorization based on the Polish Wikipedia resources. Kuta and Kitowski [3] use clustering algorithms applied to two different corpora of the Polish language.…”
Section: Background and Related Workmentioning
confidence: 99%
“…Here, of course, it is necessary to use a shallow analysis of natural language, identifying named entities and the use of appropriate semantic resources (lists of individuals or organizations or types of organizations that are trustworthy). Also one needs methods for appropriate classification of the content of the page [3] to match it against the list of experts. [8] proposes a number of methods for assessing the quality of Web pages edited by communities, in which the method of time series analysis of changes and of the list of readers / writers is exploited.…”
Section: Measuring Information Qualitymentioning
confidence: 99%
“…Within the system NEKST the following types of semantic transformations have been implemented: -user suggestions [22], -substitution with synonyms, hypernyms, hyponyms and other related concepts, -concept disambiguation [3], -document categorization [3], -personalized PageRank [15], -cluster analysis and assignment of cluster keywords to documents [2], -explicit separation of document cluster and document search, -extraction of named entities and relations between them [23], -diversification of responses to queries, -dynamic summarizing [13], and -identification and classification of harmful contents.…”
Section: Measuring Utilitymentioning
confidence: 99%
“…Via this component the traditional notion of document similarity (based on angles between vectors in term space) is amended to include the concept of semantic similarity. The notion of semantic similarity, as used in this paper, was described in [1]. Both methods introduced in the paper are based on our SemCat (Semantic Categorizer) algorithm, that has also been introduced in [1].…”
Section: Introductionmentioning
confidence: 99%
“…The notion of semantic similarity, as used in this paper, was described in [1]. Both methods introduced in the paper are based on our SemCat (Semantic Categorizer) algorithm, that has also been introduced in [1].…”
Section: Introductionmentioning
confidence: 99%