The platform will undergo maintenance on Sep 14 at about 7:45 AM EST and will be unavailable for approximately 2 hours.
2007
DOI: 10.1007/s10844-006-0003-2
|View full text |Cite
|
Sign up to set email alerts
|

Classifying web documents in a hierarchy of categories: a comprehensive study

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
57
0

Year Published

2008
2008
2024
2024

Publication Types

Select...
5
3
1

Relationship

0
9

Authors

Journals

citations
Cited by 92 publications
(59 citation statements)
references
References 37 publications
1
57
0
Order By: Relevance
“…In clustering sample sets, we add or delete sample A or B , and for fuzzy data sample set A , B , the convergence control function of fuzzy data closed loop operation and maintenance management in massive information is obtained under the control of attenuation constant 1 T and 2 T :…”
Section: Big Data Classification and High Dimensional Information Reomentioning
confidence: 99%
See 1 more Smart Citation
“…In clustering sample sets, we add or delete sample A or B , and for fuzzy data sample set A , B , the convergence control function of fuzzy data closed loop operation and maintenance management in massive information is obtained under the control of attenuation constant 1 T and 2 T :…”
Section: Big Data Classification and High Dimensional Information Reomentioning
confidence: 99%
“…Marine green energy resources are the renewable natural energy resources contained in the oceans, which is renewable and inexhaustible in the era of existence of solar system [1].…”
Section: Introductionmentioning
confidence: 99%
“…Optimization is made feasible by utilizing decomposition of the original problem and making incremental conditional gradient search in the subproblems. Ceci & Malerba (2007) present a comprehensive study on hierarchical classification of Web documents. They extend a previous work (Ceci & Malerba, 2003) considering hierarchical feature selection mechanisms, a naïve Bayes algorithm aimed at avoiding problems related to different document lengths, the validation of their framework for a probabilistic SVM-based classifier, and (iv) an automated threshold selection algorithm.…”
Section: Hierarchical Text Categorizationmentioning
confidence: 99%
“…Some other researchers (Ceci and Malerba 2007;Sun and Lim 2001) have proposed that evaluation measures specific to the hierarchical case should be used in HTC, so that credit is given to ''partially correct'' classification, i.e., to the misclassification of a document into a category topologically close to the correct one. We think that these measures are difficult to judge in the abstract, since whether a user would gain any more benefit from a ''partially correct'' classification than from a ''completely wrong'' classification remains open to question, and fundamentally dependent on the particular application.…”
Section: Related Workmentioning
confidence: 99%