2004
DOI: 10.3233/jcm-2004-41-209
|View full text |Cite
|
Sign up to set email alerts
|

Using unlabeled data to improve classification in the naive bayes approach: Application to web searches

Abstract: This paper introduces a method to build a classifier based on labeled and unlabeled data. We set up the EM algorithm steps for the particular case of the naive Bayes approach and show empirical work for the restricted web page database. Original contributions includes the application of the EM algorithm to simulated data in order to see the behavior of the algorithm for different numbers of labeled and unlabeled data, and to study the effect of the sampling mechanism for the unlabeled data on the results.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2007
2007
2007
2007

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
references
References 11 publications
0
0
0
Order By: Relevance