2009
DOI: 10.1016/j.datak.2008.08.008
|View full text |Cite
|
Sign up to set email alerts
|

An active learning framework for semi-supervised document clustering with language modeling

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
23
0

Year Published

2010
2010
2019
2019

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 47 publications
(23 citation statements)
references
References 15 publications
0
23
0
Order By: Relevance
“…(2) Incrementally updating the cluster tree: when the number of documents increases sequentially in a document set, it is inefficient to reform the cluster tree for each new insertion. That is, it is admirable to reflect the current state of the whole document set by incrementally updating the cluster tree [28][29][30]. Therefore, we intend to propose an efficient incremental clustering algorithm for assigning a new document to the most similar existing cluster in the future.…”
Section: Discussionmentioning
confidence: 99%
“…(2) Incrementally updating the cluster tree: when the number of documents increases sequentially in a document set, it is inefficient to reform the cluster tree for each new insertion. That is, it is admirable to reflect the current state of the whole document set by incrementally updating the cluster tree [28][29][30]. Therefore, we intend to propose an efficient incremental clustering algorithm for assigning a new document to the most similar existing cluster in the future.…”
Section: Discussionmentioning
confidence: 99%
“…Nogueira et al introduce a new active semi-supervised hierarchical clustering method [13]. This strategy uses not only cluster-level constraints [9] where the user can indicate a pair of clusters to be merged but also an innovative concept called confidence. When there is lower confidence in a cluster merge the user can be queried and provide a cluster-level constraint.…”
Section: Related Workmentioning
confidence: 98%
“…In this paper, the user is asked to give feedback at the feature level instead of the document level. Except active learning of document constraints such as [15], most semi-supervised clustering algorithms involve the user supervision outside the clustering process. In this way, all the document constraints are defined before the clustering starts.…”
Section: Related Workmentioning
confidence: 99%