2009
DOI: 10.1007/978-3-642-03348-3_58
|View full text |Cite
|
Sign up to set email alerts
|

Incremental Document Clustering Based on Graph Model

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2009
2009
2022
2022

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(3 citation statements)
references
References 5 publications
0
3
0
Order By: Relevance
“…In this paper we extend our initial results [20] and present an incremental Vietnamese document clustering approach that overcomes not only the limitations associated with vector space Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.…”
Section: Introductionmentioning
confidence: 79%
“…In this paper we extend our initial results [20] and present an incremental Vietnamese document clustering approach that overcomes not only the limitations associated with vector space Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee.…”
Section: Introductionmentioning
confidence: 79%
“…The phrase-based similarity is calculated from the shared phrases and using cosine and phrase, the hybrid similarity is calculated. In paper (Bakr et al, 2012;Hammouda & Kamel, 2004b;Momin et al, 2006;Nguyen-Hoang et al, 2009) authors have also used hybrid similarity measure to cluster the documents. K-Nearest Neighbour, Single-pass, and Hierarchical Agglomerative clustering algorithm are applied to the dataset using the hybrid similarity as a distance measure, and the quality of clusters is evaluated (Hammouda & Kamel, 2002, 2004a.…”
Section: Document Representation Modelsmentioning
confidence: 99%
“…The sampling approaches (Aggarwal et al, 2009;Cheng et al, 1998;Guha et al, 1998;Kranen et al, 2011;Lee et al, 2009;Ng et al, 2002;Pal et al, 2002;Sakai et al, 2009;Yildizli et al, 2011) usually choose the samples by a certain rule such as chisquare or divergence hypothesis (Hathaway et al, 2006). The incremental approaches (Bradley et al, 1998;Farnstrom et al, 2000;Gupta et al, 2004;Karkkainen et al, 2007;Luhr et al, 2009;Nguyen-Hoang et al, 2009;Ning et al, 2009;O'Callaghan et al, 2002;Ramakrishnan et al, 1996;Siddiqui et al, 2009;Wan et al, 2010Wan et al, , 2011 generally maintain past knowledge from the previous runs of a clustering algorithm to produce or improve the future clustering model. Nevertheless, as Hore et al (2007) pointed out, many existing algorithms for large and very large data sets are used for the crisp case, rarely for the fuzzy case.…”
Section: Introductionmentioning
confidence: 99%