2010
DOI: 10.1007/978-3-642-13022-9_55
|View full text |Cite
|
Sign up to set email alerts
|

ITSA ⋆ : An Effective Iterative Method for Short-Text Clustering Tasks

Abstract: Abstract. The current tendency for people to use very short documents, e.g. blogs, text-messaging, news and others, has produced an increasing interest in automatic processing techniques which are able to deal with documents with these characteristics. In this context, "short-text clustering" is a very important research field where new clustering algorithms have been recently proposed to deal with this difficult problem. In this work, ITSA , an iterative method based on the bio-inspired method PAntSA is propo… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2014
2014
2017
2017

Publication Types

Select...
3
2

Relationship

2
3

Authors

Journals

citations
Cited by 5 publications
(8 citation statements)
references
References 10 publications
0
8
0
Order By: Relevance
“…CLUDIPSO and CLUDIPSO ⋆ used the 11 Corpora are considered medium size if the number of documents is between 1000 and 10000.…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…CLUDIPSO and CLUDIPSO ⋆ used the 11 Corpora are considered medium size if the number of documents is between 1000 and 10000.…”
Section: Resultsmentioning
confidence: 99%
“…The study of alternative methods to perform the initial sub-grouping and start the search process with higher quality initial subgroups could be an interesting extension. Another possible extension, is to use the results obtained with CLUDIPSO ⋆ as input to a recent (and effective) boosting method [11] and obtain thus, higher quality solutions.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…In particular, some recent bio-inspired proposals have gained increasing interest in short -text clustering. These approaches include algorithms based on Particle Swarm Optimization techniques (Cagnina et al 2008; Ingaramo et al 2009) and ant-behavior-based approaches (Errecalde, Ingaramo and Rosso 2010; Ingaramo, Errecalde and Rosso 2010 a, Errecalde and Rosso 2010 b).…”
Section: Introductionmentioning
confidence: 99%
“…Among the considerable number of ICVMs proposed up to now, the GS coefficient has shown good results as cluster validation method with respect to other well-known validity measures (Brun et al 2007). Furthermore, the silhouette coefficient has also shown its potential for determining the optimal number of groups in a clustering problem (Rousseeuw 1987; Tan et al 2005; Choi et al 2011), estimating how difficult a corpus is for an arbitrary clustering algorithm (Errecalde et al 2008), computing a target function to be optimized (Cagnina et al 2008; Ingaramo et al 2009), automatically determining a threshold for a similarity function (Bonato dos Santos et al 2011) and as a key component in other internal process of clustering algorithms (Aranganayagi and Thangavel 2007; Errecalde et al 2010; Ingaramo et al 2010a).…”
Section: Introductionmentioning
confidence: 99%