2019
DOI: 10.1109/access.2019.2927345
|View full text |Cite
|
Sign up to set email alerts
|

Discovering Topic Representative Terms for Short Text Clustering

Abstract: Clustering short texts are one of the most important text analysis methods to help extract knowledge from online social media platforms, such as Twitter, Facebook, and Weibo. However, the instant features (such as abbreviation and informal expression) and the limited length of short texts challenge the clustering task. Fortunately, short texts about the same topic often share some common terms (or term stems), which can effectively represent a topic (i.e., supported by a cluster of short texts), and we also ca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
29
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
4
3

Relationship

2
5

Authors

Journals

citations
Cited by 42 publications
(29 citation statements)
references
References 24 publications
0
29
0
Order By: Relevance
“…Several techniques have been proposed to tackle the challenges posed by sentence classification, including dimension reduction (Zelikovitz and Hirsh 2000;Sriram et al 2010;Khoo et al 2006;Bollegala et al 2018), topic modeling (Cheng et al 2014;Chen et al 2011;Yang et al 2015), clustering (Kozlowski and Rybinski 2019;Yin et al 2017;Bollegala et al 2018;Dai et al 2013;Kozlowski and Rybinski 2017;Yang et al 2019), and word embedding (Kozlowski and Rybinski 2019;Kim 2014;Lee and Dernoncourt 2016;Hill et al 2016). Kim (Kim 2014) proposed a single layer of CNN applied for sentence classification.…”
Section: Sentence Classification and Short Text Classificationmentioning
confidence: 99%
“…Several techniques have been proposed to tackle the challenges posed by sentence classification, including dimension reduction (Zelikovitz and Hirsh 2000;Sriram et al 2010;Khoo et al 2006;Bollegala et al 2018), topic modeling (Cheng et al 2014;Chen et al 2011;Yang et al 2015), clustering (Kozlowski and Rybinski 2019;Yin et al 2017;Bollegala et al 2018;Dai et al 2013;Kozlowski and Rybinski 2017;Yang et al 2019), and word embedding (Kozlowski and Rybinski 2019;Kim 2014;Lee and Dernoncourt 2016;Hill et al 2016). Kim (Kim 2014) proposed a single layer of CNN applied for sentence classification.…”
Section: Sentence Classification and Short Text Classificationmentioning
confidence: 99%
“…Jia et al proposed WordCom [13], which creates concept vectors by identifying semantic word communities in a weighted word co-occurrence network. TRTD [12] Weibo also has a forwarding function, and this process brings about the rapid dissemination and diffusion of information.…”
Section: A Topic Detectionmentioning
confidence: 99%
“…Content may change prior to final publication. in (12), in which a = |S * |, where FMI is an external evaluation method, which is used to determine the similarity between two clusterings. FMI can be calculated as shown in (13).…”
Section: ) Abstract-averagingmentioning
confidence: 99%
See 2 more Smart Citations