2015 IEEE 31st International Conference on Data Engineering 2015
DOI: 10.1109/icde.2015.7113425
|View full text |Cite
|
Sign up to set email alerts
|

STREAMCUBE: Hierarchical spatio-temporal hashtag clustering for event exploration over the Twitter stream

Abstract: Abstract-What is happening around the world? When and where? Mining the geo-tagged Twitter stream makes it possible to answer the above questions in real-time. Although a single tweet can be short and noisy, proper aggregations of tweets can provide meaningful results. In this paper, we focus on hierarchical spatio-temporal hashtag clustering techniques. Our system has the following features: (1) Exploring events (hashtag clusters) with different space granularity. Users can zoom in and out on maps to find out… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
76
0

Year Published

2015
2015
2021
2021

Publication Types

Select...
4
3
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 126 publications
(77 citation statements)
references
References 27 publications
(16 reference statements)
1
76
0
Order By: Relevance
“…Regarding query types, existing techniques address either frequent or trending items discovery, based on the supported application. Event detection applications [1], [3], [9], [14], [16], [20], [21], [29], however, are more focused on grouping several trending keywords together to report an event rather than focusing on the scalability and performance of retrieving trending keywords. Thus, these applications are orthogonal to our work in a way that GARNET can use one of these techniques in order to report an event based on the returned trending keywords.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Regarding query types, existing techniques address either frequent or trending items discovery, based on the supported application. Event detection applications [1], [3], [9], [14], [16], [20], [21], [29], however, are more focused on grouping several trending keywords together to report an event rather than focusing on the scalability and performance of retrieving trending keywords. Thus, these applications are orthogonal to our work in a way that GARNET can use one of these techniques in order to report an event based on the returned trending keywords.…”
Section: Related Workmentioning
confidence: 99%
“…7. Frequent/Trending Items in Microblogs: Event Detection [1], [3], [9], [14], [16], [20], [21], [29]; GeoScope [4]; AFIA [24] keywords at different locations in recent time interval. However, AFIA is only able to answer frequent keywords queries and is not geared towards answering trending queries.…”
Section: Related Workmentioning
confidence: 99%
“…Unit of process Measure of burst (Sayyadi et al, 2009) Noun phrase TF, DF, IDF (O'Connor et al, 2010) Uni-gram, bi-gram, tri-gram Burstiness (Mathioudakis and Koudas, 2010) Uni-gram Burstiness (Weng and Lee, 2011) Uni-gram DF-IDF, H-measure (wavelet analysis) (Metzler et al, 2012) Uni-gram Burstiness (Li et al, 2012) N-gram of any length Deviation from Gaussian distribution (Cui et al, 2012) Hashtag Deviation from Gaussian distribution (Benhardus and Kalita, 2013 (Abdelhaq et al, 2013) Uni-gram Deviation from Gaussian distribution (Schubert et al, 2014) Uni-gram (pair) Deviation from Gaussian distribution (Feng et al, 2015) Hashtag Deviation from Gaussian distribution burst (i.e., the rapid increase of occurrence) of a phrase causes the burst of overlapping incomplete N-grams. For example, if phrase "let it be" bursts, the occurrence of some overlapping N-grams such as "let it", "let it be is", and "it be is" inevitably increases, possibly generating bursty incomplete Ngrams.…”
Section: Methodsmentioning
confidence: 99%
“…They introduced a parameter-free clustering approach called feature-pivot clustering, which attempted to detect and cluster bursty features into hot stories. Similarly, [8] interpreted events as hashtag clusters and propose a hierarchical spatio-temporal clustering of tweets into events.…”
Section: Related Workmentioning
confidence: 99%