2018
DOI: 10.5120/ijca2018916252
|View full text |Cite
|
Sign up to set email alerts
|

Real Time Event Detection Adopting Incremental TF-IDF based LSH and Event Summary Generation

Abstract: Recently, twitter users are leveraged to detect social and physical events such as festivals and traffic jam at real time. Real time event detection and summarization from Cricket sports is the process of detecting events such as boundary at real time from live Cricket tweet stream as soon as event happens and generating a quick game summary. This is an interesting, yet a complex problem. Because of the need for rapid detection of sports events and for the generation of a concise summary from huge volume of tw… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2019
2019
2020
2020

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(9 citation statements)
references
References 12 publications
(5 reference statements)
0
9
0
Order By: Relevance
“…Accordingly, their novelty scores are defined as the distance of the incoming data to: an existing data point for P2P models, a cluster of existing data points for P2C models, and all the existing data points for P2A models. The P2P models are normally nearest neighbour-based [2] [19] [3] or approximate nearest neighbour-based [12] [7] that aim at finding the most similar existing story to the incoming story. The P2C models use clusters of existing stories to represent previous events and evaluate the incoming story by comparing it with these clustered events [2] [19].…”
Section: First Story Detectionmentioning
confidence: 99%
See 4 more Smart Citations
“…Accordingly, their novelty scores are defined as the distance of the incoming data to: an existing data point for P2P models, a cluster of existing data points for P2C models, and all the existing data points for P2A models. The P2P models are normally nearest neighbour-based [2] [19] [3] or approximate nearest neighbour-based [12] [7] that aim at finding the most similar existing story to the incoming story. The P2C models use clusters of existing stories to represent previous events and evaluate the incoming story by comparing it with these clustered events [2] [19].…”
Section: First Story Detectionmentioning
confidence: 99%
“…The most well-known term vector model is TF-IDF, short for term frequencyinverse document frequency, in which the weight of each term in a specific document is calculated as the product of the TF (term frequency) component and the IDF (inverse document frequency) component. There are many schemes of calculating these two components, but a widely-applied scheme is shown as follows [3] [7]:…”
Section: First Story Detectionmentioning
confidence: 99%
See 3 more Smart Citations