Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2017
DOI: 10.1145/3097983.3098087
|View full text |Cite
|
Sign up to set email alerts
|

DenseAlert

Abstract: Consider a stream of retweet events -how can we spot fraudulent lock-step behavior in such multi-aspect data (i.e., tensors) evolving over time? Can we detect it in real time, with an accuracy guarantee? Past studies have shown that dense subtensors tend to indicate anomalous or even fraudulent behavior in many tensor data, including social media, Wikipedia, and TCP dumps. us, several algorithms have been proposed for detecting dense subtensors rapidly and accurately. However, existing algorithms assume that t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
2
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 43 publications
(3 citation statements)
references
References 30 publications
0
2
0
Order By: Relevance
“…Typical anomaly detection methods [15,25,31,61,81], such as local outlier factor (LOF) [24] and tree-based approaches [34,56], can be used in event tensors by converting multiple attributes to numerical ones. [20,45,62,79,80] use a stream of multi-aspect records as input. MemStream [21] can learn dynamically changing trends to handle time-varying data distribution known as concept drift [27,35,57].…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Typical anomaly detection methods [15,25,31,61,81], such as local outlier factor (LOF) [24] and tree-based approaches [34,56], can be used in event tensors by converting multiple attributes to numerical ones. [20,45,62,79,80] use a stream of multi-aspect records as input. MemStream [21] can learn dynamically changing trends to handle time-varying data distribution known as concept drift [27,35,57].…”
Section: Related Workmentioning
confidence: 99%
“…Given a large, online stream of time-stamped events, how can we statistically summarize all the event streams and find important patterns, rules, and anomalies? Time-stamped event data are generated and collected by many real applications [10,17,29,84], including online marketing analytics [52,75], social network/location-based services [28,71], and cybersecurity systems [19,80], with increasingly larger sizes and faster rates of transactions. For example, an online shopping service could generate millions of logging entries every second, with rich information about items and users.…”
Section: Introductionmentioning
confidence: 99%
“…A sophisticated approach handles the number of increasing dimensions without affecting accuracy in either situation. High-dimensional data is often referred to as multi-dimensional or multi-aspect or multi-modal data throughout the literature [43,44]. Highly detailed data from diverse platforms such as the web, social media is typically high-dimensional in nature.…”
Section: The High Dimensionality Problemmentioning
confidence: 99%