2017 IEEE International Conference on Data Mining (ICDM) 2017
DOI: 10.1109/icdm.2017.43
|View full text |Cite
|
Sign up to set email alerts
|

Importance Sketching of Influence Dynamics in Billion-Scale Networks

Abstract: Abstract-The blooming availability of traces for social, biological, and communication networks opens up unprecedented opportunities in analyzing diffusion processes in networks. However, the sheer sizes of the nowadays networks raise serious challenges in computational efficiency and scalability.In this paper, we propose a new hyper-graph sketching framework for influence dynamics in networks. The central of our sketching framework, called SKIS, is an efficient importance sampling algorithm that returns only … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
36
0

Year Published

2018
2018
2021
2021

Publication Types

Select...
4
2
1

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(36 citation statements)
references
References 27 publications
0
36
0
Order By: Relevance
“…Since an influence estimator is randomized, each algorithm run generates random solutions as well. Despite this nature, most of the previous studies conducted few-trials experiments only, e.g., the number of trials is 3 in [70], 5 in [69], 10 in [13,15,47], 20 in [30], 50 in [16], and not explicitly stated in [14,17,19,24,26,27,31,38,39,56,57,[60][61][62]; conclusions based on them would be questionable. In this paper, we analyze the empirical distribution of random solutions made from 1,000 trials to gain a deeper understanding of the stochastic behavior of randomized algorithms.…”
Section: Our Motivationsmentioning
confidence: 99%
See 1 more Smart Citation
“…Since an influence estimator is randomized, each algorithm run generates random solutions as well. Despite this nature, most of the previous studies conducted few-trials experiments only, e.g., the number of trials is 3 in [70], 5 in [69], 10 in [13,15,47], 20 in [30], 50 in [16], and not explicitly stated in [14,17,19,24,26,27,31,38,39,56,57,[60][61][62]; conclusions based on them would be questionable. In this paper, we analyze the empirical distribution of random solutions made from 1,000 trials to gain a deeper understanding of the stochastic behavior of randomized algorithms.…”
Section: Our Motivationsmentioning
confidence: 99%
“…Unlike the case of Oneshot and Snapshot, most of the research on RIS focus on a proper selection of sample number θ , or equivalently, a stopping condition for RR-set generation. The standard requirement is to draw as few RR sets as possible that yield a "theoretical worst-case guarantee" on a (1 − 1/e − ϵ)-approximation with probability 1 − δ [7,30,56,57,60,61,[68][69][70].…”
Section: Efficient Implementationsmentioning
confidence: 99%
“…The main idea of RIS is not to estimate the influence from seed nodes, but to randomly sample nodes and run Monte Carlo simulations in opposite direction to search the nodes which can influence the sampled nodes. This motivates many studies to further improve the sampling technology [27]- [30] and memory consumption [31], [32].…”
Section: A Traditional Influence Maximizationmentioning
confidence: 99%
“…IVM includes two components: generating IBS to estimate the benefit function and new strategy to find candidate solution and checks its approximation guarantee condition by developing two lower and upper bound functions. iterations (line [4][5][6][7][8][9][10][11][12][13][14]. In each iterator t, the algorithm maintains a set R t consists N 1 · 2 t−1 and finds a candidate solution S t by using Improve Greedy Algorithm (IGA) for Budgeted Maximum Coverage (BMC) problem [6].…”
Section: Importance Benefit Samplingmentioning
confidence: 99%
“…Our algorithm, namely Importance samplebased for Viral Marketing (IVM), contains two innovative techniques: 1) We note that importance samples (in the space of all benefit samples) can be used to estimate the benefit function. This leads to a general result of using importance sketches to estimate the influence spread function for IM [12]. 2) Base on that we design a new strategy to check approximation guarantee condition of candidate solutions.…”
Section: Introductionmentioning
confidence: 99%