Importance Sketching of Influence Dynamics in Billion-Scale Networks

Nguyen, Hung T.; Nguyen, Tri; Phan, NhatHai; Dinh, Thang N.

doi:10.1109/icdm.2017.43

Cited by 17 publications

(36 citation statements)

References 27 publications

“…Since an influence estimator is randomized, each algorithm run generates random solutions as well. Despite this nature, most of the previous studies conducted few-trials experiments only, e.g., the number of trials is 3 in [70], 5 in [69], 10 in [13,15,47], 20 in [30], 50 in [16], and not explicitly stated in [14,17,19,24,26,27,31,38,39,56,57,60-62]; conclusions based on them would be questionable. In this paper, we analyze the empirical distribution of random solutions made from 1,000 trials to gain a deeper understanding of the stochastic behavior of randomized algorithms.…”

Section: Our Motivations (mentioning)

confidence: 99%

“…Unlike the case of Oneshot and Snapshot, most of the research on RIS focus on a proper selection of sample number θ, or equivalently, a stopping condition for RR-set generation. The standard requirement is to draw as few RR sets as possible that yield a "theoretical worst-case guarantee" on a (1 − 1/e − ϵ)-approximation with probability 1 − δ [7,30,56,57,60,61,68-70].…”

Section: Efficient Implementations (mentioning)

confidence: 99%
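
The role of the sample number θ in the statement above can be illustrated with a minimal sketch. It assumes θ RR sets have already been drawn and selects seeds by greedy maximum coverage, which is a common way to use RR sets but is not confirmed by the quoted text as the method of any specific cited paper. The function names `greedy_max_coverage` and `estimate_spread` are hypothetical.

```python
from collections import defaultdict


def greedy_max_coverage(rr_sets, k):
    """Greedily pick up to k nodes covering as many RR sets as possible.

    rr_sets: list of sets of node ids (assumed to be already sampled).
    Returns (seeds, number_of_covered_rr_sets).
    """
    # Map each node to the indices of the RR sets that contain it.
    node_to_sets = defaultdict(set)
    for idx, rr in enumerate(rr_sets):
        for node in rr:
            node_to_sets[node].add(idx)

    seeds, covered = [], set()
    for _ in range(k):
        best, best_gain = None, 0
        # Sorted iteration gives deterministic tie-breaking (smallest id wins).
        for node in sorted(node_to_sets):
            if node in seeds:
                continue
            gain = len(node_to_sets[node] - covered)
            if gain > best_gain:
                best, best_gain = node, gain
        if best is None:  # no remaining node adds coverage
            break
        seeds.append(best)
        covered |= node_to_sets[best]
    return seeds, len(covered)


def estimate_spread(num_nodes, num_covered, theta):
    """Estimate influence as n times the fraction of covered RR sets."""
    return num_nodes * num_covered / theta


if __name__ == "__main__":
    toy_rr_sets = [{1, 2}, {2, 3}, {3}, {4}]  # illustrative data only
    seeds, covered = greedy_max_coverage(toy_rr_sets, k=2)
    print(seeds, estimate_spread(4, covered, len(toy_rr_sets)))
```

A larger θ makes the covered fraction a tighter estimate of the true spread, which is why the stopping condition for RR-set generation determines the approximation guarantee.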

The Solution Distribution of Influence Maximization: A High-level Experimental Study on Three Algorithmic Approaches

Ohsaka

2020

Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data

Influence maximization is among the most fundamental algorithmic problems in social influence analysis. Over the last decade, a great effort has been devoted to developing efficient algorithms for influence maximization, so that identifying the "best" algorithm has become a demanding task. In SIGMOD'17, Arora, Galhotra, and Ranu reported benchmark results on eleven existing algorithms and demonstrated that there is no single state-of-the-art offering the best trade-off between computational efficiency and solution quality. In this paper, we report a high-level experimental study on three well-established algorithmic approaches for influence maximization, referred to as Oneshot, Snapshot, and Reverse Influence Sampling (RIS). Different from Arora et al., our experimental methodology is so designed that we examine the distribution of random solutions, characterize the relation between the sample number and the actual solution quality, and avoid implementation dependencies. Our main findings are as follows: 1. For a sufficiently large sample number, we obtain a unique solution regardless of algorithms. 2. The average solution quality of Oneshot, Snapshot, and RIS improves at the same rate up to scaling of sample number. 3. Oneshot requires more samples than Snapshot, and Snapshot requires fewer but larger samples than RIS. We discuss the time efficiency when conditioning Oneshot, Snapshot, and RIS to be of identical accuracy. Our conclusion is that Oneshot is suitable only if the size of available memory is limited, and RIS is more efficient than Snapshot for large networks; Snapshot is preferable for small, low-probability networks.

“…The main idea of RIS is not to estimate the influence from seed nodes, but to randomly sample nodes and run Monte Carlo simulations in opposite direction to search the nodes which can influence the sampled nodes. This motivates many studies to further improve the sampling technology [27]- [30] and memory consumption [31], [32].…”

Section: A Traditional Influence Maximization (mentioning)

confidence: 99%
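
The reverse-direction sampling described in the statement above can be sketched as follows. This is a minimal illustration that assumes the independent cascade model with per-edge activation probabilities; the quoted text does not fix a diffusion model. The function name `sample_rr_set` and the `in_edges` layout (node mapped to a list of `(in_neighbor, probability)` pairs) are hypothetical choices made for the example.

```python
import random
from collections import deque


def sample_rr_set(in_edges, nodes, rng):
    """Sample one reverse-reachable (RR) set.

    in_edges: dict mapping node v to a list of (u, p) pairs, meaning an
              edge u -> v that is live with probability p (assumed IC model).
    nodes:    list of all node ids; the root is drawn uniformly from it.
    rng:      a random.Random instance, passed in for reproducibility.
    """
    root = rng.choice(nodes)
    rr = {root}
    queue = deque([root])
    while queue:
        v = queue.popleft()
        # Walk edges backwards: u can influence v if edge u -> v is live.
        for u, p in in_edges.get(v, []):
            if u not in rr and rng.random() < p:
                rr.add(u)
                queue.append(u)
    return rr


if __name__ == "__main__":
    # Toy chain 0 -> 1 -> 2 with illustrative probabilities.
    toy_in_edges = {1: [(0, 0.5)], 2: [(1, 0.5)]}
    rng = random.Random(42)
    print([sample_rr_set(toy_in_edges, [0, 1, 2], rng) for _ in range(3)])
```

Each returned set contains the nodes that could have influenced the randomly chosen root in one simulated world, so nodes appearing in many RR sets are good seed candidates.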

Location-Based Seeds Selection for Influence Blocking Maximization in Social Networks

et al. 2019

Influence blocking maximization (IBM) is a key problem for viral marketing in competitive social networks. Although the IBM problem has been extensively studied, existing works neglect the fact that location information can play an important role in influence propagation. In this paper, we study location-based seeds selection for the IBM problem, which aims to find a positive seed set in a given query region to block the negative influence propagation in a given block region as much as possible. In order to overcome the low efficiency of the simulation-based greedy algorithm, we propose a heuristic algorithm IS-LSS and its improved version IS-LSS+, both of which are based on the maximum influence arborescence structure and a Quadtree index, while IS-LSS+ further improves the efficiency of IS-LSS by using an upper bound method and Quadtree cell lists. The experimental results on real-world datasets demonstrate that our proposed algorithms achieve a blocking effect matching that of the greedy algorithm as the number of positive seeds increases, and often a better one than other heuristic algorithms, while being four orders of magnitude faster than the greedy algorithm.

“…IVM includes two components: generating IBS to estimate the benefit function and new strategy to find candidate solution and checks its approximation guarantee condition by developing two lower and upper bound functions. iterations (lines 4-14). In each iterator t, the algorithm maintains a set R_t consists N_1 · 2^(t−1) and finds a candidate solution S_t by using Improve Greedy Algorithm (IGA) for Budgeted Maximum Coverage (BMC) problem [6].…”

Section: Importance Benefit Sampling (mentioning)

confidence: 99%

“…Our algorithm, namely Importance sample-based for Viral Marketing (IVM), contains two innovative techniques: 1) We note that importance samples (in the space of all benefit samples) can be used to estimate the benefit function. This leads to a general result of using importance sketches to estimate the influence spread function for IM [12]. 2) Based on that we design a new strategy to check approximation guarantee condition of candidate solutions.…”

Section: Introduction (mentioning)

confidence: 99%

Importance Sample-Based Approximation Algorithm for Cost-Aware Targeted Viral Marketing

Pham, Duong, Thai

2019

Computational Data and Social Networks

Cost-aware Targeted Viral Marketing (CTVM), a generalization of Influence Maximization (IM), has received a lot of attention recently due to its commercial value. Previous approximation algorithms for this problem required a large number of samples to ensure an approximation guarantee. In this paper, we propose an efficient approximation algorithm which uses fewer samples but provides the same theoretical guarantees based on generating and using important samples in its operation. Experiments on real social networks show that our proposed method outperforms the state-of-the-art algorithm which provides the same approximation ratio in terms of the number of required samples and running time.
