2019
DOI: 10.1016/j.physa.2018.12.012
|View full text |Cite
|
Sign up to set email alerts
|

Multiple propagation paths enhance locating the source of diffusion in complex networks

Abstract: We investigate the problem of locating the source of diffusion in complex networks without complete knowledge of nodes' states. Some currently known methods assume the information travels via a single, shortest path, which by assumption is the fastest way. We show that such a method leads to the overestimation of propagation time for synthetic and real networks, where multiple shortest paths as well as longer paths between vertices exist. We propose a new method of source estimation based on maximum likelihood… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 15 publications
(6 citation statements)
references
References 12 publications
(14 reference statements)
0
6
0
Order By: Relevance
“…Four observer placement strategies are evaluated—Betweenness Centrality (BC) 52 that places observers at the nodes with the highest betweenness centrality, High Coverage Rate (HCR) 10 that maximises the number of neighbourhoods to which the observers belong, High Variance Observers (HVO) 49 which maximises the cardinality of the set of nodes lying on the shortest paths between observers, and simply random (RND) as a null model. For location methods, we chose the methods introduced by Pinto et al (LPTV) 1 that maximises likelihood estimator using a breadth-first search tree approximation, by Shen et al (TRBS) 2 that uses a so-called backwards spreading from observers onto the graph, by Xu et al (PC) 5 that relies on the correlation between the topological distance and time of propagation from the source, by Paluch et al (GMLA) 3 that modifies the LPTV by introducing a special node selection method, and by Gajewski et al (EPL) 4 that also modifies the LPTV to account for loops in the graph. Instead of showing the results as the function of the observer budget, K , we chose the observer density in our plots to easily compare systems of different sizes.…”
Section: Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…Four observer placement strategies are evaluated—Betweenness Centrality (BC) 52 that places observers at the nodes with the highest betweenness centrality, High Coverage Rate (HCR) 10 that maximises the number of neighbourhoods to which the observers belong, High Variance Observers (HVO) 49 which maximises the cardinality of the set of nodes lying on the shortest paths between observers, and simply random (RND) as a null model. For location methods, we chose the methods introduced by Pinto et al (LPTV) 1 that maximises likelihood estimator using a breadth-first search tree approximation, by Shen et al (TRBS) 2 that uses a so-called backwards spreading from observers onto the graph, by Xu et al (PC) 5 that relies on the correlation between the topological distance and time of propagation from the source, by Paluch et al (GMLA) 3 that modifies the LPTV by introducing a special node selection method, and by Gajewski et al (EPL) 4 that also modifies the LPTV to account for loops in the graph. Instead of showing the results as the function of the observer budget, K , we chose the observer density in our plots to easily compare systems of different sizes.…”
Section: Resultsmentioning
confidence: 99%
“…It assumes the normal distribution to calculate expected variances, which is not consistent with actual delay distributions of Susceptible-Infected processes that have geometric distribution. Multiple parallel paths affect not only variances and covariances of the expected times to arrive at observers starting from a specific potential source, but also their mean values 4 . While the distribution assumptions are mismatched, the inclusion of parallel paths allows this method to best predict the actual characteristics of the spreading process and pinpoint the source with the best accuracy.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…It is, however, computationally expensive and approximates a tree network which leads to discrepancies for networks with a local structure. These drawbacks have been alleviated by optimized methods that are much faster, or those that take full network structure into account, offering improved accuracy at a higher computational cost (Gajewski et al., 2019; Paluch et al., 2018). Other methods seek to reduce the assumptions required about the spreading process, either by estimating the time distribution from the data or by considering local correlation of arrival times and distances (She et al., 2016; Wang and Sun, 2020).…”
Section: Data Science Approach To Misinformationmentioning
confidence: 99%