Abstract:In many applications in social network analysis, it is important to model the interactions and infer the influence between pairs of actors, leading to the problem of dyadic event modeling which has attracted increasing interests recently. In this paper we focus on the problem of dyadic event attribution, an important missing data problem in dyadic event modeling where one needs to infer the missing actor-pairs of a subset of dyadic events based on their observed timestamps. Existing works either use fixed mode… Show more
“…In social media such approaches have been used extensively [18,30,31], mainly due to their capability to model phenomena like user interactions and diffusion susceptibility. Hidden network properties, such as connexions between individuals [23,24] or the evolution of the diffusion network topology [13], have been uncovered using generative methods, such as selfexciting point processes. The main difficulty with such approaches is scalability, since each social phenomenon must be accounted manually in the model.…”
Predicting popularity, or the total volume of information outbreaks, is an
important subproblem for understanding collective behavior in networks. Each of
the two main types of recent approaches to the problem, feature-driven and
generative models, have desired qualities and clear limitations. This paper
bridges the gap between these solutions with a new hybrid approach and a new
performance benchmark. We model each social cascade with a marked Hawkes
self-exciting point process, and estimate the content virality, memory decay,
and user influence. We then learn a predictive layer for popularity prediction
using a collection of cascade history. To our surprise, Hawkes process with a
predictive overlay outperform recent feature-driven and generative approaches
on existing tweet data [43] and a new public benchmark on news tweets. We also
found that a basic set of user features and event time summary statistics
performs competitively in both classification and regression tasks, and that
adding point process information to the feature set further improves
predictions. From these observations, we argue that future work on popularity
prediction should compare across feature-driven and generative modeling
approaches in both classification and regression tasks
“…In social media such approaches have been used extensively [18,30,31], mainly due to their capability to model phenomena like user interactions and diffusion susceptibility. Hidden network properties, such as connexions between individuals [23,24] or the evolution of the diffusion network topology [13], have been uncovered using generative methods, such as selfexciting point processes. The main difficulty with such approaches is scalability, since each social phenomenon must be accounted manually in the model.…”
Predicting popularity, or the total volume of information outbreaks, is an
important subproblem for understanding collective behavior in networks. Each of
the two main types of recent approaches to the problem, feature-driven and
generative models, have desired qualities and clear limitations. This paper
bridges the gap between these solutions with a new hybrid approach and a new
performance benchmark. We model each social cascade with a marked Hawkes
self-exciting point process, and estimate the content virality, memory decay,
and user influence. We then learn a predictive layer for popularity prediction
using a collection of cascade history. To our surprise, Hawkes process with a
predictive overlay outperform recent feature-driven and generative approaches
on existing tweet data [43] and a new public benchmark on news tweets. We also
found that a basic set of user features and event time summary statistics
performs competitively in both classification and regression tasks, and that
adding point process information to the feature set further improves
predictions. From these observations, we argue that future work on popularity
prediction should compare across feature-driven and generative modeling
approaches in both classification and regression tasks
“…People find self-exciting point processes naturally suitable to model continuous-time events where the occurrence of one event can affect the likelihood of subsequent events in the future. One important self-exciting process is Hawkes process, which is first used to analyze earthquakes [25,37], and then widely applied to many different areas, such as market modeling [13,3], crime modeling [29], terrorist [26], conflict [35,21], and viral videos on the Web [10]. A novel Hawkes model was also proposed to model both temporal and textual information in viral [34].…”
We consider a search task as a set of queries that serve the same user information need. Analyzing search tasks from user query streams plays an important role in building a set of modern tools to improve search engine performance. In this paper, we propose a probabilistic method for identifying and labeling search tasks based on the following intuitive observations: queries that are issued temporally close by users in many sequences of queries are likely to belong to the same search task, meanwhile, different users having the same information needs tend to submit topically coherent search queries. To capture the above intuitions, we directly model query temporal patterns using a special class of point processes called Hawkes processes, and combine topic models with Hawkes processes for simultaneously identifying and labeling search tasks. Essentially, Hawkes processes utilize their self-exciting properties to identify search tasks if influence exists among a sequence of queries for individual users, while the topic model exploits query co-occurrence across different users to discover the latent information needed for labeling search tasks. More importantly, there is mutual reinforcement between Hawkes processes and the topic model in the unified model that enhances the performance of both. We evaluate our method based on both synthetic data and real-world query log data. In addition, we also apply our model to query clustering and search task identification. By comparing with state-of-the-art methods, the results demonstrate that the improvement in our proposed approach is consistent and promising.
“…In order to fully exploit the social ties between users and information in social networks, we base our trend detection algorithm on information diffusion models [5], [6], [7], [8], and more specifically on a Hawkes-based model for information diffusion in social networks [9], [10], [11], [12]. The Hawkesbased model allows: 1) leveraging on the knowledge of the influences between users and contents, 2) to fully explore the real time of broadcasts, 3) leveraging on the knowledge of users intrinsic (or exogenous) rates.…”
We develop in this paper a trend detection algorithm, designed to find trendy topics being disseminated in a social network. We assume that the broadcasts of messages in the social network is governed by a self-exciting point process, namely a Hawkes process, which takes into consideration the real broadcasting times of messages and the interaction between users and topics. We formally define trendiness and derive trend indices for each topic being disseminated in the social network. These indices take into consideration the time between the detection and the message broadcasts, the distance between the real broadcast intensity and the maximum expected broadcast intensity, and the social network topology. The proposed trend detection algorithm is simple and uses stochastic control techniques in order to calculate the trend indices. It is also fast and aggregates all the information of the broadcasts into a simple one-dimensional process, thus reducing its complexity and the quantity of data necessary to the detection.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.