2019
DOI: 10.1109/tkde.2018.2866041

Online Interactive Collaborative Filtering Using Multi-Armed Bandit with Dependent Arms

Abstract: Online interactive recommender systems strive to promptly suggest appropriate items (e.g., movies, news articles) to consumers according to the current context, including both the consumer and item content information. However, such context information is often unavailable in practice, and only the users' interaction data on items can be utilized. Moreover, the lack of interaction records, especially for new users and items, further worsens the recommendation performance. To address t…
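The setting the abstract describes, recommending items from user-item interaction feedback alone without contextual features, is commonly modeled as a multi-armed bandit. Below is a minimal sketch of that setting using Beta-Bernoulli Thompson sampling as a generic baseline; it is not the algorithm proposed in this paper, and the item count, horizon, and simulated click rates are assumptions for illustration.

```python
import numpy as np

# Minimal sketch of context-free interactive recommendation: one item is
# recommended per round and the system learns only from the observed
# interaction feedback (Beta-Bernoulli Thompson sampling, illustrative only).

rng = np.random.default_rng(42)
n_items, horizon = 10, 5000
true_click_rate = rng.uniform(0.05, 0.4, n_items)    # unknown to the system

alpha = np.ones(n_items)   # Beta posterior parameters, one pair per item
beta = np.ones(n_items)

for t in range(horizon):
    sampled = rng.beta(alpha, beta)                   # one posterior draw per item
    item = int(np.argmax(sampled))                    # recommend the most promising item
    clicked = float(rng.random() < true_click_rate[item])   # simulated user feedback
    alpha[item] += clicked                            # posterior update from interactions only
    beta[item] += 1.0 - clicked
```

Sampling from the posterior rather than taking its mean is what drives exploration here: rarely shown items keep wide posteriors and occasionally win the draw, which mirrors how interactive recommenders handle new users and items with few interaction records.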

Cited by 60 publications (34 citation statements)
References 39 publications (39 reference statements)
“…On the other hand, CoLin [50] assumes that the bandit reward is generated through an additive model, indicating that friends' feedback on their recommendations can be propagated via the network to explain the target user's feedback. Recently, researchers have also tried to capture arm dependency by organizing different arms into clusters [36,48], which is very similar to our studied problem. However, simply performing clustering on attributed networks may ignore the inherent node dependencies and may lead to suboptimal results in interactive anomaly discovery.…”
Section: Multi-armed Bandit Algorithmsmentioning
confidence: 96%
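As a rough illustration of the cluster-based treatment of dependent arms mentioned in the statement above, the sketch below pools reward statistics at the cluster level inside a UCB1 rule, so feedback on one arm also informs the other arms in its cluster. The fixed clustering, Bernoulli rewards, and cluster means are assumptions for this sketch, not the construction used in [36,48] or in this paper.

```python
import numpy as np

# Cluster-level UCB1 sketch: arms are pre-grouped, and pull counts and
# rewards are shared per cluster, capturing a simple form of arm dependency.

rng = np.random.default_rng(7)
n_clusters, arms_per_cluster, horizon = 3, 4, 3000
cluster_of = np.repeat(np.arange(n_clusters), arms_per_cluster)
true_means = np.repeat([0.2, 0.5, 0.7], arms_per_cluster)   # arms share their cluster's mean

pulls = np.zeros(n_clusters)      # statistics kept at cluster level
rewards = np.zeros(n_clusters)

for c in range(n_clusters):       # initialization: pull one arm from each cluster
    arm = rng.choice(np.where(cluster_of == c)[0])
    pulls[c] += 1
    rewards[c] += float(rng.random() < true_means[arm])

for t in range(n_clusters + 1, horizon + 1):
    ucb = rewards / pulls + np.sqrt(2.0 * np.log(t) / pulls)   # UCB1 index per cluster
    cluster = int(np.argmax(ucb))
    arm = rng.choice(np.where(cluster_of == cluster)[0])       # any arm inside the chosen cluster
    pulls[cluster] += 1
    rewards[cluster] += float(rng.random() < true_means[arm])
```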
“…In the early studies of CRSs, it was common to ask for users' opinions on an item itself (Zhao et al., 2013; Wang et al., 2017). These approaches usually combine the features of static recommender systems, such as CF, with real-time user interaction.…”
Section: Item Elicitationmentioning
confidence: 99%
“…In [28], the authors studied online collaborative filtering with dependent arms, where they combined particle filtering with the upper confidence bound (UCB) algorithm and Thompson sampling. The limitation of the work is that it does not consider adversarial effects.…”
Section: Related Workmentioning
confidence: 99%
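For readers unfamiliar with the combination attributed to [28], the following is a hedged sketch of particle-filter-based Thompson sampling for Bernoulli rewards: each arm keeps a set of particles approximating the posterior over its unknown click probability, and after every pull the particles are reweighted by the likelihood of the observed reward and resampled. The particle count, uniform prior, and reward model are illustrative assumptions, not the paper's exact specification.

```python
import numpy as np

# Particle Thompson sampling sketch: particles approximate each arm's
# posterior; a draw from these particles decides which arm to play, and a
# particle-filter update incorporates the observed reward.

rng = np.random.default_rng(0)
n_arms, n_particles, horizon = 5, 200, 2000
true_probs = rng.uniform(0.1, 0.9, n_arms)                    # unknown ground truth

particles = rng.uniform(0.0, 1.0, (n_arms, n_particles))      # samples from a uniform prior
weights = np.full((n_arms, n_particles), 1.0 / n_particles)

for t in range(horizon):
    # Thompson sampling: draw one particle per arm, play the arm with the best draw.
    draws = np.array([rng.choice(particles[a], p=weights[a]) for a in range(n_arms)])
    arm = int(np.argmax(draws))
    reward = float(rng.random() < true_probs[arm])             # simulated user feedback

    # Particle filter update for the played arm: reweight by the Bernoulli
    # likelihood of the observed reward, then resample to avoid degeneracy.
    likelihood = particles[arm] if reward else 1.0 - particles[arm]
    w = weights[arm] * likelihood
    w /= w.sum()
    resampled = rng.choice(n_particles, size=n_particles, p=w)
    particles[arm] = particles[arm][resampled]
    weights[arm] = 1.0 / n_particles
```

A UCB variant of the same idea would replace the single posterior draw with an optimistic quantile of each arm's particle set; the particle-filter update step stays the same.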