N. Mert Vural scite author profile

N. Mert Vural

4Publications

18Citation Statements Received

150Citation Statements Given

How they've been cited

How they cite others

149

Affiliations

University of Toronto, Bilkent University

Publications

Order By: Most citations

Minimax Optimal Algorithms for Adversarial Bandit Problem With Multiple Plays

Vural

Gokcesu

et al. 2019

IEEE Trans. Signal Process.

View full text Add to dashboard Cite

We investigate the adversarial bandit problem with multiple plays under semi-bandit feedback. We introduce a highly efficient algorithm that asymptotically achieves the performance of the best switching m-arm strategy with minimax optimal regret bounds. To construct our algorithm, we introduce a new expert advice algorithm for the multiple-play setting. By using our expert advice algorithm, we additionally improve the best-known high-probability bound for the multi-play setting by O( √ m). Our results are guaranteed to hold in an individual sequence manner since we have no statistical assumption on the bandit arm gains. Through an extensive set of experiments involving synthetic and real data, we demonstrate significant performance gains achieved by the proposed algorithm with respect to the state-of-the-art algorithms.Index Terms-Adversarial multi-armed bandit, multiple plays, switching bandit, minimax optimal, individual sequence manner. I. INTRODUCTION A. PreliminariesM ULTI-ARMED bandit problem is extensively investigated in the online learning [1]-[6] and signal processing [7]-[11] literatures, especially for the applications where feedback is limited, and exploration-exploitation must be balanced optimally. In the classical framework, the multi-armed bandit problem deals with choosing a single arm out of K arms at each round so as to maximize the total reward. We study the multiple-play version of this problem, where we choose an m sized subset of K arms at each round. We assume that r The size m is constant throughout the game and known a priori by the learner.r The order of arm selections does not have an effect on the arm gains.r The total gain of the selected m arms is the sum of the gains of the selected individual arms.

show abstract

Achieving Online Regression Performance of LSTMs With Simple RNNs

Vural

İlhan

Yılmaz

et al. 2022

IEEE Trans. Neural Netw. Learning Syst.

View full text Add to dashboard Cite

Recurrent neural networks (RNNs) are widely used for online regression due to their ability to generalize nonlinear temporal dependencies. As an RNN model, long short-term memory networks (LSTMs) are commonly preferred in practice, as these networks are capable of learning long-term dependencies while avoiding the vanishing gradient problem. However, due to their large number of parameters, training LSTMs requires considerably longer training time compared to simple RNNs (SRNNs). In this article, we achieve the online regression performance of LSTMs with SRNNs efficiently. To this end, we introduce a first-order training algorithm with a linear time complexity in the number of parameters. We show that when SRNNs are trained with our algorithm, they provide very similar regression performance with the LSTMs in two to three times shorter training time. We provide strong theoretical analysis to support our experimental results by providing regret bounds on the convergence rate of our algorithm. Through an extensive set of experiments, we verify our theoretical work and demonstrate significant performance improvements of our algorithm with respect to LSTMs and the other state-of-the-art learning models.

show abstract

An Efficient and Effective Second-Order Training Algorithm for LSTM-Based Adaptive Learning

Vural

Ergüt

Kozat

2021

IEEE Trans. Signal Process.

View full text Add to dashboard Cite

We study adaptive (or online) nonlinear regression with Long-Short-Term-Memory (LSTM) based networks, i.e., LSTM-based adaptive learning. In this context, we introduce an efficient Extended Kalman filter (EKF) based second-order training algorithm. Our algorithm is truly online, i.e., it does not assume any underlying data generating process and future information, except that the target sequence is bounded. Through an extensive set of experiments, we demonstrate significant performance gains achieved by our algorithm with respect to the state-of-the-art methods. Here, we mainly show that our algorithm consistently provides 10 to 45% improvement in the accuracy compared to the widely-used adaptive methods Adam, RMSprop, and DEKF, and comparable performance to EKF with a 10 to 15 times reduction in the run-time.

show abstract

LSTM-Based Online Learning with Extended Kalman Filter Based Training Algorithm

İlhan

Vural

Kozat

2020

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

N. Mert Vural

Minimax Optimal Algorithms for Adversarial Bandit Problem With Multiple Plays

Achieving Online Regression Performance of LSTMs With Simple RNNs

An Efficient and Effective Second-Order Training Algorithm for LSTM-Based Adaptive Learning

LSTM-Based Online Learning with Extended Kalman Filter Based Training Algorithm

Contact Info

Product

Resources

About