Abstract:This paper addresses opportunistic spectrum access (OSA) in non-cooperative cognitive radio networks (CRNs). The sel sh behaviors of the secondary users (SUs) will cause a CRN to collapse. The SUs are thus enabled to build beliefs about how other SUs would respond to their decision makings. The interaction among the SUs is modeled as a stochastic learning process. In this way, each SU can independently learn the behaviors of the competitors, optimize the OSA strategies, and nally achieve the goal of reciprocit… Show more
“…It is assumed that there are five available channels and ten users. The error packet is 3 10 − and the average received SNR increases from 5 to 15 dB . The simulation result shows that the proposed multi-agent Q-learning algorithm achieves better throughput performance.…”
Section: A Convergence Behaviormentioning
confidence: 99%
“…Opportunistic spectrum access (OSA) has become increasingly popular due to its potential to improve the efficiency of spectrum usage [3]- [4]. However, many authors studying the problem of OSA assumed that the wireless environment is static and does not vary with time.…”
This article investigates the problem of opportunistic spectrum access in dynamic environment, in which the signal-to-noise ratio (SNR) is time-varying. Different from existing work on continuous feedback, we consider more practical scenarios in which the transmitter receives an Acknowledgment (ACK) if the received SNR is larger than the required threshold, and otherwise a Non-Acknowledgment (NACK). That is, the feedback is discrete. Several applications with different threshold values are also considered in this work. The channel selection problem is formulated as a non-cooperative game, and subsequently it is proved to be a potential game, which has at least one pure strategy Nash equilibrium. Following this, a multi-agent Q-learning algorithm is proposed to converge to Nash equilibria of the game. Furthermore, opportunistic spectrum access with multiple discrete feedbacks is also investigated. Finally, the simulation results verify that the proposed multi-agent Q-learning algorithm is applicable to both situations with binary feedback and multiple discrete feedbacks.
“…It is assumed that there are five available channels and ten users. The error packet is 3 10 − and the average received SNR increases from 5 to 15 dB . The simulation result shows that the proposed multi-agent Q-learning algorithm achieves better throughput performance.…”
Section: A Convergence Behaviormentioning
confidence: 99%
“…Opportunistic spectrum access (OSA) has become increasingly popular due to its potential to improve the efficiency of spectrum usage [3]- [4]. However, many authors studying the problem of OSA assumed that the wireless environment is static and does not vary with time.…”
This article investigates the problem of opportunistic spectrum access in dynamic environment, in which the signal-to-noise ratio (SNR) is time-varying. Different from existing work on continuous feedback, we consider more practical scenarios in which the transmitter receives an Acknowledgment (ACK) if the received SNR is larger than the required threshold, and otherwise a Non-Acknowledgment (NACK). That is, the feedback is discrete. Several applications with different threshold values are also considered in this work. The channel selection problem is formulated as a non-cooperative game, and subsequently it is proved to be a potential game, which has at least one pure strategy Nash equilibrium. Following this, a multi-agent Q-learning algorithm is proposed to converge to Nash equilibria of the game. Furthermore, opportunistic spectrum access with multiple discrete feedbacks is also investigated. Finally, the simulation results verify that the proposed multi-agent Q-learning algorithm is applicable to both situations with binary feedback and multiple discrete feedbacks.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.