2019 IEEE Globecom Workshops (GC Wkshps) 2019
DOI: 10.1109/gcwkshps45667.2019.9024429
|View full text |Cite
|
Sign up to set email alerts
|

A Reinforcement Learning Approach for the Multichannel Rendezvous Problem

Abstract: In this paper, we consider the multichannel rendezvous problem in cognitive radio networks (CRNs) where the probability that two users hopping on the same channel have a successful rendezvous is a function of channel states. The channel states are modelled by two-state Markov chains that have a good state and a bad state. These channel states are not observable by the users. For such a multichannel rendezvous problem, we are interested in finding the optimal policy to minimize the expected time-to-rendezvous (… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
6
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 9 publications
(6 citation statements)
references
References 17 publications
0
6
0
Order By: Relevance
“…Statistical dependency has an important impact in many more applications. These include cognitive radio [45], queuing [46], and improper signaling [47]. A more general dependency analysis using copulas might also be beneficial in the context of finite-length information theory.…”
Section: Discussionmentioning
confidence: 99%
“…Statistical dependency has an important impact in many more applications. These include cognitive radio [45], queuing [46], and improper signaling [47]. A more general dependency analysis using copulas might also be beneficial in the context of finite-length information theory.…”
Section: Discussionmentioning
confidence: 99%
“…An evaluation of SARSA and various other model-free algorithms is presented in Dafazio [9]. The evaluation considered diverse and difficult problems within a consistent environment (Arcade [15]) involving configuration aspects like Epsilon-greedy policy, exploration versus exploitation and state space with SARSA outperforming algorithms like Q-learning, ETTR (Expected Time-to-Rendezvous) [34], R-learning [16], GQ algorithm [35] and Actor-Critic algorithm [6].…”
Section: Network Communications Resource Allocation With Sarsamentioning
confidence: 99%
“…Introduction: Multichannel blind rendezvous problem refers to how to guarantee that two secondary users (or called users for simplicity) hop on the same channel at the same timeslot in cognitive radio networks (CRNs) [1]. A number of works have been conducted on how to design channel hopping (CH) sequences for this fundamental problem in CRNs [2].…”
mentioning
confidence: 99%
“…A number of works have been conducted on how to design channel hopping (CH) sequences for this fundamental problem in CRNs [2]. Existing works can be categorized from different perspectives as follows: (1) According to whether users can adopt the same CH strategy, they are divided into Asymmetric role [3][4][5] and Symmetric role [6]; (2) According to the status of available channels, they can be divided into Homogeneous channels [7], Heterogeneous channels [8], Channel invariant [9], and Channel variable [10]; (3) According to whether users start CH at the same time, they can be divided into Asynchronous clock [11,12] and Synchronous clock [13]; (4) According to one or more users receiving information, they can be divided into Pairwise destination (Unicast) [14] and Multiple destination (Multicast) [15]. The ideal CH algorithms are expected to give good performance in terms of the maximum time-to-rendezvous (MTTR) (i.e.…”
mentioning
confidence: 99%
See 1 more Smart Citation