2020
DOI: 10.1109/access.2020.3005885
|View full text |Cite
|
Sign up to set email alerts
|

Deep Q-Learning Based Optimization of VLC Systems With Dynamic Time-Division Multiplexing

Abstract: The traditional method to solve nondeterministic-polynomial-time (NP)-hard optimization problems is to apply meta-heuristic algorithms. In contrast, Deep Q Learning (DQL) uses memory of experience and deep neural network (DNN) to choose steps and progress towards solving the problem. The dynamic time-division multiple access (DTDMA) scheme is a viable transmission method in visible light communication (VLC) systems. In DTDMA systems, the time-slots of the users are adjusted to maximize the spectral efficiency … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
7
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 20 publications
0
7
0
Order By: Relevance
“…The above optimization problem is a non-convex optimization problem for N > 2 [14]. A non-convex optimization problem has many local minima and it is hard to determine the globally optimal solution [14], [24], [2]. Different methods such as evolutionary algorithm (EA) [25], [26], [27], [28], monotonic optimization [2], DRL [2], etc., can be applied to solve them.…”
Section: System Model and Optimization Problemmentioning
confidence: 99%
See 3 more Smart Citations
“…The above optimization problem is a non-convex optimization problem for N > 2 [14]. A non-convex optimization problem has many local minima and it is hard to determine the globally optimal solution [14], [24], [2]. Different methods such as evolutionary algorithm (EA) [25], [26], [27], [28], monotonic optimization [2], DRL [2], etc., can be applied to solve them.…”
Section: System Model and Optimization Problemmentioning
confidence: 99%
“…A state-representation should capture the attributes of the system that are relevant to the decision-making [32]. Before discussing the state-representation, we would to define two functions f L (∆ i ) and f U (∆ i ), as follows We can determine the data-rates of users (R i ) using (2). Please note that the users are sorted in the ascending order of the square of their channel gains, i.e., |h…”
Section: State-representationmentioning
confidence: 99%
See 2 more Smart Citations
“…With the development of artificial intelligence, algorithms like deep Q-network have been widely used for decision making in various practical problems [18][19][20]. e use of reinforcement learning in path planning is increasing and has provided different goal-oriented path planning for various types of vehicles due to its strong performance and high applicability in decision making of path selection.…”
Section: Introductionmentioning
confidence: 99%