2019
DOI: 10.1007/978-3-030-34885-4_3
|View full text |Cite
|
Sign up to set email alerts
|

Towards Model-Based Reinforcement Learning for Industry-Near Environments

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
3
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
2

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(4 citation statements)
references
References 6 publications
0
3
0
Order By: Relevance
“…Andersen et al [60], in their paper, proposed a new "Dreaming Variational Autoencoder" model to speed up detection of potential threats. The authors cite that often expert systems already exist in fully automated warehouses, but are not flexible enough to work in dynamic environments.…”
Section: Deep Learning Methodsmentioning
confidence: 99%
“…Andersen et al [60], in their paper, proposed a new "Dreaming Variational Autoencoder" model to speed up detection of potential threats. The authors cite that often expert systems already exist in fully automated warehouses, but are not flexible enough to work in dynamic environments.…”
Section: Deep Learning Methodsmentioning
confidence: 99%
“…Similarly, Anderson et al [1] proposed Dreaming Variational Autoencoder, an architecture for modeling the environment using VAE and RNN, which uses the real trajectories from the actual environment to imitate the behavior of the actual environment. Conversely, Anderson et al [2] found that in high-dimensional tasks, simple heuristics exploration are often trapped in local minima of the state space, which may cause the generative model to become inaccurate or even collapse.…”
Section: Related Workmentioning
confidence: 99%
“…(1) DQN and DDQN are deep Q learning [13,14] and double deep Q learning [7], which are benchmark comparison algorithms; (2) CDQN and CDDQN are DQN or DDQN based on curiosity-driven exploration [5,15]; (3) DQN-VAE and DDQN-VAE add a VAE structure to DQN or DDQN, which only uses the VAE model to alleviate insufficient sample diversity; (4) DQN-CVAE and DDQN-CVAE are our proposed algorithms that combine (2) and (3). It was different from (3) in that we use curiosity-driven exploration to improve the efficiency of exploration.…”
Section: Evaluation Criteria and Comparison Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation