2021
DOI: 10.1109/access.2021.3082136
|View full text |Cite
|
Sign up to set email alerts
|

Integrating Multiple Policies for Person-Following Robot Training Using Deep Reinforcement Learning

Abstract: Given a training environment which follows Markov decision process for a specific task, a deep reinforcement learning (DRL) agent is able to find possible optimal policies which map states of the environment to appropriate actions by repeatedly trying various actions to maximize training rewards. However, the learned policies cannot be reused directly in the training process for other new tasks resulting wasted precious time and resources. To solve this problem, we propose a DRL-based method for training an ag… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
0
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
references
References 38 publications
(49 reference statements)
0
0
0
Order By: Relevance