2020 International Joint Conference on Neural Networks (IJCNN) 2020
DOI: 10.1109/ijcnn48605.2020.9206849
|View full text |Cite
|
Sign up to set email alerts
|

Adaptation of a wheel loader automatic bucket filling neural network using reinforcement learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(9 citation statements)
references
References 22 publications
0
9
0
Order By: Relevance
“…Artificial intelligence (AI)-based methods include fuzzy logic for wheel-loader action selection [15] and digging control [16] by using feed-forward neural networks to model digging resistance and machine dynamics. Automatic bucket filling by learning from demonstration was recently demonstrated in [17][18][19] and extended in [20] with a reinforcement learning algorithm for automatic adaptation of an already-trained model to a new pile of different soil. The imitation model in [17] is a time-delayed neural network that predicts the lift and tilt actions of joysticks during the filling of a bucket; it was trained with 100 examples from an expert operator and used no information about the material or the pile.…”
Section: Related Work and Our Contributionmentioning
confidence: 99%
See 1 more Smart Citation
“…Artificial intelligence (AI)-based methods include fuzzy logic for wheel-loader action selection [15] and digging control [16] by using feed-forward neural networks to model digging resistance and machine dynamics. Automatic bucket filling by learning from demonstration was recently demonstrated in [17][18][19] and extended in [20] with a reinforcement learning algorithm for automatic adaptation of an already-trained model to a new pile of different soil. The imitation model in [17] is a time-delayed neural network that predicts the lift and tilt actions of joysticks during the filling of a bucket; it was trained with 100 examples from an expert operator and used no information about the material or the pile.…”
Section: Related Work and Our Contributionmentioning
confidence: 99%
“…The imitation model in [17] is a time-delayed neural network that predicts the lift and tilt actions of joysticks during the filling of a bucket; it was trained with 100 examples from an expert operator and used no information about the material or the pile. With the adaptation algorithm in [20], the network adapts from loading mediumcoarse gravel to cobble gravel, with a five-to ten-percent increase in bucket filling after 40 loadings. The first use of reinforcement learning to control a scooping mechanism was recently published [21].…”
Section: Related Work and Our Contributionmentioning
confidence: 99%
“…Because simulation models are not derived from the real world, RL-based simulation cannot learn features of the real world well. Dadhich et al [5] used RL to achieve the automatic bucket-filling of wheel loaders through real-time interaction with the real environment. However, interacting with the real environment to train the RL algorithm is costly and time-consuming.…”
Section: Related Workmentioning
confidence: 99%
“…Bucket-filling is a relatively repetitive task for the operators of wheel-loaders and is suitable for automation. Automatic bucket-filling is also required for efficient remote operation and the development of fully autonomous solutions [5]. The interaction condition between the bucket and the pile strongly affects the bucket-filling.…”
Section: Introductionmentioning
confidence: 99%
“…Furthermore, machine control and perception strategies use software algorithms, datasets, and data fusion which also place high computational hardware requirements [6]. Additionally, methods such as imitation learning (IL) are used to construct surrogate models for control of autonomous HDMM, wherein a human-operator demonstrates a task, which is subsequently "imitated" by the model [33]. The human performance may differ and thus, work performance evaluation models are valuable to demonstrate ideal working methods for IL.…”
Section: ) Sensorsmentioning
confidence: 99%