2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2019
DOI: 10.1109/iros40897.2019.8968129
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Agent Image Classification via Reinforcement Learning

Abstract: We investigate a classification problem using multiple mobile agents capable of collecting (partial) posedependent observations of an unknown environment. The objective is to classify an image over a finite time horizon. We propose a network architecture on how agents should form a local belief, take local actions, and extract relevant features from their raw partial observations. Agents are allowed to exchange information with their neighboring agents to update their own beliefs. It is shown how reinforcement… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
11
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 16 publications
(11 citation statements)
references
References 16 publications
0
11
0
Order By: Relevance
“…In order to acquire adequate observations, the robot could consider traversing the environment while collecting local information as sequential data and attempting to learn their inter-correlation with recurrent neural networks. The same type of framework that uses localized observations and recurrent neural networks to learn the image and map classification is illustrated in our previous works [22,21,16,17], and shows promising performance in various scenarios.…”
Section: Introductionmentioning
confidence: 85%
See 2 more Smart Citations
“…In order to acquire adequate observations, the robot could consider traversing the environment while collecting local information as sequential data and attempting to learn their inter-correlation with recurrent neural networks. The same type of framework that uses localized observations and recurrent neural networks to learn the image and map classification is illustrated in our previous works [22,21,16,17], and shows promising performance in various scenarios.…”
Section: Introductionmentioning
confidence: 85%
“…In the first case study, we consider the robot is traveling over the image from the MNIST dataset [14,22]. The training and testing is performed in PyTorch [23] ADAM [10] and a learning rate l r = 0.0001.…”
Section: B Mnist Datasetmentioning
confidence: 99%
See 1 more Smart Citation
“…A common model for RL is the standard Markov Decision Process. , RL can be divided into model-based RL and model-free RL, , as well as active RL and passive RL . DL models can also be used in RL to form deep RL (DRL). , These methods have been applied to diverse fields in biology and chemistry, including drug discovery, , protein design, , and chemical engineering. , They have also been used in image recognition , and financial markets . Next, we will summarize how RL can be applied to the study of biochemical molecules in the context of small data sets.…”
Section: Methods For Small Molecular Data Challengesmentioning
confidence: 99%
“…[ 11 ], where each agent controls a sliding square window. This method is more robust and has better generalization based on the results of the experiments on the CIFAR-10 dataset and the MNIST dataset [ 21 ].…”
Section: Introductionmentioning
confidence: 99%