2010 IEEE/RSJ International Conference on Intelligent Robots and Systems 2010
DOI: 10.1109/iros.2010.5650006
|View full text |Cite
|
Sign up to set email alerts
|

Imitation learning for task allocation

Abstract: Abstract-At the heart of multi-robot task allocation lies the ability to compare multiple options in order to select the best. In some domains this utility evaluation is not straightforward, for example due to complex and unmodeled underlying dynamics or an adversary in the environment. Explicitly modeling these extrinsic influences well enough so that they can be accounted for in utility computation (and thus task allocation) may be intractable, but a human expert may be able to quickly gain some intuition ab… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
5
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 17 publications
(5 citation statements)
references
References 14 publications
0
5
0
Order By: Relevance
“…A recent structured prediction approach [37] for task allocation uses a combination of reinforcement learning and quadratic integer programming for learning directly from data to optimize assignments. The approaches in [17], [37], however, assume the existence of a single strategy for task allocation. A distributed approach for multi-agent task allocation [38] learns to select the most appropriate of two pre-specified strategies, namely Earliest Deadline First (EDF) or Nearest Task First (NTF).…”
Section: Learning For Team-level Coordinationmentioning
confidence: 99%
See 1 more Smart Citation
“…A recent structured prediction approach [37] for task allocation uses a combination of reinforcement learning and quadratic integer programming for learning directly from data to optimize assignments. The approaches in [17], [37], however, assume the existence of a single strategy for task allocation. A distributed approach for multi-agent task allocation [38] learns to select the most appropriate of two pre-specified strategies, namely Earliest Deadline First (EDF) or Nearest Task First (NTF).…”
Section: Learning For Team-level Coordinationmentioning
confidence: 99%
“…The learning methods discussed so far consider only homogeneous robots [37], do not show generalization to teams unseen during training [17], [18], depend on the ability to interact with the environment to learn policies [39] or adhere to a limited number of pre-specified strategies [38]. In contrast, our framework is capable of learning generalizable and heterogeneous strategies for task allocation in heterogeneous multi-agent systems from expert demonstrations, and do not rely on environmental interactions.…”
Section: Learning For Team-level Coordinationmentioning
confidence: 99%
“…Reinforcement learning is an early used architecture in this field [20]. Meanwhile, imitation learning is also used [21], which is characterized by expert's knowledge. Nunes and Gini study an auction algorithm to allocate temporal-constraint tasks, and model the temporal constraints on the tasks as a simple temporal problem [22].…”
Section: Related Work a Multi-robot Task Allocationmentioning
confidence: 99%
“…In the past decade, multirobot task allocation has been a popular research topic in robotics. Furthermore, a few learning‐based multirobot task allocation approaches have been developed for such applications as fire‐fighting disaster response (Duvallet and Stentz, ; Jones et al., ), patrolling (Tangamchit et al., ), and multirobot auctioning (Pippin and Christensen, ).…”
Section: Related Workmentioning
confidence: 99%
“…In Duvallet and Stentz (), imitation learning was implemented to incorporate human expert knowledge into the decision‐making process of a market‐based multirobot task allocation method for fire‐fighting disaster response. Namely, the human expert's solutions to a multirobot task allocation problem were represented by a set of demonstrated allocations that were then utilized to train a pricing policy, e.g., assign different values to different buildings.…”
Section: Related Workmentioning
confidence: 99%