Abstract. Policies have traditionally been a way to specify properties of a system. In this paper, we show how policies can be applied to the Organization Model for Adaptive Computational Systems (OMACS). In OMACS, policies may constrain the assignment of agents to roles, the structure of the organization's goal model, or how an agent may play a particular role. Here we focus on policies that limit system traces, which lets us leverage existing work on the specification and verification of properties in concurrent programs. We show how traditional policies can be characterized as law policies, which must always be followed by the system. In the context of multiagent systems, law policies limit the flexibility of the system. To preserve that flexibility while still guiding the system toward preferred behaviors, we introduce the concept of guidance policies. Guidance policies need not always be followed; when the system cannot continue under them, they may be suspended. We show how guidance policies can shape the way the system achieves its top-level goal without decreasing its flexibility. Guidance policies are formally defined and, since multiple guidance policies can conflict, a strategy for resolving such conflicts is given.
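To make the distinction concrete, here is a minimal Python sketch, assuming a simple model in which a trace is a sequence of events and each policy checks whether an event may extend the trace. The names (Policy, choose_event, priority) and the priority-based relaxation loop are illustrative assumptions, not the paper's formal definitions.

```python
# A minimal sketch (not the paper's formalism) of law vs. guidance
# policies viewed as constraints on system traces.

from dataclasses import dataclass
from typing import Callable, List, Sequence

Trace = Sequence[str]                  # the sequence of events so far
Check = Callable[[Trace, str], bool]   # may event e extend this trace?

@dataclass
class Policy:
    name: str
    allows: Check
    law: bool = False      # law policies must always be followed
    priority: int = 0      # higher = suspended later on guidance conflicts

def choose_event(trace: Trace, candidates: List[str],
                 policies: List[Policy]) -> str:
    """Law policies are hard filters on the next event; guidance policies
    are preferred, and suspended lowest-priority-first when they cannot
    all be satisfied (one possible conflict-resolution strategy)."""
    laws = [p for p in policies if p.law]
    guidance = sorted((p for p in policies if not p.law),
                      key=lambda p: p.priority, reverse=True)
    legal = [e for e in candidates if all(p.allows(trace, e) for p in laws)]
    if not legal:
        raise RuntimeError("law policies forbid every available action")
    active = list(guidance)
    while active:
        guided = [e for e in legal if all(p.allows(trace, e) for p in active)]
        if guided:
            return guided[0]
        active.pop()       # suspend the lowest-priority guidance policy
    return legal[0]        # all guidance suspended; flexibility preserved
```

The key property this sketch tries to capture is asymmetry: law policies never relax, while guidance policies yield (are suspended, not violated) before the system is forced to halt.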
Abstract-"Good, fast, or cheap, pick two." What drives designers to make decisions on how to architect a system? The stake-holder has certain abstract qualities in mind: efficiency, quality, reliability, and so forth. How do we make sure our system is guided by these qualities? What happens when the system cannot always provide all the qualities? We describe a framework for analyzing a design allowing decisions about what qualities are more important to be made at design-time.
Easy missions is an approach to machine learning that seeks to synthesize solutions for complex tasks from solutions for simpler ones. ISLES (Incrementally Staged Learning from Easier Subtasks) [1] is a genetic programming (GP) technique that achieves this by using identified goals and fitness functions for subproblems of the overall problem. Solutions evolved for these subproblems are then reused to speed up learning, either as automatically defined functions (ADFs) or by seeding a new GP population. Previous positive results using both approaches for learning in multi-agent systems (MAS) showed that incremental reuse using easy missions achieves overall fitness comparable to or better than single-layered GP. A key unresolved issue was hybrid reuse combining ADFs with easy missions. Results in the keep-away soccer (KAS) [2] domain (a test bed for MAS learning) were also inconclusive as to whether compactness-inducing reuse helped or hurt overall agent performance. In this paper, we compare single-layered GP (with and without ADFs) and easy-missions GP to two new types of GP learning systems with incremental reuse.

In our research we performed six experiments. The first used standard, conventional GP without any enhancement; we refer to this as single-layered GP. The second used standard GP enhanced with ADFs; we refer to this as single-layered ADF. The remaining experiments were double-layered (two stages of evolution). The third used ISLES with standard GP in both stages (ISLES-SGP/SGP). The fourth used ISLES with standard GP in the first stage and ADFs in the second (ISLES-SGP/ADF). The fifth used ISLES with ADFs in the first stage and standard GP in the second (ISLES-ADF/SGP). The sixth used ISLES with ADFs in both stages (ISLES-ADF/ADF). The configurations are summarized in the sketch below.

Each experiment was run using ECJ [3] and a KAS simulator created by S. Gustafson [1]. For both single-layered experiments, the target concept was to minimize the number of turnovers. For all of the ISLES experiments, the first-stage goal was to maximize the number of successful passes between two teammates in the absence of takers; the second-stage goal was to minimize the number of turnovers from keepers (3 keepers) to takers (1 taker). We took the average of ten runs for each experiment. The population size for all experiments was 4000. For the single-layered experiments, we stopped at the 101st generation. For the ISLES experiments we stopped the first stage at
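The six configurations can be summarized compactly as data. The following Python sketch records the experimental design described above; the Experiment and Stage structures are our own encoding for reference, not the ECJ setup used in the actual runs.

```python
# Encoding of the six experimental configurations described above.
# Structure names are illustrative; the real runs used ECJ [3] and
# S. Gustafson's keep-away soccer simulator [1].

from dataclasses import dataclass
from typing import List

POP_SIZE = 4000          # population size, all experiments
RUNS_PER_EXPERIMENT = 10 # results averaged over ten runs

@dataclass
class Stage:
    goal: str      # fitness objective for this stage
    use_adf: bool  # whether ADFs are enabled

@dataclass
class Experiment:
    name: str
    stages: List[Stage]  # one stage = single-layered; two = ISLES

PASS_GOAL = "maximize passes between two keepers, no takers"
TURNOVER_GOAL = "minimize turnovers, 3 keepers vs. 1 taker"

EXPERIMENTS = [
    Experiment("single-layered GP",  [Stage(TURNOVER_GOAL, False)]),
    Experiment("single-layered ADF", [Stage(TURNOVER_GOAL, True)]),
    Experiment("ISLES-SGP/SGP", [Stage(PASS_GOAL, False), Stage(TURNOVER_GOAL, False)]),
    Experiment("ISLES-SGP/ADF", [Stage(PASS_GOAL, False), Stage(TURNOVER_GOAL, True)]),
    Experiment("ISLES-ADF/SGP", [Stage(PASS_GOAL, True),  Stage(TURNOVER_GOAL, False)]),
    Experiment("ISLES-ADF/ADF", [Stage(PASS_GOAL, True),  Stage(TURNOVER_GOAL, True)]),
]
```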