
Agent Architecture Overview

CMUnited-98 agents are capable of perception, cognition, and action. By perceiving the world, they build a model of its current state. Then, based on a set of behaviors, they choose an action appropriate for the current world state.

A driving factor in the design of the agent architecture is that the simulator operates in fixed cycles of 100 msec. As presented in Section [1], the simulator accepts commands from clients throughout a cycle and then updates the world state all at once at the end of the cycle. Only one action command (dash, kick, turn, or catch) is executed for a given client during a given cycle.

Therefore, agents (simulator clients) should send exactly one action command to the simulator in every simulator cycle. If more than one command is sent in the same cycle, a random one is executed, possibly leading to undesired behavior. If no command is sent during a simulator cycle, an action opportunity has been lost: opponent agents who have acted during that cycle may gain an advantage.
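The one-command-per-cycle rule can be enforced on the client side with a small guard. The following is a minimal sketch in Python, not the CMUnited-98 implementation: the `CommandGate` class, its `try_send` method, and the `missed_cycles` counter are hypothetical names introduced here for illustration, and the command strings in the usage example are placeholders.

```python
class CommandGate:
    """Hypothetical helper: lets at most one action command through per cycle."""

    def __init__(self, send):
        self._send = send              # callable that transmits one command
        self._last_sent_cycle = None   # cycle of the most recent command, if any
        self.missed_cycles = 0         # cycles that passed with no command sent

    def try_send(self, cycle, command):
        """Send `command` for `cycle` unless one was already sent in that cycle.

        Returns True if the command was transmitted, False if it was dropped.
        """
        if self._last_sent_cycle == cycle:
            # A second command in the same cycle would leave the choice of
            # which one executes up to the simulator, so drop it here.
            return False
        if self._last_sent_cycle is not None:
            # Every cycle strictly between the last send and this one is a
            # lost action opportunity.
            self.missed_cycles += max(0, cycle - self._last_sent_cycle - 1)
        self._send(command)
        self._last_sent_cycle = cycle
        return True


if __name__ == "__main__":
    sent = []
    gate = CommandGate(sent.append)
    gate.try_send(1, "(dash 100)")   # accepted
    gate.try_send(1, "(turn 30)")    # dropped: cycle 1 already has a command
    gate.try_send(3, "(kick 50 0)")  # accepted; cycle 2 counted as missed
    print(sent, gate.missed_cycles)
```

Tracking missed cycles alongside the guard makes both failure modes described above (too many commands and too few) visible during debugging.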

In addition, since the simulator updates the world at the end of every cycle, it is advantageous to determine the state of the world at the end of the previous cycle before choosing an action for the current cycle. Accordingly, the basic agent loop during a given cycle t is as follows:

1. Assume the agent has consistent information about the state of the world at the end of cycle t-2 and has sent an action during cycle t-1.
2. While the server is still in cycle t-1, upon receipt of a sensation (see, hear, or sense_body), store the new information in temporary structures. Do not update the current state.
3. When the server enters cycle t (determined either by a running clock or by the receipt of a sensation with time stamp t), use all of the information available (temporary information from sensations and predicted effects of past actions) to update the world model to match the server's world state (the "real world state") at the end of cycle t-1. Then choose and send an action to the server for cycle t.
4. Repeat for cycle t+1.
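The control flow of this loop can be sketched as follows. This is an illustrative Python outline under stated assumptions, not the CMUnited-98 code: the `AgentLoop` class and its method names are hypothetical, and the world-model update and action selection are passed in as caller-supplied functions (`update_world`, `choose_action`) because their details are not specified here.

```python
class AgentLoop:
    """Hypothetical sketch: buffer sensations, update once per cycle, then act."""

    def __init__(self, update_world, choose_action, send):
        self.update_world = update_world    # (world, pending, last_action) -> world
        self.choose_action = choose_action  # world -> command
        self.send = send                    # transmits one command to the server
        self.cycle = None                   # cycle the agent believes the server is in
        self.world = {}                     # last consistent world model
        self.pending = []                   # sensations not yet merged into the model
        self.last_action = None             # action sent in the previous cycle

    def on_sensation(self, time_stamp, sensation):
        # Store the sensation in a temporary structure only; the world model
        # itself is left untouched until the next cycle boundary.
        self.pending.append((time_stamp, sensation))
        if self.cycle is None or time_stamp > self.cycle:
            # A newer time stamp is one of the two signals of a new cycle.
            self.on_new_cycle(time_stamp)

    def on_new_cycle(self, t):
        # May also be called from a running clock (the other signal).
        if self.cycle is not None and t <= self.cycle:
            return  # this cycle has already been handled
        # Combine buffered sensations with the predicted effect of the last
        # action to bring the model up to the end of the previous cycle.
        self.world = self.update_world(self.world, self.pending, self.last_action)
        self.pending = []
        self.cycle = t
        # Choose and send exactly one action for the new cycle.
        self.last_action = self.choose_action(self.world)
        self.send(self.last_action)
```

Because `on_new_cycle` ignores a cycle it has already handled, the clock and a time-stamped sensation can both signal the same cycle without causing a second command to be sent.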

While the above algorithm defines the overall agent loop, most of the challenge lies in updating the world model effectively and choosing an appropriate action. The remainder of this section describes these processes in detail.


Peter Stone
Mon Nov 30 20:08:29 EST 1998