Jump to: navigation, search

Difference between revisions of "Ad Hoc Teamwork: HFO"

Line 2: Line 2:
  
 
[http://www.cs.utexas.edu/~AustinVilla/sim/halffieldoffense/ Original specification of the HFO task.]
 
[http://www.cs.utexas.edu/~AustinVilla/sim/halffieldoffense/ Original specification of the HFO task.]
 +
 +
In this research, we investigate the performance of PLASTIC-Policy in the HFO domain.  PLASTIC-Policy learns policies for cooperating with past teammates and selects from among these policies on the fly.  In these videos, PLASTIC-Policy has learned about 7 previous types teammates.  As a baseline, we compare the performance of the Combined Policy, where the ad hoc agent combines its observations about all 7 types of teammates to learn a new policy.
 +
 +
== Limited Half Field Offense ==
 +
In the limited half field offense task, 2 offensive agents play against 2 defensive agents, including the goalie.  In these videos, the defense is using behavior designed by Helios.  We sample teammates coming from 2 different teams for the videos.
 +
 +
'''Cooperating with a teammate designed by Cyrus:'''
 +
 +
Combined Policy:
 +
{{#ev:youtube|zQaxS14uLAI}}
 +
 +
PLASTIC-Policy
 +
{{#ev:youtube|a52EGHnfKmQ}}
 +
 +
'''Cooperating with a teammate from the agent2d code release:'''
 +
 +
Combined Policy:
 +
{{#ev:youtube|LaXrnOHviuo}}
 +
 +
PLASTIC-Policy:
 +
{{#ev:youtube|xpSTdfWgAoU}}
 +
 +
 +
{{#ev:youtube|hYKyjJXAI88}}
 +
 +
{{#ev:youtube|Bhj-h3bgeDk}}
 +
 +
{{#ev:youtube|U3ENo7Rxw0I}}
 +
 +
{{#ev:youtube|ITHkok9pfGw}}

Revision as of 22:20, 13 October 2014

Ad hoc teamwork in the Half Field Offense (HFO) task in the 2D simulated RoboCup domain.

Original specification of the HFO task.

In this research, we investigate the performance of PLASTIC-Policy in the HFO domain. PLASTIC-Policy learns policies for cooperating with past teammates and selects from among these policies on the fly. In these videos, PLASTIC-Policy has learned about 7 previous types teammates. As a baseline, we compare the performance of the Combined Policy, where the ad hoc agent combines its observations about all 7 types of teammates to learn a new policy.

Limited Half Field Offense

In the limited half field offense task, 2 offensive agents play against 2 defensive agents, including the goalie. In these videos, the defense is using behavior designed by Helios. We sample teammates coming from 2 different teams for the videos.

Cooperating with a teammate designed by Cyrus:

Combined Policy:

PLASTIC-Policy

Cooperating with a teammate from the agent2d code release:

Combined Policy:

PLASTIC-Policy: