Difference between revisions of "Ad Hoc Teamwork: HFO"

Revision as of 22:20, 13 October 2014

Ad hoc teamwork in the Half Field Offense (HFO) task in the 2D simulated RoboCup domain.

In this research, we investigate the performance of PLASTIC-Policy in the HFO domain. PLASTIC-Policy learns policies for cooperating with past teammates and selects from among these policies on the fly. In these videos, PLASTIC-Policy has learned about 7 previous types of teammates. As a baseline, we compare the performance of the Combined Policy, where the ad hoc agent combines its observations about all 7 types of teammates to learn a new policy.
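
The on-the-fly selection described above can be illustrated with a small sketch. This page does not specify how PLASTIC-Policy chooses among its learned policies, so everything below is an assumption made for illustration: the agent keeps a belief over known teammate types, shrinks the belief of any type whose model explains the observed teammate action poorly (a polynomial-weights style update), and then acts with the policy learned for the most probable type. The function names, the `eta` parameter, and the dictionary layout are hypothetical.

```python
def update_beliefs(beliefs, action_probs, eta=0.2):
    """Reweight beliefs over teammate types after one observed teammate action.

    Assumed update rule (not taken from this page): a polynomial-weights
    style update, where each type is penalized in proportion to how
    unlikely its model found the observed action.

    beliefs: dict mapping teammate type -> current probability
    action_probs: dict mapping teammate type -> probability that the
        type's model assigns to the action just observed
    eta: hypothetical step size in (0, 1) limiting the effect of a
        single observation
    """
    updated = {}
    for name, belief in beliefs.items():
        loss = 1.0 - action_probs[name]  # 0 when the model predicted the action perfectly
        updated[name] = belief * (1.0 - eta * loss)
    total = sum(updated.values())
    # Renormalize so the beliefs remain a probability distribution.
    return {name: value / total for name, value in updated.items()}


def select_policy(beliefs, policies):
    """Return the policy learned for the currently most probable teammate type."""
    best_type = max(beliefs, key=beliefs.get)
    return policies[best_type]
```

Under these assumptions, the agent would call `update_beliefs` after each observed teammate action and `select_policy` before choosing its own action, so that the belief concentrates on the teammate type that best matches the current teammate.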

Limited Half Field Offense

In the limited half field offense task, 2 offensive agents play against 2 defensive agents, including the goalie. In these videos, the defense is using behavior designed by Helios. We sample teammates coming from 2 different teams for the videos.

Cooperating with a teammate designed by Cyrus:

Combined Policy:

PLASTIC-Policy:

Cooperating with a teammate from the agent2d code release:

Combined Policy:

PLASTIC-Policy:

@@ Line 2: / Line 2: @@
 [http://www.cs.utexas.edu/~AustinVilla/sim/halffieldoffense/ Original specification of the HFO task.]
+In this research, we investigate the performance of PLASTIC-Policy in the HFO domain.  PLASTIC-Policy learns policies for cooperating with past teammates and selects from among these policies on the fly.  In these videos, PLASTIC-Policy has learned about 7 previous types of teammates.  As a baseline, we compare the performance of the Combined Policy, where the ad hoc agent combines its observations about all 7 types of teammates to learn a new policy.
+== Limited Half Field Offense ==
+In the limited half field offense task, 2 offensive agents play against 2 defensive agents, including the goalie.  In these videos, the defense is using behavior designed by Helios.  We sample teammates coming from 2 different teams for the videos.
+'''Cooperating with a teammate designed by Cyrus:'''
+Combined Policy:
+{{#ev:youtube|zQaxS14uLAI}}
+PLASTIC-Policy:
+{{#ev:youtube|a52EGHnfKmQ}}
+'''Cooperating with a teammate from the agent2d code release:'''
+Combined Policy:
+{{#ev:youtube|LaXrnOHviuo}}
+PLASTIC-Policy:
+{{#ev:youtube|xpSTdfWgAoU}}
+{{#ev:youtube|hYKyjJXAI88}}
+{{#ev:youtube|Bhj-h3bgeDk}}
+{{#ev:youtube|U3ENo7Rxw0I}}
+{{#ev:youtube|ITHkok9pfGw}}