
Noa Agmon, Samuel Barrett, and
Peter Stone. **Modeling Uncertainty in Leading Ad Hoc Teams**. In *Proc.
of 13th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS)*, May 2014.


Ad hoc teamwork exists when a team of agents needs to cooperate without being able to communicate or use coordination schemes that were designed a-priori. Sometimes ad hoc teamwork amounts to acting so as to bring out the best in your teammates by "leading" them to the optimal joint action. Doing so can be challenging even when their behavior is fully known. In this paper, we take the challenge to the next level by considering the situation in which there is uncertainty about the teammates' behaviors. We discuss the problem of recursive modeling of the teammate's uncertain behavior in two-agent teams and conclude not only that the depth that is useful to model is bounded, but also the number of models useful to consider is linear in the number of actions (and not exponential, as expected). We then show that adopting a naive perspective might lead to negative long-term results in large teams, and thus introduce REACT, an algorithm for determining the action an agent should perform in order to maximize the team's expected utility. Finally, we show empirically that in randomly generated utility matrices, using REACT to select actions outperforms making incorrect assumptions about the identities of teammates.

@InProceedings{AAMAS14-Agmon,
  author = {Noa Agmon and Samuel Barrett and Peter Stone},
  title = {Modeling Uncertainty in Leading Ad Hoc Teams},
  booktitle = {Proc. of 13th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS)},
  location = {Paris, France},
  month = {May},
  year = {2014},
  abstract = {Ad hoc teamwork exists when a team of agents needs to cooperate without being able to communicate or use coordination schemes that were designed a-priori. Sometimes ad hoc teamwork amounts to acting so as to bring out the best in your teammates by ``leading'' them to the optimal joint action. Doing so can be challenging even when their behavior is fully known. In this paper, we take the challenge to the next level by considering the situation in which there is uncertainty about the teammates' behaviors. We discuss the problem of recursive modeling of the teammate's uncertain behavior in two-agent teams and conclude not only that the depth that is useful to model is bounded, but also the number of models useful to consider is linear in the number of actions (and not exponential, as expected). We then show that adopting a naive perspective might lead to negative long-term results in large teams, and thus introduce REACT, an algorithm for determining the action an agent should perform in order to maximize the team's expected utility. Finally, we show empirically that in randomly generated utility matrices, using REACT to select actions outperforms making incorrect assumptions about the identities of teammates.},
}
