Peter Stone's Selected Publications

Classified by TopicClassified by Publication TypeSorted by DateSorted by First Author Last NameClassified by Funding Source


Modeling Uncertainty in Leading Ad Hoc Teams

Modeling Uncertainty in Leading Ad Hoc Teams.
Noa Agmon, Samuel Barrett, and Peter Stone.
In Proc. of 13th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS), May 2014.

Download

[PDF]255.6kB  [postscript]1.7MB  

Abstract

Ad hoc teamwork exists when a team of agents needs to cooperate without being able to communicate or use coordination schemes that were designed a-priori. Sometimes ad hoc teamwork amounts to acting so as to bring out the best in your teammates by ``leading'' them to the optimal joint action. Doing so can be challenging even when their behavior is fully known. In this paper, we take the challenge to the next level by considering the situation in which there is uncertainty about the teammates' behaviors. We discuss the problem of recursive modeling of the teammate's uncertain behavior in two-agent teams and conclude not only that the depth that is useful to model is bounded, but also the number of models useful to consider is linear in the number of actions (and not exponential, as expected). We then show that adopting a naive perspective might lead to negative long-term results in large teams, and thus introduce REACT, an algorithm for determining the action an agent should perform in order to maximize the team's expected utility. Finally, we show empirically that in randomly generated utility matrices, using REACT to select actions outperforms making incorrect assumptions about the identities of teammates.

BibTeX Entry

@InProceedings{AAMAS14-Agmon,
  author = {Noa Agmon and Samuel Barrett and Peter Stone},
  title = {Modeling Uncertainty in Leading Ad Hoc Teams},
  booktitle = {Proc. of 13th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS)},
  location = {Paris, France},
  month = {May},
  year = {2014},
  abstract={
    Ad hoc teamwork exists when a team of agents needs to cooperate without being able to communicate or use coordination schemes that were designed a-priori. Sometimes ad hoc teamwork amounts to acting so as to bring out the best in your teammates by ``leading'' them to the optimal joint action.  Doing so can be challenging even when their behavior is fully known. In this paper, we take the challenge to the next level by considering the situation in which there is uncertainty about the teammates' behaviors. We discuss the problem of recursive modeling of the teammate's uncertain behavior in two-agent teams and conclude not only that the depth that is useful to model is bounded, but also the number of models useful to consider is linear in the number of actions (and not exponential, as expected). We then show that adopting a naive perspective might lead to negative long-term results in large teams, and thus introduce REACT, an algorithm for determining the action an agent should perform in order to maximize the team's expected utility.  Finally, we show empirically that in randomly generated utility matrices, using REACT to select actions outperforms making incorrect assumptions about the identities of teammates.
  },
}

Generated by bib2html.pl (written by Patrick Riley ) on Wed Apr 17, 2024 18:42:54