UTCS Artificial Intelligence
courses
talks/events
demos
people
projects
publications
software/data
labs
areas
admin
Cooperating with a Markovian Ad Hoc Teammate (2013)
Doran Chakraborty
and
Peter Stone
This paper focuses on learning in the presence of a Markovian teammate in Ad hoc teams. A Markovian teammate's policy is a function of a set of discrete feature values derived from the joint history of interaction, where the feature values transition in a Markovian fashion on each time step. We introduce a novel algorithm ``Learning to Cooperate with a Markovian teammate'', or LCM, that converges to optimal cooperation with any Markovian teammate, and achieves safety with any arbitrary teammate. The novel aspect of LCM is the manner in which it satisfies the above two goals via efficient exploration and exploitation. The main contribution of this paper is a full specification and a detailed analysis of LCM's theoretical properties.
View:
PDF
,
PS
,
HTML
Citation:
In
Proceedings of the 12th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
, May 2013.
Bibtex:
@inproceedings{AAMAS13-chakrado, title={Cooperating with a Markovian Ad Hoc Teammate}, author={Doran Chakraborty and Peter Stone}, booktitle={Proceedings of the 12th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)}, month={May}, url="http://www.cs.utexas.edu/users/ai-lab?AAMAS13-chakrado", year={2013} }
People
Doran Chakraborty
Ph.D. Alumni
chakrado [at] cs utexas edu
Peter Stone
Faculty
pstone [at] cs utexas edu
Areas of Interest
Ad Hoc Teamwork
Reinforcement Learning
Labs
Learning Agents