Convergence, Targeted Optimality and Safety in Multiagent Learning

Convergence, Targeted Optimality and Safety in Multiagent Learning (2010)

This paper introduces a novel multiagent learning algorithm which achieves convergence, targeted optimality against memory-bounded adversaries, and safety, in arbitrary repeated games. Called CMLeS, its most novel aspect is the manner in which it guarantees (in a PAC sense) targeted optimality against memory-bounded adversaries, via efficient exploration and exploitation. CMLeS is fully implemented and we present empirical results demonstrating its effectiveness.

View:

PDF, PS, HTML

Citation:

In Proceedings of the Twenty-seventh International Conference on Machine Learning (ICML 2010), June 2010.

Bibtex:

People

Doran Chakraborty	Ph.D. Alumni	chakrado [at] cs utexas edu
Peter Stone	Faculty	pstone [at] cs utexas edu

Areas of Interest

Agent Modeling in Multiagent Systems Reinforcement Learning

Labs

Learning Agents