Convergence, Targeted Optimality and Safety in Multiagent Learning (2010)
This paper introduces a novel multiagent learning algorithm which achieves convergence, targeted optimality against memory-bounded adversaries, and safety, in arbitrary repeated games. Called CMLeS, its most novel aspect is the manner in which it guarantees (in a PAC sense) targeted optimality against memory-bounded adversaries, via efficient exploration and exploitation. CMLeS is fully implemented and we present empirical results demonstrating its effectiveness.
View:
PDF, PS, HTML
Citation:
In Proceedings of the Twenty-seventh International Conference on Machine Learning (ICML 2010), June 2010.
Bibtex:

Doran Chakraborty Ph.D. Alumni chakrado [at] cs utexas edu
Peter Stone Faculty pstone [at] cs utexas edu