 A Comprehensive Survey of Multiagent Reinforcement Learning
 Lucian Buşoniu,  Robert Babuška, and  Bart De Schutter, 2008
 Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study
 Shivaram Kalyanakrishnan,  Yaxin Liu, and  Peter Stone, 2007
 Hierarchical multi-agent reinforcement learning
 Mohammad Ghavamzadeh,  Sridhar Mahadevan, and  Rajbala Makar, 2006
 Cooperative Multi-Agent Learning: The State of the Art
 Liviu Panait and  Sean Luke, 2005
 Decentralized Control of Cooperative Systems: Categorization and Complexity Analysis
 Claudia V. Goldman and  Shlomo Zilberstein, 2004
 The Complexity of Decentralized Control of Markov Decision Processes
 Daniel S. Bernstein,  Robert Givan,  Neil Immerman, and  Shlomo Zilberstein, 2002
 Coordinated Reinforcement Learning
 Carlos Guestrin,  Michail G. Lagoudakis, and  Ronald Parr, 2002
 Multiagent Planning with Factored MDPs
 Carlos Guestrin,  Daphne Koller, and  Ronald Parr, 2001
 Distributed Value Functions
 Jeff Schneider,  Weng-Keen Wong,  Andrew Moore, and  Martin Riedmiller, 1999
 Reinforcement Learning in the Multi-Robot Domain
 Maja J. Matarić, 1997
 Strongly Typed Genetic Programming in Evolving Cooperation Strategies
 Thomas Haynes,  Roger L. Wainwright,  Sandip Sen, and  Dale A. Schoenefeld, 1995
 Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents
 Ming Tan, 1993
 On Optimal Cooperation of Knowledge Sources - An Empirical Investigation
 M. Benda,  V. Jagannathan, and  R. Dodhiawala, 1986