| Function Approximation |   |   | Partial Observability |   |   | Learning Methods |   |   | Ensembles |   |   | 
| Stochastic Optimisation |   |   | General RL |   |   | General ML |   |   | Multiagent Learning |   |   | 
| Comparison/Integration |   |   | Bandits |   |   | Applications |   |   | Robot Soccer |   |   | 
| Humanoids |   |   | Parameter |   |   | MDP |   |   | Empirical |   |   | 
| Failure Warning |   |   | Representation |   |   | General AI |   |   | Neural Networks |   |   | 
| All |   |   | 
 Learning Complementary Multiagent Behaviors: A Case Study
 Shivaram Kalyanakrishnan and  Peter Stone, 2010
    Details   
 A Case Study on Improving Defense Behavior in Soccer Simulation 2D: The NeuroHassle Approach
 Thomas Gabel,  Martin Riedmiller, and  Florian Trost, 2009
    Details   
 Simulation-Based Approach to General Game Playing
 Hilmar Finnsson and  Yngvi Björnsson, 2008
    Details   
 Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning
 Arthur Guez,  Robert D. Vincent,  Massimo Avoli, and  Joelle Pineau, 2008
    Details   
 Model-Based Reinforcement Learning in a Complex Domain
 Shivaram Kalyanakrishnan,  Peter Stone, and  Yaxin Liu, 2008
    Details   
 Reinforcement learning of motor skills with policy gradients
 Jan Peters and  Stefan Schaal, 2008
    Details   
 Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning
 Gerald Tesauro,  Rajarshi Das,  Hoi Chan,  Jeffrey O. Kephart,  Charles Lefurgy,  David W. Levine, and  Freeman Rawson, 2008
    Details   
 Self-Optimizing Memory Controllers: A Reinforcement Learning Approach
 Engin \.Ipek,  Onur Mutlu,  José and  Martínez, and  Rich Caruana, 2008
    Details   
 Learning RoboCup-Keepaway with Kernels
 Tobias Jung and  Daniel Polani, 2007
    Details   
 Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study
 Shivaram Kalyanakrishnan,  Yaxin Liu, and  Peter Stone, 2007
    Details   
 Autonomous blimp control using model-free reinforcement learning in a continuous state and action space
 Axel Rottmann,  Christian Plagemann,  Peter Hilgers, and  Wolfram Burgard, 2007
    Details   
 Reinforcement Learning of Local Shape in the Game of Go
 David Silver,  Richard S. Sutton, and  Martin Müller, 2007
    Details   
 Learning to Play Using Low-Complexity Rule-Based Policies: Illustrations through Ms. Pac-Man
 István Szita and  András L\Horincz, 2007
    Details   
 On the use of hybrid reinforcement learning for autonomic resource allocation
 Gerald Tesauro,  Nicholas K. Jong,  Rajarshi Das, and  Mohamed N. Bennani, 2007
    Details   
 Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot
 Kentarou Hitomi,  Tomohiro Shibata,  Yutaka Nakamura, and  Shin Ishii, 2006
    Details   
 Quadruped Robot Obstacle Negotiation via Reinforcement Learning
 Honglak Lee,  Yirong Shen,  Chih-Han Yu,  Gurjeet Singh, and  Andrew Y. Ng, 2006
    Details   
 Reinforcement learning for optimized trade execution
 Yuriy Nevmyvaka,  Yi Feng, and  Michael Kearns, 2006
    Details   
 Keepaway Soccer:  From Machine Learning Testbed to Benchmark
 Peter Stone,  Gregory Kuhlmann,  Matthew E. Taylor, and  Yaxin Liu, 2006
    Details   
 Learning Tetris using the noisy cross-entropy method
 István Szita and  András L\Horincz, 2006
    Details   
 Reinforcement Learning for RoboCup-Soccer Keepaway
 Peter Stone,  Richard S. Sutton, and  Gregory Kuhlmann, 2005
    Details   
 Machine Learning for Fast Quadrupedal Locomotion
 Nate Kohl and  Peter Stone, 2004
    Details   
 Reinforcement learning for sensing strategies
 Cody Kwok and  Dieter Fox, 2004
    Details   
 Autonomous Helicopter Flight via Reinforcement Learning
 Andrew Y. Ng,  H. Jin Kim,  Michael I. Jordan, and  Shankar Sastry, 2004
    Details   
 Multi-Agent Patrolling with Reinforcement Learning
 Hugo Santana,  Geber Ramalho,  Vincent Corruble, and  Bohdana Ratitch, 2004
    Details   
 Stochastic policy gradient reinforcement learning on a simple 3D biped
 Russ Tedrake,  Teresa Weirui Zhang, and  H. Sebastian Seung, 2004
    Details   
 Adaptive Job Routing and Scheduling
 Shimon Whiteson and  Peter Stone, 2004
    Details   
 Active Guidance for a Finless Rocket Using Neuroevolution
 Faustino J. Gomez and  Risto Miikkulainen, 2003
    Details   
 Deep Blue
 Murray Campbell,  A. Joseph Hoane Jr., and  Feng-hsiung Hsu, 2002
    Details   
 Multiagent Planning with Factored MDPs
 Carlos Guestrin,  Daphne Koller, and  Ronald Parr, 2001
    Details   
 Learning to trade via direct reinforcement
 John Moody and  Matthew Saffell, 2001
    Details   
 Planning treatment of ischemic heart disease with partially observable Markov decision processes
 Milos Hauskrecht and  Hamish Fraser, 2000
    Details   
 Reinforcement Learning for Control of Self-Similar Call Traffic in Broadband Networks
 Jakob Carlström and  Ernst Nordström, 1999
    Details   
 Distributed Value Functions
 Jeff Schneider,  Weng-Keen Wong,  Andrew Moore, and  Martin Riedmiller, 1999
    Details   
 Symposium on Applications of Reinforcement Learning: Final Report for NSF Grant IIS-9810208
 Pat Langley and  Mark Pendrith, 1998
    Details   
 Learning to Drive a Bicycle Using Reinforcement Learning and Shaping
 Jette Randløv and  Preben Alstrøm, 1998
    Details   
 Reinforcement Learning: An Introduction
 Richard S. Sutton and  Andrew G. Barto, 1998
    Details   
 Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems
 Satinder Singh and  Dimitri Bertsekas, 1997
    Details   
 Neuro-Dynamic Programming
 Dimitri P. Bertsekas and  John N. Tsitsiklis, 1996
    Details   
 Improving Elevator Performance Using Reinforcement Learning
 Robert H. Crites and  Andrew G. Barto, 1996
    Details   
 A Reinforcement Learning Approach to job-shop Scheduling
 Wei Zhang and  Thomas G. Dietterich, 1995
    Details   
 Practical Issues in Temporal Difference Learning
 Gerald Tesauro, 1992
    Details   
 Further Real Applications of Markov Decision Processes
 Douglas J. White, 1988
    Details   
 Real Applications of Markov Decision Processes
 Douglas J. White, 1985
    Details