Shivaram's Reading List


Function Approximation     Partial Observability     Learning Methods     Ensembles    
Stochastic Optimisation     General RL     General ML     Multiagent Learning    
Comparison/Integration     Bandits     Applications     Robot Soccer    
Humanoids     Parameter     MDP     Empirical    
Failure Warning     Representation     General AI     Neural Networks    
All    

Applications

Learning Complementary Multiagent Behaviors: A Case Study
Shivaram Kalyanakrishnan and Peter Stone, 2010
Details   

A Case Study on Improving Defense Behavior in Soccer Simulation 2D: The NeuroHassle Approach
Thomas Gabel, Martin Riedmiller, and Florian Trost, 2009
Details   

Simulation-Based Approach to General Game Playing
Hilmar Finnsson and Yngvi Björnsson, 2008
Details   

Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning
Arthur Guez, Robert D. Vincent, Massimo Avoli, and Joelle Pineau, 2008
Details   

Model-Based Reinforcement Learning in a Complex Domain
Shivaram Kalyanakrishnan, Peter Stone, and Yaxin Liu, 2008
Details   

Reinforcement learning of motor skills with policy gradients
Jan Peters and Stefan Schaal, 2008
Details   

Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning
Gerald Tesauro, Rajarshi Das, Hoi Chan, Jeffrey O. Kephart, Charles Lefurgy, David W. Levine, and Freeman Rawson, 2008
Details   

Self-Optimizing Memory Controllers: A Reinforcement Learning Approach
Engin \.Ipek, Onur Mutlu, José and Martínez, and Rich Caruana, 2008
Details   

Learning RoboCup-Keepaway with Kernels
Tobias Jung and Daniel Polani, 2007
Details   

Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study
Shivaram Kalyanakrishnan, Yaxin Liu, and Peter Stone, 2007
Details   

Autonomous blimp control using model-free reinforcement learning in a continuous state and action space
Axel Rottmann, Christian Plagemann, Peter Hilgers, and Wolfram Burgard, 2007
Details   

Reinforcement Learning of Local Shape in the Game of Go
David Silver, Richard S. Sutton, and Martin Müller, 2007
Details   

Learning to Play Using Low-Complexity Rule-Based Policies: Illustrations through Ms. Pac-Man
István Szita and András L\Horincz, 2007
Details   

On the use of hybrid reinforcement learning for autonomic resource allocation
Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, and Mohamed N. Bennani, 2007
Details   

Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot
Kentarou Hitomi, Tomohiro Shibata, Yutaka Nakamura, and Shin Ishii, 2006
Details   

Quadruped Robot Obstacle Negotiation via Reinforcement Learning
Honglak Lee, Yirong Shen, Chih-Han Yu, Gurjeet Singh, and Andrew Y. Ng, 2006
Details   

Reinforcement learning for optimized trade execution
Yuriy Nevmyvaka, Yi Feng, and Michael Kearns, 2006
Details   

Keepaway Soccer: From Machine Learning Testbed to Benchmark
Peter Stone, Gregory Kuhlmann, Matthew E. Taylor, and Yaxin Liu, 2006
Details   

Learning Tetris using the noisy cross-entropy method
István Szita and András L\Horincz, 2006
Details   

Reinforcement Learning for RoboCup-Soccer Keepaway
Peter Stone, Richard S. Sutton, and Gregory Kuhlmann, 2005
Details   

Machine Learning for Fast Quadrupedal Locomotion
Nate Kohl and Peter Stone, 2004
Details   

Reinforcement learning for sensing strategies
Cody Kwok and Dieter Fox, 2004
Details   

Autonomous Helicopter Flight via Reinforcement Learning
Andrew Y. Ng, H. Jin Kim, Michael I. Jordan, and Shankar Sastry, 2004
Details   

Multi-Agent Patrolling with Reinforcement Learning
Hugo Santana, Geber Ramalho, Vincent Corruble, and Bohdana Ratitch, 2004
Details   

Stochastic policy gradient reinforcement learning on a simple 3D biped
Russ Tedrake, Teresa Weirui Zhang, and H. Sebastian Seung, 2004
Details   

Adaptive Job Routing and Scheduling
Shimon Whiteson and Peter Stone, 2004
Details   

Active Guidance for a Finless Rocket Using Neuroevolution
Faustino J. Gomez and Risto Miikkulainen, 2003
Details   

Deep Blue
Murray Campbell, A. Joseph Hoane Jr., and Feng-hsiung Hsu, 2002
Details   

Multiagent Planning with Factored MDPs
Carlos Guestrin, Daphne Koller, and Ronald Parr, 2001
Details   

Learning to trade via direct reinforcement
John Moody and Matthew Saffell, 2001
Details   

Planning treatment of ischemic heart disease with partially observable Markov decision processes
Milos Hauskrecht and Hamish Fraser, 2000
Details   

Reinforcement Learning for Control of Self-Similar Call Traffic in Broadband Networks
Jakob Carlström and Ernst Nordström, 1999
Details   

Distributed Value Functions
Jeff Schneider, Weng-Keen Wong, Andrew Moore, and Martin Riedmiller, 1999
Details   

Symposium on Applications of Reinforcement Learning: Final Report for NSF Grant IIS-9810208
Pat Langley and Mark Pendrith, 1998
Details   

Learning to Drive a Bicycle Using Reinforcement Learning and Shaping
Jette Randløv and Preben Alstrøm, 1998
Details   

Reinforcement Learning: An Introduction
Richard S. Sutton and Andrew G. Barto, 1998
Details   

Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems
Satinder Singh and Dimitri Bertsekas, 1997
Details   

Neuro-Dynamic Programming
Dimitri P. Bertsekas and John N. Tsitsiklis, 1996
Details   

Improving Elevator Performance Using Reinforcement Learning
Robert H. Crites and Andrew G. Barto, 1996
Details   

A Reinforcement Learning Approach to job-shop Scheduling
Wei Zhang and Thomas G. Dietterich, 1995
Details   

Practical Issues in Temporal Difference Learning
Gerald Tesauro, 1992
Details   

Further Real Applications of Markov Decision Processes
Douglas J. White, 1988
Details   

Real Applications of Markov Decision Processes
Douglas J. White, 1985
Details