Shivaram's Reading List

Shivaram's Reading List

Function Approximation			Partial Observability			Learning Methods			Ensembles
Stochastic Optimisation			General RL			General ML			Multiagent Learning
Comparison/Integration			Bandits			Applications			Robot Soccer
Humanoids			Parameter			MDP			Empirical
Failure Warning			Representation			General AI			Neural Networks
All

Applications

Learning Complementary Multiagent Behaviors: A Case Study
Shivaram Kalyanakrishnan and Peter Stone, 2010
Details

A Case Study on Improving Defense Behavior in Soccer Simulation 2D: The NeuroHassle Approach
Thomas Gabel, Martin Riedmiller, and Florian Trost, 2009
Details

Simulation-Based Approach to General Game Playing
Hilmar Finnsson and Yngvi Björnsson, 2008
Details

Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning
Arthur Guez, Robert D. Vincent, Massimo Avoli, and Joelle Pineau, 2008
Details

Model-Based Reinforcement Learning in a Complex Domain
Shivaram Kalyanakrishnan, Peter Stone, and Yaxin Liu, 2008
Details

Reinforcement learning of motor skills with policy gradients
Jan Peters and Stefan Schaal, 2008
Details

Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning
Gerald Tesauro, Rajarshi Das, Hoi Chan, Jeffrey O. Kephart, Charles Lefurgy, David W. Levine, and Freeman Rawson, 2008
Details

Self-Optimizing Memory Controllers: A Reinforcement Learning Approach
Engin \.Ipek, Onur Mutlu, José and Martínez, and Rich Caruana, 2008
Details

Learning RoboCup-Keepaway with Kernels
Tobias Jung and Daniel Polani, 2007
Details

Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study
Shivaram Kalyanakrishnan, Yaxin Liu, and Peter Stone, 2007
Details

Autonomous blimp control using model-free reinforcement learning in a continuous state and action space
Axel Rottmann, Christian Plagemann, Peter Hilgers, and Wolfram Burgard, 2007
Details

Reinforcement Learning of Local Shape in the Game of Go
David Silver, Richard S. Sutton, and Martin Müller, 2007
Details

Learning to Play Using Low-Complexity Rule-Based Policies: Illustrations through Ms. Pac-Man
István Szita and András L\Horincz, 2007
Details

On the use of hybrid reinforcement learning for autonomic resource allocation
Gerald Tesauro, Nicholas K. Jong, Rajarshi Das, and Mohamed N. Bennani, 2007
Details

Reinforcement learning for quasi-passive dynamic walking of an unstable biped robot
Kentarou Hitomi, Tomohiro Shibata, Yutaka Nakamura, and Shin Ishii, 2006
Details

Quadruped Robot Obstacle Negotiation via Reinforcement Learning
Honglak Lee, Yirong Shen, Chih-Han Yu, Gurjeet Singh, and Andrew Y. Ng, 2006
Details

Reinforcement learning for optimized trade execution
Yuriy Nevmyvaka, Yi Feng, and Michael Kearns, 2006
Details

Keepaway Soccer: From Machine Learning Testbed to Benchmark
Peter Stone, Gregory Kuhlmann, Matthew E. Taylor, and Yaxin Liu, 2006
Details

Learning Tetris using the noisy cross-entropy method
István Szita and András L\Horincz, 2006
Details

Reinforcement Learning for RoboCup-Soccer Keepaway
Peter Stone, Richard S. Sutton, and Gregory Kuhlmann, 2005
Details

Machine Learning for Fast Quadrupedal Locomotion
Nate Kohl and Peter Stone, 2004
Details

Reinforcement learning for sensing strategies
Cody Kwok and Dieter Fox, 2004
Details

Autonomous Helicopter Flight via Reinforcement Learning
Andrew Y. Ng, H. Jin Kim, Michael I. Jordan, and Shankar Sastry, 2004
Details

Multi-Agent Patrolling with Reinforcement Learning
Hugo Santana, Geber Ramalho, Vincent Corruble, and Bohdana Ratitch, 2004
Details

Stochastic policy gradient reinforcement learning on a simple 3D biped
Russ Tedrake, Teresa Weirui Zhang, and H. Sebastian Seung, 2004
Details

Adaptive Job Routing and Scheduling
Shimon Whiteson and Peter Stone, 2004
Details

Active Guidance for a Finless Rocket Using Neuroevolution
Faustino J. Gomez and Risto Miikkulainen, 2003
Details

Deep Blue
Murray Campbell, A. Joseph Hoane Jr., and Feng-hsiung Hsu, 2002
Details

Multiagent Planning with Factored MDPs
Carlos Guestrin, Daphne Koller, and Ronald Parr, 2001
Details

Learning to trade via direct reinforcement
John Moody and Matthew Saffell, 2001
Details

Planning treatment of ischemic heart disease with partially observable Markov decision processes
Milos Hauskrecht and Hamish Fraser, 2000
Details

Reinforcement Learning for Control of Self-Similar Call Traffic in Broadband Networks
Jakob Carlström and Ernst Nordström, 1999
Details

Distributed Value Functions
Jeff Schneider, Weng-Keen Wong, Andrew Moore, and Martin Riedmiller, 1999
Details

Symposium on Applications of Reinforcement Learning: Final Report for NSF Grant IIS-9810208
Pat Langley and Mark Pendrith, 1998
Details

Learning to Drive a Bicycle Using Reinforcement Learning and Shaping
Jette Randløv and Preben Alstrøm, 1998
Details

Reinforcement Learning: An Introduction
Richard S. Sutton and Andrew G. Barto, 1998
Details

Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems
Satinder Singh and Dimitri Bertsekas, 1997
Details

Neuro-Dynamic Programming
Dimitri P. Bertsekas and John N. Tsitsiklis, 1996
Details

Improving Elevator Performance Using Reinforcement Learning
Robert H. Crites and Andrew G. Barto, 1996
Details

A Reinforcement Learning Approach to job-shop Scheduling
Wei Zhang and Thomas G. Dietterich, 1995
Details

Practical Issues in Temporal Difference Learning
Gerald Tesauro, 1992
Details

Further Real Applications of Markov Decision Processes
Douglas J. White, 1988
Details

Real Applications of Markov Decision Processes
Douglas J. White, 1985
Details