Shivaram's Reading List

Partial Observability

SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Michael R. James and Satinder Singh, 2009

Learning to Use Working Memory in Partially Observable Environments through Dopaminergic Reinforcement
Michael T. Todd, Yael Niv, and Jonathan D. Cohen, 2009

Analysis of an Evolutionary Reinforcement Learning Method in a Multiagent Domain
Jan Hendrik Metzen, Mark Edgington, Yohannes Kassahun, and Frank Kirchner, 2008

Looping suffix tree-based inference of partially observable hidden state
Michael P. Holmes and Charles Lee Isbell, Jr., 2006

Anytime Point-Based Approximations for Large POMDPs
Joelle Pineau, Geoffrey J. Gordon, and Sebastian Thrun, 2006

Scaling Internal-State Policy-Gradient Methods for POMDPs
Douglas Aberdeen and Jonathan Baxter, 2002

An $\epsilon$-Optimal Grid-Based Algorithm for Partially Observable Markov Decision Processes
Blai Bonet, 2002

On the Existence of Fixed Points for Q-Learning and Sarsa in Partially Observable Domains
Theodore J. Perkins and Mark D. Pendrith, 2002

Reinforcement Learning for POMDPs Based on Action Values and Stochastic Optimization
Theodore J. Perkins, 2002

Evolutionary Search, Stochastic Policies with Memory, and Reinforcement Learning with Hidden State
Matthew R. Glickman and Katia Sycara, 2001

Value-Function Approximations for Partially Observable Markov Decision Processes
Milos Hauskrecht, 2000

Monte Carlo POMDPs
Sebastian Thrun, 2000

Reinforcement Learning Using Approximate Belief States
Andrés Rodríguez, Ronald Parr, and Daphne Koller, 1999

Using Eligibility Traces to Find the Best Memoryless Policy in Partially Observable Markov Decision Processes
John Loch and Satinder Singh, 1998

An Analysis of Direct Reinforcement Learning in Non-Markovian Domains
Mark D. Pendrith and Michael J. McGarity, 1998

Reinforcement Learning: An Introduction
Richard S. Sutton and Andrew G. Barto, 1998

Reinforcement Learning with Selective Perception and Hidden State
Andrew Kachites McCallum, 1996

Reinforcement learning with replacing eligibility traces
Satinder P. Singh and Richard S. Sutton, 1996

Reinforcement Learning Algorithm for Partially Observable Markov Decision Problems
Tommi Jaakkola, Satinder P. Singh, and Michael I. Jordan, 1995

Instance-Based Utile Distinctions for Reinforcement Learning with Hidden State
R. Andrew McCallum, 1995

Approximating Optimal Policies for Partially Observable Stochastic Domains
Ronald Parr and Stuart Russell, 1995

Acting optimally in partially observable stochastic domains
Anthony R. Cassandra, Leslie Pack Kaelbling, and Michael L. Littman, 1994

Learning Without State-Estimation in Partially Observable Markovian Decision Processes
Satinder P. Singh, Tommi Jaakkola, and Michael I. Jordan, 1994

Reinforcement learning with hidden states
Long-Ji Lin and Tom M. Mitchell, 1993

Overcoming Incomplete Perception with Utile Distinction Memory
R. Andrew McCallum, 1993

Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach
Lonnie Chrisman, 1992

Cost-Sensitive Reinforcement Learning for Adaptive Classification and Control
Ming Tan, 1991

Learning to perceive and act by trial and error
Steven D. Whitehead and Dana H. Ballard, 1991

A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms
George E. Monahan, 1982

The Optimal Control of Partially Observable Markov Processes Over the Infinite Horizon: Discounted Costs
Edward J. Sondik, 1978

Optimal Control of Markov Processes with Incomplete State Information
K. J. Åström, 1965