Fast gradient-descent methods for temporal-difference learning with linear function approximation
Fast gradient-descent methods for temporal-difference learning with linear function approximation
Richard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, and Eric Wiewiora, 2009
Download
Abstract
(unavailable)
BibTeX Entry
@InProceedings{Sutton+MPBSSW:2009,
author = "Sutton, Richard S. and Maei, Hamid Reza and Precup, Doina and Bhatnagar, Shalabh and Silver, David and Szepesv{\'a}ri, Csaba and Wiewiora, Eric",
title = "Fast gradient-descent methods for temporal-difference learning with linear function approximation",
booktitle = "Proceedings of the Twenty-sixth Annual International Conference on Machine Learning (ICML 2009)",
year = "2009",
editor = "Danyluk, Andrea Pohoreckyj and Bottou, L{\'e}on and Littman, Michael L.",
volume = "382",
series = "ACM International Conference Proceeding Series",
publisher = "ACM",
pages = "993--1000",
url = "http://www.cs.ualberta.ca/%7Esutton/papers/SMPBSSW-09.pdf",
bib2html_rescat = "Function Approximation",
}