Linear Least-Squares Algorithms for Temporal Difference Learning
Linear Least-Squares Algorithms for Temporal Difference Learning
Steven J. Bradtke and Andrew G. Barto, 1996
Download
Abstract
(unavailable)
BibTeX Entry
@Article{Bradtke+Barto:1996,
author = "Bradtke, Steven J. and Barto, Andrew G.",
title = "Linear Least-Squares Algorithms for Temporal Difference Learning",
journal = "Machine Learning",
year = "1996",
volume = "22",
pages = "33--57",
publisher = "Kluwer Academic Publishers",
address = "Boston, MA, USA",
url = "http://www.cs.ualberta.ca/~sutton/bradtke-barto-96.pdf",
bib2html_rescat = "Function Approximation",
}