On the Existence of Fixed Points for Q-Learning and Sarsa in Partially Observable Domains
Theodore J. Perkins and Mark D. Pendrith, 2002
Abstract
(unavailable)
BibTeX Entry
@InProceedings{Perkins+Pendrith:2002,
author = "Perkins, Theodore J. and Pendrith, Mark D.",
title = "On the Existence of Fixed Points for {Q-Learning} and {Sarsa} in Partially Observable Domains",
booktitle = "Proceedings of the Nineteenth International Conference on Machine Learning (ICML 2002)",
editor = "Sammut, Claude and Hoffmann, Achim",
year = "2002",
ISBN = "1-55860-873-7",
publisher = "Morgan Kaufmann",
address = "San Francisco, CA, USA",
pages = "490--497",
url = "http://www-all.cs.umass.edu/pubs/2002/perkins_p_ICML02.ps",
bib2html_rescat = "Partial Observability",
}