Off-Policy Temporal Difference Learning with Function Approximation
Doina Precup, Richard S. Sutton, and Sanjoy Dasgupta, 2001
Abstract
(unavailable)
BibTeX Entry
@InProceedings{Precup+SD:2001,
  author =       "Precup, Doina and Sutton, Richard S. and Dasgupta, Sanjoy",
  title =        "Off-Policy Temporal Difference Learning with Function Approximation",
  booktitle =    "Proceedings of the Eighteenth International Conference on Machine Learning (ICML 2001)",
  year =         "2001",
  ISBN =         "1-55860-778-1",
  editor =       "Brodley, Carla E. and Danyluk, Andrea Pohoreckyj",
  publisher =    "Morgan Kaufmann",
  pages =        "417--424",
  url =          "http://www.cs.ualberta.ca/~sutton/papers/PSD-01-retypeset.pdf",
  bib2html_rescat = "Function Approximation",
}