Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning
Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning
Richard S. Sutton, Doina Precup, and Satinder P. Singh, 1999
Download
Abstract
(unavailable)
BibTeX Entry
@Article{Sutton+PS:1999,
author = "Sutton, Richard S. and Precup, Doina and Singh, Satinder P.",
title = "Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning",
journal = "Artificial Intelligence",
year = "1999",
volume = "112",
number = "1--2",
pages = "181--211",
url = "http://webdocs.cs.ualberta.ca/~sutton/papers/SPS-aij.pdf",
bib2html_rescat = "Representation",
}