A General Purpose Task Specification Language for Bootstrap Learning (2008)
Reinforcement learning (RL) is an effective framework for online learning by autonomous agents. Most RL research focuses on domain-independent learning algorithms, requiring an expert human to define the environment (state and action representation) and the task to be performed (e.g. start state and reward function) on a case-by-case basis. In this paper, we describe a general language for a teacher to specify sequential decision-making tasks to RL agents. The teacher may communicate properties such as start states, reward functions, termination conditions, successful execution traces, task decompositions, and other advice. The learner may then practice and learn the task on its own using any RL algorithm. We demonstrate our language in a simple GridWorld example and on the RoboCup soccer keepaway benchmark problem. The language forms the basis of a larger "Bootstrap Learning" model for machine learning, a paradigm for incremental development of complete systems through integration of multiple machine learning techniques.
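To make the separation between task specification and learning algorithm concrete, the following is a minimal Python sketch. It is not the language described in the paper, whose syntax is not given in this abstract. The `TaskSpec` class, the 4x4 grid, the reward values, and the choice of tabular Q-learning as the learner are all assumptions made for illustration. The teacher supplies only a start state, a reward function, and a termination condition; the learner then practices on its own.

```python
import random
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

State = Tuple[int, int]
# Hypothetical action set for a small GridWorld: name -> (dx, dy).
ACTIONS: Dict[str, Tuple[int, int]] = {
    "up": (0, -1), "down": (0, 1), "left": (-1, 0), "right": (1, 0),
}


@dataclass(frozen=True)
class TaskSpec:
    """Hypothetical container for what a teacher communicates about a task."""
    start_state: State
    reward: Callable[[State, str, State], float]
    is_terminal: Callable[[State], bool]


def step(state: State, action: str, size: int) -> State:
    """Deterministic move; the agent stays in place when it hits a wall."""
    dx, dy = ACTIONS[action]
    x = min(max(state[0] + dx, 0), size - 1)
    y = min(max(state[1] + dy, 0), size - 1)
    return (x, y)


def q_learning(task: TaskSpec, size: int, episodes: int = 500,
               alpha: float = 0.5, gamma: float = 0.95, epsilon: float = 0.1,
               max_steps: int = 200, seed: int = 0) -> Dict[Tuple[State, str], float]:
    """Tabular Q-learning that reads the task only through the TaskSpec."""
    rng = random.Random(seed)
    names = list(ACTIONS)
    q: Dict[Tuple[State, str], float] = {}
    for _ in range(episodes):
        s = task.start_state
        for _ in range(max_steps):
            if task.is_terminal(s):
                break
            if rng.random() < epsilon:
                a = rng.choice(names)  # explore
            else:
                a = max(names, key=lambda n: q.get((s, n), 0.0))  # exploit
            s2 = step(s, a, size)
            r = task.reward(s, a, s2)
            best_next = 0.0 if task.is_terminal(s2) else max(
                q.get((s2, n), 0.0) for n in names)
            old = q.get((s, a), 0.0)
            q[(s, a)] = old + alpha * (r + gamma * best_next - old)
            s = s2
    return q


def greedy_rollout(task: TaskSpec, q: Dict[Tuple[State, str], float],
                   size: int, max_steps: int = 50) -> List[State]:
    """Follow the greedy policy from the start state and record the path."""
    names = list(ACTIONS)
    s = task.start_state
    path = [s]
    while not task.is_terminal(s) and len(path) <= max_steps:
        a = max(names, key=lambda n: q.get((s, n), 0.0))
        s = step(s, a, size)
        path.append(s)
    return path


# The "teacher" side: an illustrative task with assumed reward values.
GOAL: State = (3, 3)
task = TaskSpec(
    start_state=(0, 0),
    reward=lambda s, a, s2: 1.0 if s2 == GOAL else -0.01,
    is_terminal=lambda s: s == GOAL,
)

# The "learner" side: practice, then act greedily on what was learned.
q_table = q_learning(task, size=4)
path = greedy_rollout(task, q_table, size=4)
print(path)
```

Because the learner touches the task only through `start_state`, `reward`, and `is_terminal`, a different RL algorithm could be substituted without changing the specification, which is the property the abstract emphasizes. Richer elements mentioned above, such as execution traces and task decompositions, are not modeled in this sketch.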
In AAAI Spring 2009 Symposium on Agents that Learn from Human Teachers, March 2008.

Ian Fasel Postdoctoral Alumni ianfasel [at] cs utexas edu
Michael Quinlan Formerly affiliated Research Scientist mquinlan [at] cs utexas edu
Peter Stone Faculty pstone [at] cs utexas edu