Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks (2021)
A longstanding goal of artificial intelligence is to create artificial agents capable of learning to perform tasks that require sequential decision making. Importantly, while it is the artificial agent that learns and acts, it is still up to humans to specify the particular task to be performed. Classical task-specification approaches typically involve humans providing stationary reward functions or explicit demonstrations of the desired tasks. However, there has recently been a great deal of research energy invested in exploring alternative ways in which humans may guide learning agents that may, e.g., be more suitable for certain tasks or require less human effort. This survey provides a high-level overview of five recent machine learning frameworks that primarily rely on human guidance apart from pre-specified reward functions or conventional, step-by-step action demonstrations. We review the motivation, assumptions, and implementation of each framework, and we discuss possible future research directions.
Autonomous Agents and Multi-Agent Systems (2021).

Peter Stone Faculty pstone [at] cs utexas edu
Faraz Torabi Ph.D. Student faraztrb [at] cs utexas edu
Garrett Warnell Research Scientist warnellg [at] cs utexas edu