Teaching an Agent Manually via Evaluative Reinforcement (TAMER) (2009)
Authors: W. Bradley Knox and Peter Stone
Videos of a TAMER agent being trained by a human teacher giving positive and negative feedback signals.
W. Bradley Knox Ph.D. Alumni bradknox [at] mit edu
Peter Stone Faculty pstone [at] cs utexas edu
Learning Non-Myopically from Human-Generated Reward 2013
W. Bradley Knox and Peter Stone, In Proceedings of the International Conference on Intelligent User Interfaces (IUI), March 2013.
How Humans Teach Agents: A New Experimental Perspective 2012
W. Bradley Knox, Brian D. Glass, Bradley C. Love, W. Todd Maddox, and Peter Stone, International Journal of Social Robotics, Vol. 4 (2012), pp. 409-421. Springer Netherlands.
Learning from feedback on actions past and intended 2012
W. Bradley Knox, Cynthia Breazeal, and Peter Stone, In Proceedings of the 7th ACM/IEEE International Conference on Human-Robot Interaction, Late-Breaking Reports Session (HRI 2012), March 2012.
Learning from Human-Generated Reward 2012
W. Bradley Knox,
Reinforcement Learning from Human Reward: Discounting in Episodic Tasks 2012
W. Bradley Knox and Peter Stone, In Proceedings of the 21st IEEE International Symposium on Robot and Human Interactive Communication (Ro-Man), September 2012.
Reinforcement Learning with Human and MDP Reward 2012
W. Bradley Knox and Peter Stone, In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2012), June 2012.
Understanding Human Teaching Modalities in Reinforcement Learning Environments: A Preliminary Report 2011
W. Bradley Knox and Peter Stone, In IJCAI 2011 Workshop on Agents Learning Interactively from Human Teachers (ALIHT), July 2011.
Combining Manual Feedback with Subsequent MDP Reward Signals for Reinforcement Learning 2010
W. Bradley Knox and Peter Stone, In Proc. of 9th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS 2010), May 2010.
Design Principles for Creating Human-Shapable Agents 2009
W. Bradley Knox, Ian Fasel, and Peter Stone, In AAAI Spring 2009 Symposium on Agents that Learn from Human Teachers, March 2009.
Interactively Shaping Agents via Human Reinforcement: The TAMER Framework 2009
W. Bradley Knox and Peter Stone, In The Fifth International Conference on Knowledge Capture, September 2009.