Peter Stone's Selected Publications

• Classified by Topic • Classified by Publication Type • Sorted by Date • Sorted by First Author Last Name • Classified by Funding Source •

How Humans Teach Agents: A New Experimental Perspective

How Humans Teach Agents: A New Experimental Perspective.
W. Bradley Knox, Brian D. Glass, Bradley C. Love, W. Todd Maddox, and Peter Stone.
International Journal of Social Robotics, 4:409–421, Springer Netherlands, October 2012. 10.1007/s12369-012-0163-x
International Journal of Social Robotics
Download article from publisher

Download

[PDF]2.6MB

Abstract

Human beings are a largely untapped source of in-the-loop knowledge and guidance for computational learning agents, including robots. To effectively design agents that leverage available human expertise, we need to understand how people naturally teach. In this paper, we describe two experiments that ask how differing conditions affect a human teacher's feedback frequency and the computational agent's learned performance. The first experiment considers the impact of a self-perceived teaching role in contrast to believing one is merely critiquing a recording. The second considers whether a human trainer will give more frequent feedback if the agent acts less greedily (i.e., choosing actions believed to be worse) when the trainer's recent feedback frequency decreases. From the results of these experiments, we draw three main conclusions that inform the design of agents. More broadly, these two studies stand as early examples of a nascent technique of using agents as highly specifiable social entities in experiments on human behavior.

BibTeX Entry

@article {IJSR12-knox,
   author = {W. Bradley Knox and Brian D. Glass and Bradley C. Love and W. Todd Maddox and Peter Stone},
   affiliation = {Department of Computer Science, University of Texas at Austin, Austin, USA},
   title = {How Humans Teach Agents: A New Experimental Perspective},
   journal = {International Journal of Social Robotics},
   publisher = {Springer Netherlands},
   issn = {1875-4791},
   keyword = {Engineering},
   pages = {409-421},
   volume = {4},
   issue = {4},
   url = {http://dx.doi.org/10.1007/s12369-012-0163-x},
   note = {10.1007/s12369-012-0163-x},
   abstract = {Human beings are a largely untapped source of in-the-loop knowledge and guidance for computational learning agents, including robots. To effectively design agents that leverage available human expertise, we need to understand how people naturally teach. In this paper, we describe two experiments that ask how differing conditions affect a human teacher's feedback frequency and the computational agent's learned performance. The first experiment considers the impact of a self-perceived teaching role in contrast to believing one is merely critiquing a recording. The second considers whether a human trainer will give more frequent feedback if the agent acts less greedily (i.e., choosing actions believed to be worse) when the trainer's recent feedback frequency decreases. From the results of these experiments, we draw three main conclusions that inform the design of agents. More broadly, these two studies stand as early examples of a nascent technique of using agents as highly specifiable social entities in experiments on human behavior.},
   year = {2012},
   month = {October},
  wwwnote={<a href="http://www.springer.com/engineering/robotics/journal/12369">International Journal of Social Robotics</a>
    <br><a href="https://link.springer.com/article/10.1007/s12369-012-0163-x">Download article from publisher</a>}
}

Generated by bib2html.pl (written by Patrick Riley ) on Fri Jun 20, 2025 08:27:15