Peter Stone's Selected Publications

Classified by TopicClassified by Publication TypeSorted by DateSorted by First Author Last NameClassified by Funding Source


RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning

RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning.
Eddy Hudson, Garrett Warnell, and Peter Stone.
In Autonomous Robots and Multirobot Systems Workshop at the 20th International Conference onAutonomous Agents and Multiagent Systems (AAMAS 2021), May 2021.

Download

[PDF]393.0kB  

Abstract

While Adversarial Imitation Learning (AIL) algorithms have recently led to state-of-the-art results on various imitation learning benchmarks, it is unclear as to what impact various design decisions have on performance. To this end, we present here an organizing, modular framework called Reinforcement-learning-based Adversarial Imitation Learning (RAIL) that encompasses and generalizes a popular subclass of existing AIL approaches. Using the view espoused by RAIL, we create two new IfO (Imitation from Observation) algorithms, which we term SAIfO: SAC-based Adversarial Imitation from Observation and SILEM (Skeletal Feature Compensation for Imitation Learning with Embodiment Mismatch). We go into greater depth about SILEM in a separate technical report. In this paper, we focus on SAIfO, evaluating it on a suite of locomotion tasks from OpenAI Gym, and showing that it outperforms contemporaneous RAIL algorithms that perform IfO.

BibTeX Entry

@InProceedings{ARMS2021-Hudson,
  author = {Eddy Hudson and Garrett Warnell and Peter Stone},
  title = {RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning},
  booktitle = {Autonomous Robots and Multirobot Systems Workshop at the 20th International Conference on
Autonomous Agents and Multiagent Systems (AAMAS 2021)},
  location = {London, UK},
  month = {May},
  year = {2021},
  abstract = {
        While Adversarial Imitation Learning (AIL) algorithms have recently led to state-of-the-art results on various imitation learning benchmarks, it is unclear as to what impact various design decisions have on performance. To this end, we present here an organizing, modular framework called Reinforcement-learning-based Adversarial Imitation Learning (RAIL) that encompasses and generalizes a popular subclass of existing AIL approaches. Using the view espoused by RAIL, we create two new IfO (Imitation from Observation) algorithms, which we term SAIfO: SAC-based Adversarial Imitation from Observation and SILEM (Skeletal Feature Compensation for Imitation Learning with Embodiment Mismatch). We go into greater depth about SILEM in a separate technical report. In this paper, we focus on SAIfO, evaluating it on a suite of locomotion tasks from OpenAI Gym, and showing that it outperforms contemporaneous RAIL algorithms that perform IfO.
  },
}

Generated by bib2html.pl (written by Patrick Riley ) on Wed Nov 17, 2021 22:19:02