Evaluating the Robustness of Natural Language Reward Shaping Models to Spatial Relations

Evaluating the Robustness of Natural Language Reward Shaping Models to Spatial Relations (2020)

As part of an effort to bridge the gap between using reinforcement learning in simulation and in the real world, we probe whether current reward shaping models are able to encode relational data between objects in the environment. We construct an augmented dataset for controlling a robotic arm in the Meta-World platform to test whether current models are able to discriminate between target objects based on their relations. We found that state of the art models are indeed expressive enough to achieve performance comparable to the gold standard, so this specific experiment did not uncover any obvious shortcomings.

View:

PDF

Citation:

Undergraduate Honors Thesis, Computer Science Department, University of Texas at Austin.

Bibtex:

Presentation:

Slides (PDF) Slides (PPT)

People

Antony Yun

Undergraduate Alumni

antony yun [at] utexas edu

Areas of Interest

Deep Learning Language and Robotics Natural Language Processing Reinforcement Learning

Labs

Machine Learning