Learning Agents

Webpage: http://www.cs.utexas.edu/users/pstone/

The learning agents research group is led by Prof. Peter Stone. Our aim is to understand how we can best create complete intelligent agents. We consider both adaptation and interaction to be essential capabilities of such agents. Thus, our research focuses mainly on machine learning, multiagent systems, and robotics. Application domains include robot soccer, autonomous bidding agents, traffic management, and autonomic computing.

People

Ishan Durugkar	Ph.D. Student	ishand [at] cs utexas edu
Josiah Hanna	Ph.D. Student	jphanna [at] cs utexas edu
Justin Hart	Postdoctoral Fellow	hart [at] cs utexas edu
Yu-Sian Jiang	Ph.D. Student	sharonjiang [at] utexas edu
Yuqian Jiang	Ph.D. Student
Elad Liebman	Ph.D. Student	eladlieb [at] cs utexas edu
Shih-Yun Lo	Ph.D. Student	yunl [at] cs utexas edu
Patrick MacAlpine	Ph.D. Student	patmac [at] cs utexas edu
Jacob Menashe	Ph.D. Student	jmenashe [at] cs utexas edu
Sanmit Narvekar	Ph.D. Student	sanmit [at] cs utexas edu
Guni Sharon	Postdoctoral Fellow	gunisharon [at] gmail com
Peter Stone	Faculty	pstone [at] cs utexas edu
Faraz Torabi	Ph.D. Student	faraztrb [at] cs utexas edu
Garrett Warnell	Research Scientist	warnellg [at] cs utexas edu
Harel Yedidsion	Postdoctoral Fellow	harel [at] cs utexas edu

Publications

A Novel Control Law for Multi-joint Human-Robot Interaction Tasks While Maintaining Postural Coordination

2023

Keya Ghonasgi, Reuth Mirsky, Adrian M Haith, Peter Stone, and Ashish D Deshpande, 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (2023).

Benchmarking Reinforcement Learning Techniques for Autonomous Navigation

2023

Zifan Xu, Bo Liu, Xuesu Xiao, Anirudh Nair, and Peter Stone, In Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA 2023), London, England, May 2023.

Causal Policy Gradient for Whole-Body Mobile Manipulation

2023

Jiaheng Hu, Peter Stone, and Roberto Martin-Martin, In Robotics: Science and Systems (RSS2023), Daegu, Republic of Korea, July 2023.

D-Shape: Demonstration-Shaped Reinforcement Learning via Goal Conditioning

2023

Caroline Wang, Garrett Warnell, and Peter Stone, In Proceedings of the 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS), London, UK, May 2023.

DM$^2$: Decentralized Multi-Agent Reinforcement Learning via Distribution Matching

2023

Caroline Wang, Ishan Durugkar, Elad Liebman, and Peter Stone, In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI-23), Washington, D.C., February 2023.

Kinematic coordinations capture learning during human–exoskeleton interaction

2023

Keya Ghonasgi, Reuth Mirsky, Nisha Bhargava, Adrian M Haith, Peter Stone, and Ashish D Deshpande, Scientific Reports, Vol. 13 (2023), pp. 10322.

Learning Perceptual Hallucination for Multi-Robot Navigation in Narrow Hallways

2023

Jinsoo Park, Xuesu Xiao, Garrett Warnell, Harel Yedidsion, and Peter Stone, In Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA 2023), London, England, May 2023.

MACTA: A Multi-agent Reinforcement Learning Approach for Cache Timing Attacks and Detection

2023

Jiaxun Cui, Xiaomeng Yang, Mulong Luo, Geunbae Lee, Peter Stone, Hsien-Hsin S. Lee, Benjamin Lee, G. Edward Suh, Wenjie Xiong, and Yuandong Tian, In The Eleventh International Conference on Learning Representations (ICLR), Kigali, Rwanda, May 2023.

Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning

2023

Bo Liu, Yihao Feng, Qiang Liu, and Peter Stone, In Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), Washington, DC, US, February 2023.

Model-Based Meta Automatic Curriculum Learning

2023

Zifan Xu, Yulin Zhang, Shahaf S. Shperberg, Reuth Mirsky, Yuqian Jiang, Bo Liu, and Peter Stone, In The Second Conference on Lifelong Learning Agents (CoLLAs 2023), Montreal, Canada, August 2023.

Motion Planning (In)feasibility Detection using a Prior Roadmap via Path and Cut Search

2023

Yoonchang Sung and Peter Stone, In Robotics: Science and Systems (RSS2023), Daegu, Republic of Korea, July 2023.

Reward (Mis)design for autonomous driving

2023

W. Bradley Knox, Alessandro Allievi, Holger Banzhaf, Felix Schmitt, and Peter Stone, Artificial Intelligence, Vol. 316 (2023).

Symbolic State Space Optimization for Long Horizon Mobile Manipulation Planning

2023

Xiaohan Zhang, Yifeng Zhu, Yan Ding, Yuqian Jiang, Yuke Zhu, Peter Stone, and Shiqi Zhang, In International Conference on Intelligent Robots and Systems, Detroit, USA, October 2023.

The Perils of Trial-and-Error Reward Design: Misdesign through Overfitting and Invalid Task Specifications

2023

Serena Booth, W Bradley Knox, Julie Shah, Scott Niekum, Peter Stone, and Alessandro Allievi, In Proceedings of the 37th AAAI Conference on Artificial Intelligence (AAAI), Washington, D.C., February 2023.

VaryNote: A Method to Automatically Vary the Number of Notes in Symbolic Music

2023

Juan M. Huerta, Bo Liu, and Peter Stone, In Bridge after the turmoil - The 16th International Symposium, CMMR 2023, Tokyo, Japan, November 13-17, 2023.

A Rule-based Shield: Accumulating Safety Rules from Catastrophic Action Effects

2022

Shahaf Shperberg, Bo Liu, Alessandro Allievi, and Peter Stone, In Proceedings of the 1st Conference on Lifelong Learning Agents (CoLLAs), Montreal, Canada, August 2022.

A Survey of Ad Hoc Teamwork Research

2022

Reuth Mirsky, Ignacio Carlucho, Arrasy Rahman, Eliott Fosong, William Macke, Mohan Sridharan, Peter Stone, and Stefano Albrecht, In The 19th European Conference on Multi-Agent Systems (EUMAS), Dusseldorf, Germany, September 2022.

Adversarial Imitation Learning from Video using a State Observer

2022

Haresh Karnan, Garrett Warnell, Faraz Torabi, and Peter Stone, In International Conference on Robotics and Automation, 2022, Philadelphia, Pennsylvania, May 2022.

APPL: Adaptive Planner Parameter Learning

2022

Xuesu Xiao, Zizhao Wang, Zifan Xu, Bo Liu, Gauraang Dhamankar, Anirudh Nair, Garrett Warnell, and Peter Stone, Robotics and Autonomous Systems (2022).

Autonomous Ground Navigation in Highly Constrained Spaces: Lessons Learned From the Benchmark Autonomous Robot Navigation Challenge at ICRA 2022

2022

Xuesu Xiao, Zifan Xu, Zizhao Wang, Yunlong Song, Garrett Warnell, Peter Stone, Tingnan Zhang, Shravan Ravi, Gary Wang, Haresh Karnan, Joydeep Biswas, Nicholas Mohammad, Lauren Bramblett, Rahul Peddi, Nicola Bezzo, Zhanteng Xie, and Philip Dames, IEEE Robotics and Automation Magazine (2022).

BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach

2022

Bo Liu, Mao Ye, Stephen Wright, Peter Stone, and Qiang Liu, In Conference on Neural Information Processing Systems, 2022, New Orleans, LA, December 2022.

Bottom-Up Skill Discovery from Unsegmented Demonstrations for Long-Horizon Robot Manipulation

2022

Yifeng Zhu, Peter Stone, and Yuke Zhu, IEEE Robotics and Automation Letters (2022).

Causal Dynamics Learning for Task-Independent State Abstraction

2022

Zizhao Wang, Xuesu Xiao, Zifan Xu, Yuke Zhu, and Peter Stone, In Proceedings of the 39th International Conference on Machine Learning (ICML2022), Baltimore, USA, July 2022.

Continual Learning and Private Unlearning

2022

Bo Liu, Qiang Liu, and Peter Stone, In Proceedings of the 1st Conference on Lifelong Learning Agents (CoLLAs), Montreal, Canada, August 2022.

Coopernaut: End-to-End Driving with Cooperative Perception for Networked Vehicles

2022

Jiaxun Cui, Hang Qiu, Dian Chen, Peter Stone, and Yuke Zhu, In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, June 2022.

DynaBARN: Benchmarking Metric Ground Navigation in Dynamic Environments

2022

Anirudh Nair, Fulin Jiang, Kang Hou, Zifan Xu, Shuozhe Li, Xuesu Xiao, and Peter Stone, In Proceedings of the 2022 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), November 2022.

Effective Mutation Rate Adaptation through Group Elite Selection

2022

Akarsh Kumar, Bo Liu, Risto Miikkulainen, and Peter Stone, In Proceedings of the Genetic and Evolutionary Computation Conference, Boston, United States, July 2022.

Learning a Robust Multiagent Driving Policy for Traffic Congestion Reduction

2022

Yulin Zhang, William Macke, Jiaxun Cui, Daniel Urieli, and Peter Stone, In Proceedings of the Adaptive and Learning Agents Workshop (ALA), Auckland, NZ, May 2022.

Learning to Correct Mistakes: Backjumping in Long-Horizon Task and Motion Planning

2022

Yoonchang Sung, Zizhao Wang, and Peter Stone, In Proceedings of the 6th Conference on Robot Learning (CoRL 2022), Auckland, New Zealand, December 2022.

Model-Based Meta Automatic Curriculum Learning

2022

Zifan Xu, Yulin Zhang, Shahaf S. Shperberg, Reuth Mirsky, Yuqian Jiang, Bo Liu, and Peter Stone, In Decision Awareness in Reinforcement Learning (DARL) workshop at the 39th International Conference on Machine Learning (ICML), Baltimore, Maryland, USA, July 2022.

Motion Planning and Control for Mobile Robot Navigation Using Machine Learning: a Survey

2022

Xuesu Xiao, Bo Liu, Garrett Warnell, and Peter Stone, Autonomous Robots (2022).

Quantifying Changes in Kinematic Behavior of a Human-Exoskeleton Interactive System

2022

Keya Ghonasgi, Reuth Mirsky, Adrian M Haith, Peter Stone, and Ashish D Deshpande, In Proceedings of the 35th International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, October 2022.

Quantifying Human Rationality in Ad-hoc Teamwork

2022

Yair Hanina, Reuth Mirsky, William Macke, and Peter Stone, In AAMAS workshop on Autonomous Robots and Multirobot Systems (ARMS), Online, May 2022.

Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings

2022

Kingsley Nweye, Bo Liu, Nagy Zoltan, and Peter Stone, Journal of Energy and AI, 2022 (2022).

Socially CompliAnt Navigation Dataset (SCAND): A Large-Scale Dataset Of Demonstrations For Social Navigation

2022

Haresh Karnan, Anirudh Nair, Xuesu Xiao, Garrett Warnell, Soren Pirk, Alexander Toshev, Justin Hart, Joydeep Biswas, and Peter Stone, Robotics and Automation Letters (RA-L), 2022 (2022).

Task Factorization in Curriculum Learning

2022

Reuth Mirsky, Shahaf S. Shperberg, Yulin Zhang, Zifan Xu, Yuqian Jiang, Jiaxun Cui, and Peter Stone, In Decision Awareness in Reinforcement Learning (DARL) workshop at the 39th International Conference on Machine Learning (ICML), Baltimore, Maryland, USA, July 2022.

Towards a Real-Time, Low-Resource, End-to-end Object Detection Pipeline for Robot Soccer

2022

Sai Kiran Narayanaswami, Mauricio Tec, Ishan Durugkar, Siddharth Desai, Bharath Masetty, Sanmit Narvekar, and Peter Stone, In Proceedings of the RoboCup Symposium, 2022, Bangkok, Thailand, July 2022.

VI-IKD: High-Speed Accurate Off-Road Navigation using Learned Visual-Inertial Inverse Kinodynamics

2022

Haresh Karnan, Kavan Sikand, Pranav Atreya, Sadegh Rabiee, Xuesu Xiao, Garrett Warnell, Peter Stone, and Joydeep Biswas, In International Conference on Intelligent Robots and Systems, 2022, Kyoto, Japan, October 2022.

VIOLA: Imitation Learning for Vision-Based Manipulation with Object Proposal Priors

2022

Yifeng Zhu, Abhishek Joshi, Peter Stone, and Yuke Zhu, In Proceedings of the 6th Conference on Robot Learning (CoRL 2022), Auckland, New Zealand, December 2022.

VOILA: Visual-Observation-Only Imitation Learning for Autonomous Navigation

2022

Haresh Karnan, Garrett Warnell, Xuesu Xiao, and Peter Stone, In International Conference on Robotics and Automation, 2022, Philadelphia, Pennsylvania, May 2022.

A Lifelong Learning Approach to Mobile Robot Navigation

2021

Bo Liu, Xuesu Xiao, and Peter Stone, In IEEE International Conference on Robotics and Automation (ICRA), 2021, Xi'an, China, June 2021.

A Scavenger Hunt for Service Robots

2021

Harel Yedidsion, Jennifer Suriadinata, Zifan Xu, Stefan Debruyn, and Peter Stone, In Proceedings of the 2021 International Conference on Robotics and Automation (ICRA 2021), Xi'an, China, May 2021.

Adversarial Intrinsic Motivation for Reinforcement Learning

2021

Ishan Durugkar, Mauricio Tec, Scott Niekum, and Peter Stone, In Proceedings of the 35th International Conference on Neural Information Processing Systems (NeurIPS 2021), Sydney, Australia, December 2021.

Agile Robot Navigation through Hallucinated Learning and Sober Deployment

2021

Xuesu Xiao, Bo Liu, and Peter Stone, In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA 2021), Xi'an, China, June 2021.

APPLE: Adaptive Planner Parameter Learning From Evaluative Feedback

2021

Zizhao Wang, Xuesu Xiao, Bo Liu, Garrett Warnell, and Peter Stone, IEEE Robotics and Automation Letters, presented at International Conference on Intelligent Robots and Systems (IROS) (2021).

APPLI: Adaptive Planner Parameter Learning From Interventions

2021

Zizhao Wang, Xuesu Xiao, Bo Liu, Garrett Warnell, and Peter Stone, In Proceedings of the International Conference on Robotics and Automation (ICRA 2021), Xi'an, China, May 2021.

APPLR: Adaptive Planner Parameter Learning from Reinforcement

2021

Zifan Xu, Gauraang Dhamankar, Anirudh Nair, Xuesu Xiao, Garrett Warnell, Bo Liu, Zizhao Wang, and Peter Stone, In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA 2021), Xi'an, China, June 2021.

Capturing Skill State in Curriculum Learning for Human Skill Acquisition

2021

Keya Ghonasgi, Reuth Mirsky, Sanmit Narvekar, Bharath Masetty, Adrian M. Haith, Peter Stone, and Ashish D. Deshpande, In International Conference on Intelligent Robots and Systems (IROS), Virtual, September 2021.

Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition

2021

Bo Liu, Qiang Liu, Peter Stone, Animesh Garg, Yuke Zhu, and Animashree Anandkumar, In Proceedings of the 38th International Conference on Machine Learning, PMLR 139, 2021 (ICML), Vienna, Austria, July 2021.

Conflict-Averse Gradient Descent for Multi-task learning

2021

Bo Liu, Xingchao Liu, Xiaojie Jin, Peter Stone, and Qiang Liu, In Conference on Neural Information Processing Systems, 2021, Virtual, December 2021.

DEALIO: Data-Efficient Adversarial Learning for Imitation from Observation

2021

Faraz Torabi, Garrett Warnell, and Peter Stone, In Proceedings of The IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, September 2021.

Expected Value of Communication for Planning in Ad Hoc Teamwork

2021

William Macke, Reuth Mirsky, and Peter Stone, In Proceedings of the 35th Conference on Artificial Intelligence (AAAI), February 2021.

From Agile Ground to Aerial Navigation: Learning from Learned Hallucination

2021

Zizhao Wang, Xuesu Xiao, Alexander J Nettekoven, Kadhiravan Umasankar, Anika Singh, Sriram Bommakanti, Ufuk Topcu, and Peter Stone, In Proceedings of the International Conference on Intelligent Robots and Systems (IROS 2021), Prague, Czech Republic, October 2021.

Goal Blending for Responsive Shared Autonomy in a Navigating Vehicle

2021

Yu-Sian Jiang, Garrett Warnell, and Peter Stone, In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI), A Virtual Conference, February 2021.

Grounded Action Transformation for Sim-to-Real Reinforcement Learning

2021

Josiah P. Hanna, Siddharth Desai, Haresh Karnan, Garrett Warnell, and Peter Stone, Special Issue on Reinforcement Learning for Real Life, Machine Learning, 2021 (2021).

Importance Sampling in Reinforcement Learning with an Estimated Behavior Policy

2021

Josiah P. Hanna, Scott Niekum, and Peter Stone, Machine Learning (MLJ), Vol. 110, 6 (2021), pp. 1267–1317.

Incorporating Gaze into Social Navigation

2021

Justin Hart, Reuth Mirsky, Xuesu Xiao, and Peter Stone, In Robotics: Science and Systems Workshop on Social Robot Navigation (RSS), Virtual, July 2021.

Intelligent Disobedience and AI Rebel Agents in Assistive Robotics

2021

Reuth Mirsky and Peter Stone, In ASIMOV workshop as part of the International Conference on Intelligent Robots and Systems (IROS), Virtual, November 2021.

Is the Cerebellum a Model-Based Reinforcement Learning Agent?

2021

Bharath Masetty, Reuth Mirsky, Ashish D. Deshpande, Michael Mauk, and Peter Stone, In Adaptive and Learning Agents Workshop at AAMAS, Virtual, May 2021.

Learning Inverse Kinodynamics for Accurate High-Speed Off-Road Navigation on Unstructured Terrain

2021

Xuesu Xiao, Joydeep Biswas, and Peter Stone, In Opportunities and Challenges with Autonomous Racing Workshop at the 2021 IEEE International Conference on Robotics and Automation (ICRA 2021), Xi'an, China, June 2021.

Learning Inverse Kinodynamics for Accurate High-Speed Off-Road Navigation on Unstructured Terrain

2021

Xuesu Xiao, Joydeep Biswas, and Peter Stone, IEEE Robotics and Automation Letters (2021).

Lucid Dreaming for Experience Replay: Refreshing Past States with the Current Policy

2021

Yunshu Du, Garrett Warnell, Assefaw Gebremedhin, Peter Stone, and Matthew E. Taylor, Neural Computing and Applications (2021).

Machine Learning Methods for Local Motion Planning: A Study of End-to-End vs. Parameter Learning

2021

Zifan Xu, Xuesu Xiao, Garrett Warnell, Anirudh Nair, and Peter Stone, In Proceedings of the 2021 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR 2021), New York, USA, October 2021.

Machine versus Human Attention in Deep Reinforcement Learning Tasks

2021

Sihang Guo, Ruohan Zhang, Bo Liu, Yifeng Zhu, Mary Hayhoe, Dana Ballard, and Peter Stone, In Conference on Neural Information Processing Systems, 2021, Virtual, December 2021.

Multiagent Epidemiologic Inference through Realtime Contact Tracing

2021

Guni Sharon, James Ault, Peter Stone, Varun Kompella, and Roberto Capobianco, In Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS 2021), London, UK, May 2021.

RAIL: A modular framework for Reinforcement-learning-based Adversarial Imitation Learning

2021

Eddy Hudson, Garrett Warnell, and Peter Stone, In Autonomous Robots and Multirobot Systems Workshop at the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021), London, UK, May 2021.

Reasoning about Human Behavior in Ad Hoc Teamwork

2021

Jennifer Suriadinata, William Macke, Reuth Mirsky, and Peter Stone, In Adaptive and Learning Agents Workshop at AAMAS 2021, May 2021.

Recent Advances in Leveraging Human Guidance for Sequential Decision-Making Tasks

2021

Ruohan Zhang, Faraz Torabi, Garrett Warnell, and Peter Stone, Autonomous Agents and Multi-Agent Systems (2021).

Scalable Multiagent Driving Policies For Reducing Traffic Congestion

2021

Jiaxun Cui, William Macke, Harel Yedidsion, Aastha Goyal, Daniel Urieli, and Peter Stone, No other information

Team Orienteering Coverage Planning with Uncertain Reward

2021

Bo Liu, Xuesu Xiao, and Peter Stone, No other information

Temporal-Logic-Based Reward Shaping for Continuing Reinforcement Learning Tasks

2021

Yuqian Jiang, Suda Bharadwaj, Bo Wu, Rishi Shah, Ufuk Topcu, and Peter Stone, In Proceedings of the 35th AAAI Conference on Artificial Intelligence (AAAI 2021), Virtual Conference, February 2021.

The Seeing-Eye Robot Grand Challenge: Rethinking Automated Care

2021

Reuth Mirsky and Peter Stone, In Proceedings of the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021), Online, May 2021.

Toward Agile Maneuvers in Highly Constrained Spaces: Learning from Hallucination

2021

Xuesu Xiao, Bo Liu, Garrett Warnell, and Peter Stone, IEEE Robotics and Automation Letters (2021).

Watch Where You're Going! Gaze and Head Orientation as Predictors for Social Robot Navigation

2021

Blake Holman, Abrar Anwar, Akash Singh, Mauricio Tec, Justin Hart, and Peter Stone, In Proceedings of the International Conference on Robotics and Automation (ICRA 2021), Xi'an, China, May 2021.

A Penny for Your Thoughts: The Value of Communication in Ad Hoc Teamwork

2020

Reuth Mirsky, William Macke, Andy Wang, Harel Yedidsion, and Peter Stone, In Proceedings of the 29th International Joint Conference on Artificial Intelligence, Yokohama, Japan, January 2020.

Agents teaching agents: a survey on inter-agent transfer learning

2020

Felipe Leno Da Silva, Garrett Warnell, Anna Helena Reali Costa, and Peter Stone, Autonomous Agents and Multi-Agent Systems (2020).

An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch

2020

Siddharth Desai, Ishan Durugkar, Haresh Karnan, Garrett Warnell, Josiah Hanna, and Peter Stone, In Proceedings of the 34th International Conference on Neural Information Processing Systems (NeurIPS 2020), Virtual, December 2020.

APPLD: Adaptive Planner Parameter Learning from Demonstration

2020

Xuesu Xiao, Bo Liu, Garrett Warnell, Jonathan Fink, and Peter Stone, No other information

Balancing Individual Preferences and Shared Objectives in Multiagent Reinforcement Learning

2020

Ishan Durugkar, Elad Liebman, and Peter Stone, Proceedings of the 29th International Joint Conference on Artificial Intelligence (IJCAI 2020) (2020).

Benchmarking Metric Ground Navigation

2020

Daniel Perille, Abigail Truong, Xuesu Xiao, and Peter Stone, In Proceedings of the 2020 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR 2020), Virtual Conference, November 2020.

Deep R-Learning for Continual Area Sweeping

2020

Rishi Shah, Yuqian Jiang, Justin Hart, and Peter Stone, Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020) (2020).

Firefly Neural Architecture Descent: a General Approach for Growing Neural Networks

2020

Lemeng Wu, Bo Liu, Peter Stone, and Qiang Liu, In Advances in Neural Information Processing Systems 33 (2020), Vancouver, Canada, December 2020.

Jointly Improving Parsing and Perception for Natural Language Commands through Human-Robot Dialog

2020

Jesse Thomason, Aishwarya Padmakumar, Jivko Sinapov, Nick Walker, Yuqian Jiang, Harel Yedidsion, Justin Hart, Peter Stone, and Raymond J. Mooney, The Journal of Artificial Intelligence Research (JAIR), Vol. 67 (2020), pp. 327–374.

Learning to Improve Multi-Robot Hallway Navigation

2020

Jin-Soo Park, Brian Tsang, Harel Yedidsion, Garrett Warnell, Daehyun Kyoung, and Peter Stone, In Proceedings of the 4th Conference on Robot Learning (CoRL), Virtual Conference, November 2020.

On Sampling Error in Batch Action-Value Prediction Algorithms

2020

Brahma S. Pavse, Josiah P. Hanna, Ishan Durugkar, and Peter Stone, In the Offline Reinforcement Learning Workshop at Neural Information Processing Systems (NeurIPS), Remote (Virtual Conference), December 2020.

Policy Evaluation in Continuous MDPs with Efficient Kernelized Gradient Temporal Difference

2020

Alec Koppel, Garrett Warnell, Ethan Stump, Peter Stone, and Alejandro Ribeiro, No other information

Reducing Sampling Error in Batch Temporal Difference Learning

2020

Brahma Pavse, Ishan Durugkar, Josiah Hanna, and Peter Stone, In Proceedings of the 37th International Conference on Machine Learning (ICML), Vienna, Austria (Virtual Conference), July 2020.

Reinforced Grounded Action Transformation for Sim-to-Real Transfer

2020

Haresh Karnan, Siddharth Desai, Josiah P. Hanna, Garrett Warnell, and Peter Stone, In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020), October 2020.

RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration

2020

Brahma Pavse, Faraz Torabi, Josiah Hanna, Garrett Warnell, and Peter Stone, IEEE Robotics and Automation Letters, presented at International Conference on Intelligent Robots and Systems (IROS) (2020).

Stochastic Grounded Action Transformation for Robot Learning in Simulation

2020

Siddharth Desai, Haresh Karnan, Josiah P. Hanna, Garrett Warnell, and Peter Stone, In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020), Las Vegas, NV, USA, October 2020.

The EMPATHIC Framework for Task Learning from Implicit Human Feedback

2020

Yuchen Cui, Qiping Zhang, Alessandro Allievi, Peter Stone, Scott Niekum, and W. Bradley Knox, In Proceedings of the 4th Conference on Robot Learning (CoRL 2020), Cambridge MA, USA, November 2020.

The PETLON Algorithm to Plan Efficiently for Task-Level-Optimal Navigation

2020

Shih-Yun Lo, Shiqi Zhang, and Peter Stone, The Journal of Artificial Intelligence Research (JAIR), Vol. 67 (2020).

Using Human-Inspired Signals to Disambiguate Navigational Intentions

2020

Justin Hart, Reuth Mirsky, Xuesu Xiao, Stone Tejeda, Bonny Mahajan, Jamin Goo, Kathryn Baldauf, Sydney Owen, and Peter Stone, In Proceedings of the 12th International Conference on Social Robotics (ICSR), Golden, Colorado, November 2020.

Ad hoc Teamwork with Behavior Switching Agents

2019

Manish Ravula, Shani Alkobi, and Peter Stone, International Joint Conference on Artificial Intelligence (IJCAI) (2019).

Building Self-Play Curricula Online by Playing with Expert Agents in Adversarial Games

2019

Felipe Leno Da Silva, Anna Helena Reali Costa, and Peter Stone, In Proceedings of the 8th Brazilian Conference on Intelligent Systems (BRACIS), Salvador, Bahia, Brazil, October 2019.

Generative Adversarial Imitation from Observation

2019

Faraz Torabi, Garrett Warnell, and Peter Stone, Imitation, Intent, and Interaction (I3) Workshop at ICML 2019 (2019).

Imitation Learning from Video by Leveraging Proprioception

2019

Faraz Torabi, Garrett Warnell, and Peter Stone, In Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI), Macao, China, August 2019.

Importance Sampling Policy Evaluation with an Estimated Behavior Policy

2019

Josiah Hanna, Scott Niekum, and Peter Stone, In Proceedings of the 36th International Conference on Machine Learning (ICML), Long Beach, California, U.S.A., June 2019.

Improving Grounded Natural Language Understanding through Human-Robot Dialog

2019

Jesse Thomason, Aishwarya Padmakumar, Jivko Sinapov, Nick Walker, Yuqian Jiang, Harel Yedidsion, Justin Hart, Peter Stone, and Raymond J. Mooney, In IEEE International Conference on Robotics and Automation (ICRA), Montreal, Canada, May 2019.

Learning Curriculum Policies for Reinforcement Learning

2019

Sanmit Narvekar and Peter Stone, In Proceedings of the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Montreal, Canada, May 2019.

Open-World Reasoning for Service Robots

2019

Yuqian Jiang, Nick Walker, Justin Hart, and Peter Stone, In Proceedings of the 29th International Conference on Automated Planning and Scheduling (ICAPS 2019), Berkeley, CA, USA, July 2019.

Optimal Use Of Verbal Instructions For Multi-Robot Human Navigation Guidance

2019

Harel Yedidsion, Jacqueline Deans, Connor Sheehan, Mahathi Chillara, Justin Hart, Peter Stone, and Raymond J. Mooney, In Proceedings of the Eleventh International Conference on Social Robotics, pp. 133–143, 2019. Springer.

Recent Advances in Imitation Learning from Observation

2019

Faraz Torabi, Garrett Warnell, and Peter Stone, Proceedings of the 28th International Joint Conference on Artificial Intelligence (IJCAI) (2019).

Reducing Sampling Error in Policy Gradient Learning

2019

Josiah Hanna and Peter Stone, In Proceedings of the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Montreal, Canada, May 2019.

RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration

2019

Brahma S. Pavse, Faraz Torabi, Josiah Hanna, Garrett Warnell, and Peter Stone, No other information

Sample-efficient Adversarial Imitation Learning from Observation

2019

Faraz Torabi, Garrett Warnell, and Peter Stone, No other information

Selecting Compliant Agents for Opt-in Micro-Tolling

2019

Josiah Hanna, Guni Sharon, Stephen Boyles, and Peter Stone, In Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI), Honolulu, HI, January 2019.

The right music at the right time: adaptive personalized playlists based on sequence modeling

2019

Elad Liebman, Maytal Saar-Tsechansky, and Peter Stone, Management Information Systems Quarterly, Vol. 43, 3 (2019), pp. 765–786.

UT Austin Villa: RoboCup 2018 3D Simulation League Champions

2019

Patrick MacAlpine, Faraz Torabi, Brahma Pavse, John Sigmon, and Peter Stone, In RoboCup 2018: Robot Soccer World Cup XXII, Dirk Holz, Katie Genter, Maarouf Saad, and Oskar von Stryk (Eds.), 2019. Springer.

A Study of Human-Robot Copilot Systems for En-Route Destination Changing

2018

Yu-Sian Jiang, Garrett Warnell, Eduardo Munera, and Peter Stone, In Proceedings of the 27th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN2018), Nanjing, China, August 2018.

Autonomous Agents Modelling Other Agents: A Comprehensive Survey and Open Problems

2018

Stefano Albrecht and Peter Stone, Artificial Intelligence, Vol. 258 (2018), pp. 66–95. Elsevier.

Behavioral Cloning from Observation

2018

Faraz Torabi, Garrett Warnell, and Peter Stone, In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI), Stockholm, Sweden, July 2018.

Deep TAMER: Interactive agent shaping in high-dimensional state spaces

2018

Garrett Warnell, Nicholas Waytowich, Vernon Lawhern, and Peter Stone, In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, New Orleans, Louisiana, USA, February 2018.

Deterministic Implementations for Reproducibility in Deep Reinforcement Learning

2018

Prabhat Nagarajan, Garrett Warnell, and Peter Stone, In 2nd Reproducibility in Machine Learning Workshop at ICML 2018, Stockholm, Sweden, July 2018.

DyETC: Dynamic Electronic Toll Collection for Traffic Congestion Alleviation

2018

Haipeng Chen, Bo An, Guni Sharon, Josiah Hanna, Peter Stone, Chunyan Miao, and Yeng Chai Soh, In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI-18), New Orleans, Louisiana, USA, February 2018.

Guiding Exploratory Behaviors for Multi-Modal Grounding of Linguistic Descriptions

2018

Jesse Thomason, Jivko Sinapov, Raymond Mooney, and Peter Stone, In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), February 2018.

Inferring User Intention using Gaze in Vehicles

2018

Yu-Sian Jiang, Garrett Warnell, and Peter Stone, In The 20th ACM International Conference on Multimodal Interaction (ICMI), Boulder, Colorado, October 2018.

Interaction and Autonomy in RoboCup@Home and Building-Wide Intelligence

2018

Justin Hart, Harel Yedidsion, Yuqian Jiang, Nick Walker, Rishi Shah, Jesse Thomason, Aishwarya Padmakumar, Rolando Fernandez, Jivko Sinapov, Raymond Mooney, and Peter Stone, In Artificial Intelligence (AI) for Human-Robot Interaction (HRI) symposium, AAAI Fall Symposium Series, Arlington, Virginia, October 2018.

Jointly Improving Parsing and Perception for Natural Language Commands through Human-Robot Dialog

2018

Jesse Thomason, Aishwarya Padmakumar, Jivko Sinapov, Nick Walker, Yuqian Jiang, Harel Yedidsion, Justin Hart, Peter Stone, and Raymond J. Mooney, In RSS Workshop on Models and Representations for Natural Human-Robot Communication (MRHRC-18). Robotics: Science and Systems (RSS), June 2018.

Jointly Improving Parsing and Perception for Natural Language Commands through Human-Robot Dialog

2018

Jesse Thomason, Aishwarya Padmakumar, Jivko Sinapov, Nick Walker, Yuqian Jiang, Harel Yedidsion, Justin Hart, Peter Stone, and Raymond J. Mooney, In Late-breaking Track at the SIGDIAL Special Session on Physically Situated Dialogue (RoboDIAL-18), Melbourne, Australia, July 2018.

Learning a Policy for Opportunistic Active Learning

2018

Aishwarya Padmakumar, Peter Stone, and Raymond J. Mooney, In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP-18), Brussels, Belgium, November 2018.

Multi-modal Predicate Identification using Dynamically Learned Robot Controllers

2018

Saeid Amiri, Suhua Wei, Shiqi Zhang, Jivko Sinapov, Jesse Thomason, and Peter Stone, In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI-18), Stockholm, Sweden, July 2018.

Overlapping Layered Learning

2018

Patrick MacAlpine and Peter Stone, Artificial Intelligence, Vol. 254 (2018), pp. 21--43. Elsevier.

Passive Demonstrations of Light-Based Robot Signals for Improved Human Interpretability

2018

Rolando Fernandez, Nathan John, Sean Kirmani, Justin Hart, Jivko Sinapov, and Peter Stone, In Proceedings of the 27th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN), Nanjing, China, August 2018.

PETLON - Planning Efficiently for Task-Level Optimal Navigation

2018

Shih-Yun Lo, Shiqi Zhang, and Peter Stone, In Proceedings of the 17th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Stockholm, Sweden, July 2018.

PRISM: Pose Registration for Integrated Semantic Mapping

2018

Justin W. Hart, Rishi Shah, Sean Kirmani, Nick Walker, Kathryn Baldauf, Nathan John, and Peter Stone, In Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, October 2018.

State Abstraction Synthesis for Discrete Models of Continuous Domains

2018

Jacob Menashe and Peter Stone, In Data Efficient Reinforcement Learning Workshop at AAAI Spring Symposium, Stanford, CA, USA, March 2018.

Towards a Data Efficient Off-Policy Policy Gradient

2018

Josiah Hanna and Peter Stone, In AAAI Spring Symposium on Data Efficient Reinforcement Learning, Palo Alto, CA, March 2018.

Traffic Optimization For a Mixture of Self-interested and Compliant Agents

2018

Guni Sharon, Michael Albert, Tarun Rambha, Stephen Boyles, and Peter Stone, In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI-18), New Orleans, Louisiana, USA, February 2018.

Variety Wins: Soccer-Playing Robots and Infant Walking

2018

Ori Ossmy, Justine E. Hoch, Patrick MacAlpine, Shohan Hasan, Peter Stone, and Karen E. Adolph, Frontiers in Neurorobotics, Vol. 12 (2018), pp. 19.

A Stitch in Time - Autonomous Model Management via Reinforcement Learning

2018

Elad Liebman, Eric Zavesky, and Peter Stone, In Proceedings of the 17th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Stockholm, Sweden, July 2018.

On the Impact of Music on Decision Making in Cooperative Tasks

2018

Elad Liebman, Corey N. White, and Peter Stone, In 19th International Society for Music Information Retrieval Conference (ISMIR), Paris, France, September 2018.

A Protocol for Mixed Autonomous and Human-Operated Vehicles at Intersections

2017

Guni Sharon and Peter Stone, In Proceedings of the 2nd International Workshop on Agent-based modeling of urban systems (ABMUS-2017), Sao Paulo, Brazil, May 2017.

Automated Design of Robust Mechanisms

2017

Michael Albert, Vincent Conitzer, and Peter Stone, In Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI-17), San Francisco, CA, USA, February 2017.

Automatic Curriculum Graph Generation for Reinforcement Learning Agents

2017

Maxwell Svetlik, Matteo Leonetti, Jivko Sinapov, Rishi Shah, Nick Walker, and Peter Stone, In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI), San Francisco, CA, February 2017.

Autonomous Task Sequencing for Customized Curriculum Design in Reinforcement Learning

2017

Sanmit Narvekar, Jivko Sinapov, and Peter Stone, In Proceedings of the 26th International Joint Conference on Artificial Intelligence (IJCAI), Melbourne, Australia, August 2017.

Bootstrapping with Models: Confidence Intervals for Off-Policy Evaluation

2017

Josiah Hanna, Peter Stone, and Scott Niekum, In Proceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Sao Paulo, Brazil, May 2017.

BWIBots: A platform for bridging the gap between AI and human--robot interaction research

2017

Piyush Khandelwal, Shiqi Zhang, Jivko Sinapov, Matteo Leonetti, Jesse Thomason, Fangkai Yang, Ilaria Gori, Maxwell Svetlik, Priyanka Khante, Vladimir Lifschitz, J. K. Aggarwal, Raymond Mooney, and Peter Stone, The International Journal of Robotics Research (2017).

CC-Log: Drastically Reducing Storage Requirements for Robots Using Classification and Compression

2017

Santiago Gonzalez, Vijay Chidambaram, Jivko Sinapov, and Peter Stone, In Proceedings of the 9th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage '17), Santa Clara, CA, July 2017.

Data-Efficient Policy Evaluation Through Behavior Policy Search

2017

Josiah Hanna, Philip Thomas, Peter Stone, and Scott Niekum, In Proceedings of the 34th International Conference on Machine Learning (ICML), Sydney, Australia, August 2017.

Decision mechanisms underlying mood-congruent emotional classification

2017

Corey White, Elad Liebman, and Peter Stone, Cognition and Emotion (2017), pp. 1--10. Taylor and Francis.

Designing Better Playlists with Monte Carlo Tree Search

2017

Elad Liebman, Piyush Khandelwal, Maytal Saar-Tsechansky, and Peter Stone, In Proceedings of the Twenty-Ninth Conference on Innovative Applications of Artificial Intelligence (IAAI-17), San Francisco, USA, February 2017.

Dynamically Constructed (PO)MDPs for Adaptive Robot Planning

2017

Shiqi Zhang, Piyush Khandelwal, and Peter Stone, In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI), San Francisco, CA, February 2017.

Evaluating Ad Hoc Teamwork Performance in Drop-In Player Challenges

2017

Patrick MacAlpine and Peter Stone, In AAMAS Multiagent Interaction without Prior Coordination (MIPC) Workshop, Sao Paulo, Brazil, May 2017.

Evaluating Ad Hoc Teamwork Performance in Drop-In Player Challenges

2017

Patrick MacAlpine and Peter Stone, In Autonomous Agents and Multiagent Systems, AAMAS 2017 Workshops, Best Papers, Gita Sukthankar and Juan A. Rodriguez-Aguilar (Eds.), pp. 168--186 2017. Springer International Publishing.

Fast and Precise Black and White Ball Detection for RoboCup Soccer

2017

Jacob Menashe, Josh Kelle, Katie Genter, Josiah Hanna, Elad Liebman, Sanmit Narvekar, Ruohan Zhang, and Peter Stone, In RoboCup-2017: Robot Soccer World Cup XXI, Nagoya, Japan, July 2017.

Grounded Action Transformation for Robot Learning in Simulation

2017

Josiah Hanna and Peter Stone, In Proceedings of the 31st AAAI Conference on Artificial Intelligence (AAAI), San Francisco, CA, February 2017.

Integrated Commonsense Reasoning and Probabilistic Planning

2017

Shiqi Zhang and Peter Stone, In Proceedings of 2017 ICAPS Workshop on Planning and Robotics, Pittsburgh, Pennsylvania, June 2017.

Iterative Human-Aware Mobile Robot Navigation

2017

Shih-Yun Lo, Benito Fernandez, and Peter Stone, In Proceedings of the Human-Centered Robotics workshop of the 13th International Conference on Robotics: Science and Systems (RSS), Cambridge, MA, USA, July 2017.

Leveraging Commonsense Reasoning and Multimodal Perception for Robot Spoken Dialog Systems

2017

Dongcai Lu, Shiqi Zhang, Peter Stone, and Xiaoping Chen, In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, Canada, September 2017.

Mechanism Design with Unknown Correlated Distributions: Can We Learn Optimal Mechanisms?

2017

Michael Albert, Vincent Conitzer, and Peter Stone, In Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems (AAMAS-17), Sao Paulo, Brazil, May 2017.

Multi-Robot Human Guidance: Human Experiments and Multiple Concurrent Requests

2017

Piyush Khandelwal and Peter Stone, In International Conference on Autonomous Agents and Multiagent Systems (AAMAS), São Paulo, Brazil, May 2017.

Multirobot Symbolic Planning under Temporal Uncertainty

2017

Shiqi Zhang, Yuqian Jiang, Guni Sharon, and Peter Stone, In Proceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Sao Paulo, Brazil, May 2017.

Network-wide adaptive tolling for connected and automated vehicles

2017

Guni Sharon, Michael W. Levin, Josiah P. Hanna, Tarun Rambha, Stephen D. Boyles, and Peter Stone, Transportation Research Part C, Vol. 84 (2017), pp. 142--157.

Opportunistic Active Learning for Grounding Natural Language Descriptions

2017

Jesse Thomason, Aishwarya Padmakumar, Jivko Sinapov, Justin Hart, Peter Stone, and Raymond J. Mooney, In Proceedings of the 1st Annual Conference on Robot Learning (CoRL-17), Sergey Levine and Vincent Vanhoucke and Ken Goldberg (Eds.), pp. 67--76, Mountain View, California, November 2017. PMLR.

Real-time Adaptive Tolling Scheme for Optimized Social Welfare in Traffic Networks

2017

Guni Sharon, Josiah P. Hanna, Tarun Rambha, Michael W. Levin, Michael Albert, Stephen D. Boyles, and Peter Stone, In Proceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-2017), Sao Paulo, Brazil, May 2017.

Reasoning about Hypothetical Agent Behaviours and their Parameters

2017

Stefano Albrecht and Peter Stone, In Proceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-17), Sao Paulo, Brazil, May 2017.

Robot Behavioral Exploration and Multimodal Perception using POMDPs

2017

Shiqi Zhang, Jivko Sinapov, Suhua Wei, and Peter Stone, In Proceedings of 2017 AAAI Spring Symposium on Interactive Multi-Sensory Perception for Embodied Agents, Stanford, CA, March 2017.

TD Learning with Constrained Gradients

2017

Ishan Durugkar and Peter Stone, In Proceedings of the Deep Reinforcement Learning Symposium, NIPS 2017, Long Beach, CA, USA, December 2017.

UT Austin Villa: RoboCup 2017 3D Simulation League Competition and Technical Challenges Champions

2017

Patrick MacAlpine and Peter Stone, In RoboCup 2017: Robot Soccer World Cup XXI, Claude Sammut and Oliver Obst and Flavio Tonidandel and Hidehisa Akiyama (Eds.) 2017. Springer.

A Study of Layered Learning Strategies Applied to Individual Behaviors in Robot Soccer

2016

David L. Leottau, Javier Ruiz-del-Solar, Patrick MacAlpine, and Peter Stone, In RoboCup-2015: Robot Soccer World Cup XIX, Luis Almeida and Jianmin Ji and Gerald Steinbauer and Sean Luke (Eds.), Berlin, Germany 2016. Springer Verlag.

A synthesis of automated planning and reinforcement learning for efficient, robust decision-making

2016

Matteo Leonetti, Luca Iocchi, and Peter Stone, Artificial Intelligence, Vol. 241 (2016), pp. 103--130.

Ad Hoc Teamwork Behaviors for Influencing a Flock

2016

Katie Genter and Peter Stone, Acta Polytechnica (2016).

Adaptation of Surrogate Tasks for Bipedal Walk Optimization

2016

Patrick MacAlpine, Elad Liebman, and Peter Stone, In GECCO Surrogate-Assisted Evolutionary Optimisation (SAEOpt) Workshop, Denver, Colorado, USA, July 2016.

Adding Influencing Agents to a Flock

2016

Katie Genter and Peter Stone, In Proceedings of the 15th International Conference on Autonomous Agents and Multiagent Systems (AAMAS-16), Singapore, Singapore, May 2016.

An MDP-Based Winning Approach to Autonomous Power Trading: Formalization and Empirical Analysis

2016

Daniel Urieli and Peter Stone, In Proceedings of the 15th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 2016.

Autonomous Electricity Trading using Time-Of-Use Tariffs in a Competitive Market

2016

Daniel Urieli and Peter Stone, In Proceedings of the 30th Conference on Artificial Intelligence (AAAI 2016), Phoenix, AZ, USA, February 2016.

Deep Imitation Learning for Parameterized Action Spaces

2016

Matthew Hausknecht, Yilun Chen, and Peter Stone, In AAMAS Adaptive Learning Agents (ALA) Workshop, Singapore, May 2016.

Deep Reinforcement Learning in Parameterized Action Space

2016

Matthew Hausknecht and Peter Stone, In Proceedings of the International Conference on Learning Representations (ICLR), San Juan, Puerto Rico, May 2016.

Delta-Tolling: Adaptive Tolling for Optimizing Traffic Throughput

2016

Guni Sharon, Josiah Hanna, Tarun Rambha, Michael Albert, Peter Stone, and Stephen D. Boyles, In Proceedings of the 9th International Workshop on Agents in Traffic and Transportation (ATT 2016), New York, NY, USA, July 2016.

Dynamically Constructed (PO)MDPs for Adaptive Robot Planning

2016

Shiqi Zhang, Piyush Khandelwal, and Peter Stone, In IJCAI'16 Workshop on Autonomous Mobile Service Robots, New York City, USA, July 2016.

Grounded Semantic Networks for Learning Shared Communication Protocols

2016

Matthew Hausknecht and Peter Stone, In Deep Reinforcement Learning, NIPS Workshop, Barcelona, Spain, December 2016.

Half Field Offense: An Environment for Multiagent Learning and Ad Hoc Teamwork

2016

Matthew Hausknecht, Prannoy Mupparaju, Sandeep Subramanian, Shivaram Kalyanakrishnan, and Peter Stone, In AAMAS Adaptive Learning Agents (ALA) Workshop, Singapore, May 2016.

Learning Multi-Modal Grounded Linguistic Semantics by Playing "I Spy"

2016

Jesse Thomason, Jivko Sinapov, Maxwell Svetlik, Peter Stone, and Raymond J. Mooney, In Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI-16), pp. 3477--3483, New York City 2016.

Learning to Order Objects Using Haptic and Proprioceptive Exploratory Behaviors

2016

Jivko Sinapov, Priyanka Khante, Maxwell Svetlik, and Peter Stone, In Proceedings of the 25th International Joint Conference on Artificial Intelligence (IJCAI), New York City, USA, July 2016.

Machine Learning Capabilities of a Simulated Cerebellum

2016

Matthew Hausknecht, Wen-Ke Li, Michael Mauk, and Peter Stone, IEEE Transactions on Neural Networks and Learning Systems (2016).

Making Friends on the Fly: Cooperating with New Teammates

2016

Samuel Barrett, Avi Rosenfeld, Sarit Kraus, and Peter Stone, Artificial Intelligence (2016).

Minimum Cost Matching for Autonomous Carsharing

2016

Josiah P. Hanna, Michael Albert, Donna Chen, and Peter Stone, In Proceedings of the 9th IFAC Symposium on Intelligent Autonomous Vehicles (IAV 2016), Leipzig, Germany, June 2016.

Multirobot Symbolic Planning under Temporal Uncertainty

2016

Shiqi Zhang, Yuqian Jiang, Guni Sharon, and Peter Stone, In IJCAI'16 Workshop on Autonomous Mobile Service Robots, New York City, USA, July 2016.

On the Analysis of Complex Backup Strategies in Monte Carlo Tree Search

2016

Piyush Khandelwal, Elad Liebman, Scott Niekum, and Peter Stone, In Proceedings of The 33rd International Conference on Machine Learning, pp. 1319--1328, New York City, NY, USA, June 2016.

On-Policy vs. Off-Policy Updates for Deep Reinforcement Learning

2016

Matthew Hausknecht and Peter Stone, In Deep Reinforcement Learning: Frontiers and Challenges, IJCAI Workshop, New York, July 2016.

Prioritized Role Assignment for Marking

2016

Patrick MacAlpine and Peter Stone, In RoboCup 2016: Robot Soccer World Cup XX, Sven Behnke and Daniel D. Lee and Sanem Sariel and Raymond Sheh (Eds.), Berlin 2016. Springer Verlag.

Robot Scavenger Hunt: A Standardized Framework for Evaluating Intelligent Mobile Robots

2016

Shiqi Zhang, Dongcai Lu, Xiaoping Chen, and Peter Stone, In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), New York City, USA, July 2016.

Robust Automated Mechanism Design

2016

Michael Albert, Vincent Conitzer, and Peter Stone, In Proceedings of the EC 2016 2nd Algorithmic Game Theory and Data Science Workshop, Netherlands, July 2016.

Source Task Creation for Curriculum Learning

2016

Sanmit Narvekar, Jivko Sinapov, Matteo Leonetti, and Peter Stone, In Proceedings of the 15th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2016), Singapore, May 2016.

Special Issue on Multiagent Interaction without Prior Coordination: Guest Editorial

2016

Stefano Albrecht, Somchaya Liemhetcharat, and Peter Stone, Autonomous Agents and Multi-Agent Systems (2016).

State Aggregation through Reasoning in Answer Set Programming

2016

Ginevra Gaudioso, Matteo Leonetti, and Peter Stone, In Proceedings of the IJCAI Workshop on Autonomous Mobile Service Robots (WSR 16), New York City, NY, USA, July 2016.

Three Years of the RoboCup Standard Platform League Drop-in Player Competition: Creating and Maintaining a Large Scale Ad Hoc Teamwork Robotics Competition

2016

Katie Genter, Tim Laue, and Peter Stone, Journal of Autonomous Agents and Multi-Agent Systems (JAAMAS) (2016), pp. 1--31. Springer.

Bin-Based Estimation of the Amount of Effort for Embedded Software Development Projects with Support Vector Machines

2016

Kazunori Iwata, Elad Liebman, Peter Stone, Toyoshiro Nakashima, Yoshiyuki Anan, and Naohiro Ishii, In Computer and Information Science, Roger Lee (Ed.), Berlin 2016. Springer Verlag.

Impact of Music on Decision Making in Quantitative Tasks

2016

Elad Liebman, Peter Stone, and Corey N. White, In 17th International Society for Music Information Retrieval Conference (ISMIR), NYC, USA, August 2016.

UT Austin Villa RoboCup 3D Simulation Base Code Release

2016

Patrick MacAlpine and Peter Stone, In RoboCup 2016: Robot Soccer World Cup XX, Sven Behnke and Daniel D. Lee and Sanem Sariel and Raymond Sheh (Eds.), Berlin 2016. Springer Verlag.

UT Austin Villa: RoboCup 2015 3D Simulation League Competition and Technical Challenges Champions

2016

Patrick MacAlpine, Josiah Hanna, Jason Liang, and Peter Stone, In RoboCup-2015: Robot Soccer World Cup XIX, Luis Almeida and Jianmin Ji and Gerald Steinbauer and Sean Luke (Eds.), Berlin, Germany 2016. Springer Verlag.

UT Austin Villa: RoboCup 2016 3D Simulation League Competition and Technical Challenges Champions

2016

Patrick MacAlpine and Peter Stone, In RoboCup 2016: Robot Soccer World Cup XX, Sven Behnke and Daniel D. Lee and Sanem Sariel and Raymond Sheh (Eds.) 2016. Springer.

Autonomous Intersection Management for Semi-Autonomous Vehicles

2015

Tsz-Chiu Au, Shun Zhang, and Peter Stone, In Handbook of Transportation, May 2015.

Autonomous Trading in Modern Electricity Markets

2015

Daniel Urieli, PhD Thesis, Department of Computer Sciences, The University of Texas at Austin. Code and binaries available at: http://www.cs.utexas.edu/~urieli/thesis.

Benchmarking Robot Cooperation without Pre-Coordination in the RoboCup Standard Platform League Drop-In Player Competition

2015

Katie Genter, Tim Laue, and Peter Stone, In Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS-15), Hamburg, Germany, September 2015.

Cooperating with Unknown Teammates in Complex Domains: A Robot Soccer Case Study of Ad Hoc Teamwork

2015

Samuel Barrett and Peter Stone, In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, January 2015.

CORPP: Commonsense Reasoning and Probabilistic Planning, as Applied to Dialog with a Mobile Robot

2015

Shiqi Zhang and Peter Stone, In Proceedings of the 29th Conference on Artificial Intelligence (AAAI), January 2015.

Deep Recurrent Q-Learning for Partially Observable MDPs

2015

Matthew Hausknecht and Peter Stone, In AAAI Fall Symposium on Sequential Decision Making for Intelligent Agents (AAAI-SDMIA15), Arlington, Virginia, USA, November 2015.

Determining Placements of Influencing Agents in a Flock

2015

Katie Genter, Shun Zhang, and Peter Stone, In Proceedings of the 2015 International Conference on Autonomous Agents and Multiagent Systems (AAMAS-15), Istanbul, Turkey, May 2015.

DJ-MC: A Reinforcement-Learning Agent for Music Playlist Recommendation

2015

Elad Liebman, Maytal Saar-Tsechansky, and Peter Stone, In Proceedings of the 14th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Istanbul, Turkey, May 2015.

Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance

2015

W. Bradley Knox and Peter Stone, Artificial Intelligence, Vol. 225 (2015).

How Music Alters Decision Making: Impact of Music Stimuli on Emotional Classification

2015

Elad Liebman, Peter Stone, and Corey N. White, In 16th International Society for Music Information Retrieval Conference (ISMIR), Malaga, Spain, October 2015.

Intrinsically motivated model learning for developing curious robots

2015

Todd Hester and Peter Stone, Artificial Intelligence (2015). Elsevier.

Keyframe Sampling, Optimization, and Behavior Integration: Towards Long-Distance Kicking in the RoboCup 3D Simulation League

2015

Mike Depinet, Patrick MacAlpine, and Peter Stone, In RoboCup-2014: Robot Soccer World Cup XVIII, H. Levent Akin and Reinaldo A. C. Bianchi and Subramanian Ramamoorthy and Komei Sugiura (Eds.) 2015. Springer Verlag.

Leading the Way: An Efficient Multi-robot Guidance System

2015

Piyush Khandelwal, Samuel Barrett, and Peter Stone, In International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Istanbul, Turkey, May 2015.

Learning Inter-Task Transferability in the Absence of Target Task Samples

2015

Jivko Sinapov, Sanmit Narvekar, Matteo Leonetti, and Peter Stone, In Proceedings of the International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Istanbul, Turkey, May 2015.

Learning to Interpret Natural Language Commands through Human-Robot Dialog

2015

Jesse Thomason, Shiqi Zhang, Raymond Mooney, and Peter Stone, In Proceedings of the 2015 International Joint Conference on Artificial Intelligence (IJCAI), pp. 1923--1929, Buenos Aires, Argentina, July 2015.

Mobile Robot Planning using Action Language BC with an Abstraction Hierarchy

2015

Shiqi Zhang, Fangkai Yang, Piyush Khandelwal, and Peter Stone, In Proceedings of the 13th International Conference on Logic Programming and Non-monotonic Reasoning (LPNMR), Lexington, KY, USA, September 2015.

Monte Carlo Hierarchical Model Learning

2015

Jacob Menashe and Peter Stone, In Proceedings of the 14th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), Istanbul, Turkey, May 2015.

Representative Selection in Nonmetric Datasets

2015

Elad Liebman, Benny Chor, and Peter Stone, Applied Artificial Intelligence, Vol. 29, 8 (2015), pp. 807--838.

Robot-centric Activity Recognition "in the Wild"

2015

Ilaria Gori, Jivko Sinapov, Priyanka Khante, Peter Stone, and J. K. Aggarwal, In Proceedings of the International Conference on Social Robotics (ICSR), October 2015.

The Impact of Determinism on Learning Atari 2600 Games

2015

Matthew Hausknecht and Peter Stone, In AAAI Workshop on Learning for General Competency in Video Games, Austin, Texas, USA, January 2015.

SCRAM: Scalable Collision-avoiding Role Assignment with Minimal-makespan for Formational Positioning

2015

Patrick MacAlpine, Eric Price, and Peter Stone, In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI), January 2015.

UT Austin Villa 2014: RoboCup 3D Simulation League Champion via Overlapping Layered Learning

2015

Patrick MacAlpine, Mike Depinet, and Peter Stone, In Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence (AAAI), January 2015.

UT Austin Villa: RoboCup 2014 3D Simulation League Competition and Technical Challenge Champions

2015

Patrick MacAlpine, Mike Depinet, Jason Liang, and Peter Stone, In RoboCup-2014: Robot Soccer World Cup XVIII, H. Levent Akin and Reinaldo A. C. Bianchi and Subramanian Ramamoorthy and Komei Sugiura (Eds.) 2015. Springer Verlag.

Communicating with Unknown Teammates

2014

Samuel Barrett, Noa Agmon, Noam Hazon, Sarit Kraus, and Peter Stone, In Proceedings of the Twenty-First European Conference on Artificial Intelligence, August 2014.

Cooperating with Unknown Teammates in Robot Soccer

2014

Samuel Barrett and Peter Stone, In AAMAS Autonomous Robots and Multirobot Systems Workshop (ARMS 2014), May 2014.

Cooperating with Unknown Teammates in Robot Soccer

2014

Samuel Barrett and Peter Stone, In AAAI Workshop on Multiagent Interaction without Prior Coordination (MIPC 2014), July 2014.

Influencing a Flock via Ad Hoc Teamwork

2014

Katie Genter and Peter Stone, In Proceedings of the Ninth International Conference on Swarm Intelligence (ANTS 2014), September 2014.

Mobile Robot Planning using Action Language BC with Hierarchical Domain Abstractions

2014

Shiqi Zhang, Fangkai Yang, Piyush Khandelwal, and Peter Stone, In The 7th Workshop on Answer Set Programming and Other Computing Paradigms (ASPOCP), July 2014.

Modeling Uncertainty in Leading Ad Hoc Teams

2014

Noa Agmon, Samuel Barrett, and Peter Stone, In Proc. of 13th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS), May 2014.

Multi-robot Human Guidance using Topological Graphs

2014

Piyush Khandelwal and Peter Stone, In AAAI Spring 2014 Symposium on Qualitative Representations for Robots (AAAI-SSS), March 2014.

Orienting a Flock via Ad Hoc Teamwork

2014

Katie Genter and Peter Stone, In Proceedings of the 13th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2014), May 2014.

Planning in Action Language BC while Learning Action Costs for Mobile Robots

2014

Piyush Khandelwal, Fangkai Yang, Matteo Leonetti, Vladimir Lifschitz, and Peter Stone, In International Conference on Automated Planning and Scheduling (ICAPS), June 2014.

Planning in Answer Set Programming while Learning Action Costs for Mobile Robots

2014

Fangkai Yang, Piyush Khandelwal, Matteo Leonetti, and Peter Stone, No other information

TacTex'13: A Champion Adaptive Power Trading Agent

2014

Daniel Urieli and Peter Stone, In Proceedings of the Twenty-Eighth Conference on Artificial Intelligence (AAAI 2014), July 2014.

The RoboCup 2013 Drop-In Player Challenges: A Testbed for Ad Hoc Teamwork

2014

Patrick MacAlpine, Katie Genter, Samuel Barrett, and Peter Stone, In Proc. of 13th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS), May 2014. Accompanying videos at

SCRAM: Scalable Collision-avoiding Role Assignment with Minimal-makespan for Formational Positioning

2014

Patrick MacAlpine, Eric Price, and Peter Stone, In Proc. of 13th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS), May 2014. Accompanying videos at

Projects

A Neuroevolution Approach to General Atari Game Playing	2013 - Present
TEXPLORE: Real-Time Sample Efficient Reinforcement Learning	2009 - Present
Teaching an Agent Manually via Evaluative Reinforcement (TAMER)	2008 - Present
Autonomous Intersection Management (AIM)	2004 - Present
The UT Austin Villa Robot Soccer team	2003 - Present
TacTex, an autonomous bidding agent for the Trading Agent Competition	2003 - Present
Autonomic computing	2003 - Present

Demos

A Neuroevolution Approach to General Atari Game Playing	Matthew Hausknecht	2013
TEXPLORE: Real-Time Sample Efficient Reinforcement Learning	Todd Hester	2012
Teaching an Agent Manually via Evaluative Reinforcement (TAMER)	W. Bradley Knox and Peter Stone	2009
SPL Robocup Soccer		2008
Simulated RoboCup Soccer		2004
Autonomous Intersection Management (AIM)

Software/Data

TacTex AA Binary	The binary version of our 2009 TacTex AA agent, along with many other teams' agents, are available at the ...	2009

TacTex SCM Binaries	Binary versions of all TacTex SCM (2005-2008) agents, along with many other teams' agents, are available at the ...	2008

TacTex SCM Starter Agent	The purpose of this agent is to serve as a starting point for new participants in the TAC SCM competition. The agent is ...	2006

Keepaway player framework source code, version 0.6	The Keepaway player framework is an implementation of all the low- and mid-level keepaway behaviors described in the pub...

Michael Albert	Postdoctoral Alumni	malbert [at] cs duke edu
Stefano Albrecht	Postdoctoral Alumni	svalb [at] cs utexas edu
Tsz-Chiu Au	Postdoctoral Alumni	chiu [at] cs utexas edu
Bikramjit Banerjee	Postdoctoral Alumni	bikramjitbanerjee [at] yahoo com
Patrick Beeson	Postdoctoral Alumni	pbeeson [at] traclabs com
Ian Fasel	Postdoctoral Alumni	ianfasel [at] cs utexas edu
Todd Hester	Postdoctoral Alumni	todd [at] cs utexas edu
Tobias Jung	Postdoctoral Alumni	tjung [at] ulg ac be
Matteo Leonetti	Postdoctoral Alumni	matteo [at] cs utexas edu
Yaxin Liu	Postdoctoral Alumni
Jivko Sinapov	Postdoctoral Alumni	jsinapov [at] cs utexas edu
Shiqi Zhang	Postdoctoral Alumni	szhang [at] cs utexas edu
Samuel Barrett	Ph.D. Alumni	sbarrett [at] cs utexas edu
Doran Chakraborty	Ph.D. Alumni	chakrado [at] cs utexas edu
Kurt Dresner	Ph.D. Alumni	kurt [at] dresner name
Katie Genter	Ph.D. Alumni	katie [at] cs utexas edu
Matthew Hausknecht	Ph.D. Alumni	mhauskn [at] cs utexas edu
Todd Hester	Ph.D. Alumni	todd [at] cs utexas edu
Nicholas Jong	Ph.D. Alumni	nickjong [at] me com
Shivaram Kalyanakrishnan	Ph.D. Alumni	shivaram [at] cs utexas edu
Piyush Khandelwal	Ph.D. Alumni	piyushk [at] cs utexas edu
W. Bradley Knox	Ph.D. Alumni	bradknox [at] mit edu
Gregory Kuhlmann	Ph.D. Alumni	kuhlmann [at] cs utexas edu
Juhyun Lee	Ph.D. Alumni	impjdi [at] gmail com
David Pardoe	Ph.D. Alumni	dpardoe [at] cs utexas edu
Mohan Sridharan	Ph.D. Alumni	mhnsrdhrn [at] gmail com
Daniel Stronger	Ph.D. Alumni	dan stronger [at] gmail com
Matthew Taylor	Ph.D. Alumni	taylorm [at] eecs wsu edu
Daniel Urieli	Ph.D. Alumni	urieli [at] cs utexas edu
Shimon Whiteson	Ph.D. Alumni	s a whiteson [at] uva nl
Adam Setapen	Masters Alumni	asetapen [at] cs utexas edu
Nick Walker	Undergraduate Alumni	nswalker [at] cs uw edu