I am a second-year M.S. student at the Computer Science Department of the Univeristy of Texas at Austin advised by Peter Stone and Scott Niekum. Currently, I am working in the Learning Agents Research Group on interactive machine learning and human-robot interaction, and I am specifically interested in reinforcement learning from implicit human feedback, as well as efficient robot learning of tasks and social rules via intelligent interactions with humans. I am also working in parallel with Reid Simmons at Carnegie Mellon University on human emotion recognition and utilization for a game-playing robot to alter its mood and behaviors.
The EMPATHIC Framework for Task Learning from Implicit Human Feedback
Qiping Zhang*, Yuchen Cui*, Alessandro Allievi, Peter Stone, Scott Niekum, W. Bradley Knox
Conference on Robot Learning (CoRL), 2020
A demo also accepted to AAAI-21 Demonstrations Program
(* indicates equal contribution)
Paper
Project page
Video
3D Backscatter Localization for Fine-Grained Robotics
Zhihong Luo, Qiping Zhang, Yunfei Ma, Manish Singh, Fadel Adib
USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2019
Paper
Project page
Video
COMP2396: Object-oriented Programming and Java:
Student Teaching Assistant, The University of Hong Kong, Fall 2017
ENGG1111: Computer Programming and Applications:
Student Teaching Assistant, The University of Hong Kong, Spring 2017
Reward Sharing for Multi-Agent RL
CS 394R: Reinforcement Learning, The University of Texas at Austin
Course project with Jordan Schneider and William Macke. [Github]
Investigated the effects of reward sharing in traffic light control domains based on Flow.
Inference-Based AI Cognitive System
Undergraduate research under the supervision of Ken Forbus. [Github]
Built out Companion architecture with Microsoft \psi framework to support interactive dialogues and multi-modal Q&A tasks.
Semantic Video Segmentation
Final year project under the supervision of Kenneth Wong.
Quantum Communication in Superposition of Causal Orders
Undergraduate research under the supervision of Giulio Chiribella.