Imperial College London

Professor Aldo Faisal

Faculty of EngineeringDepartment of Bioengineering

Professor of AI & Neuroscience
 
 
 
//

Contact

 

+44 (0)20 7594 6373a.faisal Website

 
 
//

Assistant

 

Miss Teresa Ng +44 (0)20 7594 8300

 
//

Location

 

4.08Royal School of MinesSouth Kensington Campus

//

Summary

 

Publications

Citation

BibTex format

@inproceedings{Shafti:2021:10.1109/IROS45743.2020.9341473,
author = {Shafti, SA and Tjomsland, J and Dudley, W and Faisal, A},
doi = {10.1109/IROS45743.2020.9341473},
pages = {11161--11166},
publisher = {IEEE},
title = {Real-world human-robot collaborative reinforcement learning},
url = {http://dx.doi.org/10.1109/IROS45743.2020.9341473},
year = {2021}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB - The intuitive collaboration of humans and intel-ligent robots (embodied AI) in the real-world is an essentialobjective for many desirable applications of robotics. Whilstthere is much research regarding explicit communication, wefocus on how humans and robots interact implicitly, on motoradaptation level. We present a real-world setup of a human-robot collaborative maze game, designed to be non-trivial andonly solvable through collaboration, by limiting the actions torotations of two orthogonal axes, and assigning each axes to oneplayer. This results in neither the human nor the agent beingable to solve the game on their own. We use deep reinforcementlearning for the control of the robotic agent, and achieve resultswithin 30 minutes of real-world play, without any type ofpre-training. We then use this setup to perform systematicexperiments on human/agent behaviour and adaptation whenco-learning a policy for the collaborative game. We presentresults on how co-policy learning occurs over time between thehuman and the robotic agent resulting in each participant’sagent serving as a representation of how they would play thegame. This allows us to relate a person’s success when playingwith different agents than their own, by comparing the policyof the agent with that of their own agent.
AU - Shafti,SA
AU - Tjomsland,J
AU - Dudley,W
AU - Faisal,A
DO - 10.1109/IROS45743.2020.9341473
EP - 11166
PB - IEEE
PY - 2021///
SN - 2153-0866
SP - 11161
TI - Real-world human-robot collaborative reinforcement learning
UR - http://dx.doi.org/10.1109/IROS45743.2020.9341473
UR - https://ieeexplore.ieee.org/abstract/document/9341473
UR - http://hdl.handle.net/10044/1/81314
ER -