Publications

Citation

BibTex format

@inproceedings{Wang:2019,
author = {Wang, R and Ciliberto, C and Amadori, P and Demiris, Y},
publisher = {Proceedings of International Conference on Machine Learning (ICML-2019)},
title = {Random Expert Distillation: Imitation Learning via Expert Policy Support  Estimation},
url = {http://proceedings.mlr.press/v97/wang19d/wang19d.pdf},
year = {2019}
}

Download

RIS format (EndNote, RefMan)

TY  - CPAPER
AB  - We consider the problem of imitation learning from a finite set of experttrajectories, without access to reinforcement signals. The classical approachof extracting the expert's reward function via inverse reinforcement learning,followed by reinforcement learning is indirect and may be computationallyexpensive. Recent generative adversarial methods based on matching the policydistribution between the expert and the agent could be unstable duringtraining. We propose a new framework for imitation learning by estimating thesupport of the expert policy to compute a fixed reward function, which allowsus to re-frame imitation learning within the standard reinforcement learningsetting. We demonstrate the efficacy of our reward function on both discreteand continuous domains, achieving comparable or better performance than thestate of the art under different reinforcement learning algorithms.
AU  - Wang,R
AU  - Ciliberto,C
AU  - Amadori,P
AU  - Demiris,Y
PB  - Proceedings of International Conference on Machine Learning (ICML-2019)
PY  - 2019///
TI  - Random Expert Distillation: Imitation Learning via Expert Policy Support  Estimation
UR  - http://proceedings.mlr.press/v97/wang19d/wang19d.pdf
UR  - https://icml.cc/
UR  - http://hdl.handle.net/10044/1/72668
ER  -

Download

Imperial College London

Latest News

Robotics Forum

Publications

Citation

BibTex format

RIS format (EndNote, RefMan)