Below is a list of all relevant publications authored by Robotics Forum members.

Citation

BibTex format

@inproceedings{Wang:2019,
author = {Wang, R and Ciliberto, C and Amadori, P and Demiris, Y},
publisher = {Proceedings of International Conference on Machine Learning (ICML-2019)},
title = {Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation},
url = {http://proceedings.mlr.press/v97/wang19d/wang19d.pdf},
year = {2019}
}

RIS format (EndNote, RefMan)

TY  - CPAPER
AB - We consider the problem of imitation learning from a finite set of experttrajectories, without access to reinforcement signals. The classical approachof extracting the expert's reward function via inverse reinforcement learning,followed by reinforcement learning is indirect and may be computationallyexpensive. Recent generative adversarial methods based on matching the policydistribution between the expert and the agent could be unstable duringtraining. We propose a new framework for imitation learning by estimating thesupport of the expert policy to compute a fixed reward function, which allowsus to re-frame imitation learning within the standard reinforcement learningsetting. We demonstrate the efficacy of our reward function on both discreteand continuous domains, achieving comparable or better performance than thestate of the art under different reinforcement learning algorithms.
AU - Wang,R
AU - Ciliberto,C
AU - Amadori,P
AU - Demiris,Y
PB - Proceedings of International Conference on Machine Learning (ICML-2019)
PY - 2019///
TI - Random Expert Distillation: Imitation Learning via Expert Policy Support Estimation
UR - http://proceedings.mlr.press/v97/wang19d/wang19d.pdf
UR - https://icml.cc/
UR - http://hdl.handle.net/10044/1/72668
ER -