Publications

Citation

BibTeX format

@article{Kormushev:2008,
author = {Kormushev, P and Nomoto, K and Dong, F and Hirota, K},
journal = {Cybernetics and Information Technologies},
pages = {12--24},
title = {Time manipulation technique for speeding up reinforcement learning in simulations},
url = {http://www.cit.iit.bas.bg/CIT_08/CIT_81.html},
volume = {8},
year = {2008}
}


RIS format (EndNote, RefMan)

TY  - JOUR
AB  - A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the time of the simulation backwards on failure events is shown to speed up the learning by 260% and improve the state space exploration by 12% on the cart-pole balancing task, compared to the conventional Q-learning and Actor-Critic algorithms.
AU  - Kormushev,P
AU  - Nomoto,K
AU  - Dong,F
AU  - Hirota,K
EP  - 24
PY  - 2008///
SN  - 1311-9702
SP  - 12
TI  - Time manipulation technique for speeding up reinforcement learning in simulations
T2  - Cybernetics and Information Technologies
UR  - http://www.cit.iit.bas.bg/CIT_08/CIT_81.html
UR  - http://hdl.handle.net/10044/1/26083
VL  - 8
ER  -


Imperial College London
