Download a PDF with the full list of Robot-Intelligence-Lab-Publications.pdf

A comprehensive list can also be found at Google Scholar, or by searching for the publications of author Kormushev, Petar.

Citation

BibTex format

@article{Kormushev:2008,
author = {Kormushev, P and Nomoto, K and Dong, F and Hirota, K},
journal = {Cybernetics and Information Technologies},
pages = {12--24},
title = {Time manipulation technique for speeding up reinforcement learning in simulations},
url = {http://www.cit.iit.bas.bg/CIT_08/CIT_81.html},
volume = {8},
year = {2008}
}

RIS format (EndNote, RefMan)

TY  - JOUR
AB - A technique for speeding up reinforcement learning algorithms by usingtime manipulation is proposed. It is applicable to failure-avoidance controlproblems running in a computer simulation. Turning the time of the simulationbackwards on failure events is shown to speed up the learning by 260% andimprove the state space exploration by 12% on the cart-pole balancing task,compared to the conventional Q-learning and Actor-Critic algorithms.
AU - Kormushev,P
AU - Nomoto,K
AU - Dong,F
AU - Hirota,K
EP - 24
PY - 2008///
SN - 1311-9702
SP - 12
TI - Time manipulation technique for speeding up reinforcement learning in simulations
T2 - Cybernetics and Information Technologies
UR - http://www.cit.iit.bas.bg/CIT_08/CIT_81.html
UR - http://hdl.handle.net/10044/1/26083
VL - 8
ER -