TY - JOUR AB - A technique for speeding up reinforcement learning algorithms by usingtime manipulation is proposed. It is applicable to failure-avoidance controlproblems running in a computer simulation. Turning the time of the simulationbackwards on failure events is shown to speed up the learning by 260% andimprove the state space exploration by 12% on the cart-pole balancing task,compared to the conventional Q-learning and Actor-Critic algorithms. AU - Kormushev,P AU - Nomoto,K AU - Dong,F AU - Hirota,K EP - 24 PY - 2008/// SN - 1311-9702 SP - 12 TI - Time manipulation technique for speeding up reinforcement learning in simulations T2 - Cybernetics and Information Technologies UR - http://www.cit.iit.bas.bg/CIT_08/CIT_81.html UR - http://hdl.handle.net/10044/1/26083 VL - 8 ER -