Imperial College London

Dr Petar Kormushev

Faculty of Engineering, Dyson School of Design Engineering

Lecturer
 
 
 

Contact

 

+44 (0)20 7594 9235
p.kormushev

Location

 

10-12 Prince's Gardens, South Kensington Campus

Citation

BibTeX format

@article{Kormushev:2008,
author = {Kormushev, P and Nomoto, K and Dong, F and Hirota, K},
journal = {Cybernetics and Information Technologies},
pages = {12--24},
title = {Time manipulation technique for speeding up reinforcement learning in simulations},
url = {http://www.cit.iit.bas.bg/CIT_08/CIT_81.html},
volume = {8},
year = {2008}
}

RIS format (EndNote, RefMan)

TY  - JOUR
AB  - A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the time of the simulation backwards on failure events is shown to speed up the learning by 260% and improve the state space exploration by 12% on the cart-pole balancing task, compared to the conventional Q-learning and Actor-Critic algorithms.
AU  - Kormushev,P
AU  - Nomoto,K
AU  - Dong,F
AU  - Hirota,K
EP  - 24
PY  - 2008///
SN  - 1311-9702
SP  - 12
TI  - Time manipulation technique for speeding up reinforcement learning in simulations
T2  - Cybernetics and Information Technologies
UR  - http://www.cit.iit.bas.bg/CIT_08/CIT_81.html
UR  - http://hdl.handle.net/10044/1/26083
VL  - 8
ER  -