Imperial College London


Faculty of EngineeringDyson School of Design Engineering




+44 (0)20 7594 9235p.kormushev Website




156Roderic Hill BuildingSouth Kensington Campus






BibTex format

author = {Shen, H and Yosinski, J and Kormushev, P and Caldwell, DG and Lipson, H},
doi = {10.2478/cait-2012-0022},
journal = {Cybernetics and Information Technologies},
pages = {66--75},
title = {Learning fast quadruped robot gaits with the RL PoWER spline parameterization},
url = {},
volume = {12},
year = {2012}

RIS format (EndNote, RefMan)

AB - Legged robots are uniquely privileged over their wheeled counterparts in their potential to access rugged terrain. However, designing walking gaits by hand for legged robots is a difficult and time-consuming process, so we seek algorithms for learning such gaits to automatically using real world experimentation. Numerous previous studies have examined a variety of algorithms for learning gaits, using an assortment of different robots. It is often difficult to compare the algorithmic results from one study to the next, because the conditions and robots used vary. With this in mind, we have used an open-source, 3D printed quadruped robot called QuadraTot, so the results may be verified, and hopefully improved upon, by any group so desiring. Because many robots do not have accurate simulators, we test gait-learning algorithms entirely on the physical robot. Previous studies using the QuadraTot have compared parameterized splines, the HyperNEAT generative encoding and genetic algorithm. Among these, the research on the genetic algorithm was conducted by (G l e t t e et al., 2012) in a simulator and tested on a real robot. Here we compare these results to an algorithm called Policy learning by Weighting Exploration with the Returns, or RL PoWER. We report that this algorithm has learned the fastest gait through only physical experiments yet reported in the literature, 16.3% faster than reported for HyperNEAT. In addition, the learned gaits are less taxing on the robot and more repeatable than previous record-breaking gaits.
AU - Shen,H
AU - Yosinski,J
AU - Kormushev,P
AU - Caldwell,DG
AU - Lipson,H
DO - 10.2478/cait-2012-0022
EP - 75
PY - 2012///
SN - 1311-9702
SP - 66
TI - Learning fast quadruped robot gaits with the RL PoWER spline parameterization
T2 - Cybernetics and Information Technologies
UR -
UR -
VL - 12
ER -