Polebalancing
Released 1995
This package contains the code used to compare (neuron-level) SANE to one- and two-layer Adaptive Heuristic Critics (AHC) on the inverted pendulum problem. The code for the single-layer AHC was adapted from Rich Sutton's pole-balancing simulator, and the code for the two-layer AHC was adapted from Charles Anderson's pole simulator. Code for all three methods is included, together with sample runs for each.

The simulators are currently set up to balance a pole on a cart starting from random positions with random initial velocities. To always start with the pole and cart centered and at rest, set RANDOM_START to 0. For more details, see the paper on pole-balancing comparisons.
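As a rough illustration of what the RANDOM_START switch controls, the sketch below picks an initial cart-pole state in C. It is not taken from the package: the PoleState struct, the initial_state and uniform_sym helpers, and the numeric ranges are all assumptions made for this example, and the flag is passed as a function argument here so both settings can be tried side by side.

```c
#include <stdlib.h>

/* Default mirrors the behavior described above: random starts.
 * Set to 0 to always start centered and at rest. */
#ifndef RANDOM_START
#define RANDOM_START 1
#endif

/* Hypothetical state layout; the package's own variables may differ. */
typedef struct {
    double x;          /* cart position (m) */
    double x_dot;      /* cart velocity (m/s) */
    double theta;      /* pole angle from vertical (rad) */
    double theta_dot;  /* pole angular velocity (rad/s) */
} PoleState;

/* Uniform random value in [-limit, limit]. */
static double uniform_sym(double limit)
{
    return limit * (2.0 * rand() / (double)RAND_MAX - 1.0);
}

/* Return the starting state for one trial.
 * The ranges below are assumed values chosen for illustration only. */
PoleState initial_state(int random_start)
{
    PoleState s = {0.0, 0.0, 0.0, 0.0};  /* centered, no velocity */
    if (random_start) {
        s.x         = uniform_sym(2.4);
        s.x_dot     = uniform_sym(1.0);
        s.theta     = uniform_sym(0.2);
        s.theta_dot = uniform_sym(1.5);
    }
    return s;
}
```

Calling initial_state(RANDOM_START) at the start of each trial would then give either a randomized start or the fixed centered start, depending on how the macro is defined at compile time.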

Comments to moriarty@isi.edu.

Versions:

v1.0 2/21/95 moriarty

Download:
TAR
David E. Moriarty, Ph.D. Alumni, moriarty [at] alumni utexas net