Reinforcement Learning
Reinforcement Learning tasks are learning problems where the desired behavior is not known; only sparse feedback on how well the agent is doing is provided. Reinforcement Learning techniques include value-function and policy iteration methods (note that although evolutionary computation and neuroevolution can also be seen as reinforcement learning methods, they are presented separately in this area hierarchy.)
Subareas:
     [Expand to show all 20][Minimize]
Adrian Agogino Formerly affiliated Collaborator adrian k agogino [at] nasa gov
Samuel Barrett Ph.D. Student sbarrett [at] cs utexas edu
Julian Bishop Ph.D. Student julian [at] cs utexas edu
Craig Corcoran Ph.D. Student ccor [at] cs utexas edu
Todd Hester Postdoctoral Alumni todd [at] cs utexas edu
Leif Johnson Ph.D. Student leif [at] cs utexas edu
Igor V. Karpov Ph.D. Student ikarpov [at] gmail com
W. Bradley Knox Ph.D. Alumni bradknox [at] mit edu
Nate Kohl Ph.D. Alumni nate [at] natekohl net
Shailesh Kumar Masters Alumni
Elad Liebman Ph.D. Student eladlieb [at] cs utexas edu
Risto Miikkulainen Faculty risto [at] cs utexas edu
Jefferson Provost Ph.D. Alumni jefferson provost [at] gmail com
Jefferson Provost Ph.D. Alumni jefferson provost [at] gmail com
Joseph Reisinger Ph.D. Alumni joeraii [at] cs utexas edu
Joseph Reisinger Formerly affiliated Ph.D. Student joeraii [at] cs utexas edu
Jacob Schrum Ph.D. Alumni schrum2 [at] southwestern edu
Peter Stone Faculty pstone [at] cs utexas edu
Nathaniel Tucker Undergraduate Alumni
Shimon Whiteson Formerly affiliated Collaborator s a whiteson [at] uva nl
     [Expand to show all 178][Minimize]
Evolving Multimodal Behavior Through Modular Multiobjective Neuroevolution 2014
Jacob Schrum, PhD Thesis, The University of Texas at Austin. Tech Report TR-14-07.
Evolving Multimodal Behavior With Modular Neural Networks in Ms. Pac-Man 2014
Jacob Schrum and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2014), pp. 325--332, Vancouver, BC, Canada, July 2014. Best Paper: Digital Entertainment and Arts.
TacTex'13: A Champion Adaptive Power Trading Agent 2014
Daniel Urieli and Peter Stone, In Proceedings of the Twenty-Eighth Conference on Artificial Intelligence (AAAI 2014), July 2014.
A Learning Agent for Heat-Pump Thermostat Control 2013
Daniel Urieli and Peter Stone, In Proceedings of the 12th International Conference on Autonomous Agents and Multiagent Systems (AAMAS'13), May 2013.
Cooperating with a Markovian Ad Hoc Teammate 2013
Doran Chakraborty and Peter Stone, In Proceedings of the 12th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 2013.
Learning Exploration Strategies in Model-Based Reinforcement Learning 2013
Todd Hester, Manuel Lopes, and Peter Stone, In The Twelfth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 2013.
Learning Non-Myopically from Human-Generated Reward 2013
W. Bradley Knox and Peter Stone, In In Proceedings of the International Conference on Intelligent User Interfaces (IUI), March 2013.
Model-Selection for Non-Parametric Function Approximation in Continuous Control Problems: A Case Study in a Smart Energy System 2013
Daniel Urieli and Peter Stone, In Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML-PKDD'13), September 2013.
Multiagent Learning in the Presence of Memory-Bounded Agents 2013
Doran Chakraborty and Peter Stone, Autonomous Agents and Multiagent Systems (JAAMAS) (2013). Springer.
Targeted Opponent Modeling of Memory-Bounded Agents 2013
Doran Chakraborty and Peter Stone, In Proceedings of the Adaptive Learning Agents Workshop (ALA), May 2013.
The Open-Source TEXPLORE Code Release for Reinforcement Learning on Robots 2013
Todd Hester and Peter Stone, In RoboCup-2013: Robot Soccer World Cup {XVII}, Sven Behnke and Arnoud Visser and Rong Xiong and Manuela Veloso (Eds.) 2013. Springer Verlag.
Training a Robot via Human Feedback: A Case Study 2013
W. Bradley Knox, Peter Stone, and Cynthia Breazeal, In Social Robotics, October 2013.
Accelerating Evolution via Egalitarian Social Learning 2012
Wesley Tansey, Eliana Feasley, and Risto Miikkulainen, Proceedings of the 14th Annual Genetic and Evolutionary Computation Conference (GECCO 2012) (2012).
Evolution of a Communication Code in Cooperative Tasks 2012
Aditya Rawal, Padmini Rajagopalan, Risto Miikkulainen and Kay Holekamp, In Artificial Life (13th International Conference on the Synthesis and Simulation of Living Systems), East Lansing, Michigan, USA 2012.
How Humans Teach Agents: A New Experimental Perspective 2012
W. Bradley Knox, Brian D. Glass, Bradley C. Love, W. Todd Maddox, and Peter Stone, International Journal of Social Robotics, Vol. 4 (2012), pp. 409-421. Springer Netherlands.
Intrinsically Motivated Model Learning for a Developing Curious Agent 2012
Todd Hester and Peter Stone, In Eleventh International Conference on Autonomous Agents and Multiagent Systems - Adaptive Learning Agents Workshop (AAMAS - ALA), June 2012.
Intrinsically Motivated Model Learning for a Developing Curious Agent 2012
Todd Hester and Peter Stone, In The Eleventh International Conference on Development and Learning (ICDL), Nov 2012.
Learning from feedback on actions past and intended 2012
W. Bradley Knox, Cynthia Breazeal, and Peter Stone, In In Proceedings of 7th ACM/IEEE International Conference on Human-Robot Interaction, Late-Breaking Reports Session (HRI 2012), March 2012.
Learning from Human-Generated Reward 2012
W. Bradley Knox,
Reinforcement Learning from Human Reward: Discounting in Episodic Tasks 2012
W. Bradley Knox and Peter Stone, In In Proceedings of the 21st IEEE International Symposium on Robot and Human Interactive Communication (Ro-Man), September 2012.
Reinforcement Learning with Human and MDP Reward 2012
W. Bradley Knox and Peter Stone, In Proceedings of the 11th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2012), June 2012.
TEXPLORE: Temporal Difference Reinforcement Learning for Robots and Time-Constrained Domains. 2012
Todd Hester, PhD Thesis, The University of Texas at Austin. Code available at: http://www.ros.org/wiki/rl-texplore-ros-pkg.
RTMBA: A Real-Time Model-Based Reinforcement Learning Architecture for Robot Control 2012
Todd Hester, Michael Quinlan, and Peter Stone, In {IEEE} International Conference on Robotics and Automation (ICRA), May 2012.
TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots 2012
Todd Hester and Peter Stone, Machine Learning (2012).
A modular reinforcement learning model for human visuomotor behavior in a driving task 2011
Brian Sullivan, Leif Johnson, Dana Ballard and Mary Hayhoe, Proceedings of the AISB 2011 Symposium on Architectures for Active Vision. (2011), pp. 33-40.
A Real-Time Model-Based Reinforcement Learning Architecture for Robot Control 2011
Todd Hester, Michael Quinlan, and Peter Stone,
An Introduction to Inter-task Transfer for Reinforcement Learning 2011
Matthew E. Taylor and Peter Stone, AI Magazine, Vol. 32, 1 (2011), pp. 15--34.
Characterizing Reinforcement Learning Methods through Parameterized Learning Problems 2011
Shivaram Kalyanakrishnan and Peter Stone, Machine Learning (2011).
Evolving Multimodal Networks for Multitask Games 2011
Jacob Schrum and Risto Miikkulainen, In Proceedings of the IEEE Conference on Computational Intelligence and Games (CIG 2011), pp. 102--109, Seoul, South Korea, September 2011. IEEE. (Best Paper Award).
Human-Assisted Neuroevolution Through Shaping, Advice and Examples 2011
Igor V. Karpov, Vinod K. Valsalam and Risto Miikkulainen, In Proceedings of the 13th Annual Genetic and Evolutionary Computation Conference (GECCO 2011), Dublin, Ireland, July 2011.
Learning and Using Models 2011
Todd Hester and Peter Stone, In Reinforcement Learning: State of the Art 2011.
On Learning with Imperfect Representations 2011
Shivaram Kalyanakrishnan and Peter Stone, In Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, April 2011.
Protecting Against Evaluation Overfitting in Empirical Reinforcement Learning 2011
Shimon Whiteson, Brian Tanner, Matthew E. Taylor, and Peter Stone, In {IEEE} Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), April 2011.
Structure Learning in Ergodic Factored MDPs without Knowledge of the Transition Function's In-Degree 2011
Doran Chakraborty and Peter Stone, In Proceedings of the Twenty Eighth International Conference on Machine Learning (ICML'11), June 2011.
Understanding Human Teaching Modalities in Reinforcement Learning Environments: A Preliminary Report 2011
W. Bradley Knox and Peter Stone, In IJCAI 2011 Workshop on Agents Learning Interactively from Human Teachers (ALIHT), July 2011.
UT^2: Human-like Behavior via Neuroevolution of Combat Behavior and Replay of Human Traces 2011
Jacob Schrum, Igor V. Karpov and Risto Miikkulainen, In Proceedings of the IEEE Conference on Computational Intelligence and Games (CIG 2011), pp. 329--336, Seoul, South Korea, September 2011. IEEE.
Combining Manual Feedback with Subsequent MDP Reward Signals for Reinforcement Learning 2010
W. Bradley Knox and Peter Stone, In Proc. of 9th Int. Conf. on Autonomous Agents and Multiagent Systems (AAMAS 2010), May 2010.
Convergence, Targeted Optimality and Safety in Multiagent Learning 2010
Doran Chakraborty and Peter Stone, In Proceedings of the Twenty-seventh International Conference on Machine Learning (ICML 2010), June 2010.
Evolving Agent Behavior In Multiobjective Domains Using Fitness-Based Shaping 2010
Jacob Schrum and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2010), pp. 439--446, Portland, Oregon, July 2010.
Gaussian processes for sample efficient reinforcement learning with RMAX-like exploration 2010
Tobias Jung and Peter Stone, In Proceedings of the European Conference on Machine Learning, September 2010.
Neuroevolution 2010
Risto Miikkulainen, In Encyclopedia of Machine Learning, New York 2010. Springer.
Online Model Learning in Adversarial Markov Decision Processes (Extended Abstract) 2010
Doran Chakraborty and Peter Stone, In Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), pp. 1583–-1584, May 2010.
Real Time Targeted Exploration in Large Domains 2010
Todd Hester and Peter Stone, In Proceedings of the Ninth International Conference on Development and Learning (ICDL 2010), 2010 (Eds.), August 2010.
Structured Exploration for Reinforcement Learning 2010
Nicholas Kenneth Jong,
Transfer Learning for Reinforcement Learning on a Physical Robot 2010
Samuel Barrett, Matthew E. Taylor, and Peter Stone, In Ninth International Conference on Autonomous Agents and Multiagent Systems - Adaptive Learning Agents Workshop (AAMAS - ALA), May 2010.
An Empirical Analysis of Value Function-Based and Policy Search Reinforcement Learning 2009
Shivaram Kalyanakrishnan and Peter Stone, In The Eighth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), pp. 749-756, Richland, SC, May 2009. International Foundation for Autonomous Agents and Multiagent Sy...
An Empirical Comparison of Abstraction in Models of Markov Decision Processes 2009
Todd Hester and Peter Stone, In Proceedings of the ICML/UAI/COLT Workshop on Abstraction in Reinforcement Learning, June 2009.
Critical Factors in the Empirical Performance of Temporal Difference and Evolutionary Methods for Reinforcement Learning 2009
Shimon Whiteson, Matthew E. Taylor, and Peter Stone, Journal of Autonomous Agents and Multi-Agent Systems, Vol. 21, 1 (2009), pp. 1-27.
Design Principles for Creating Human-Shapable Agents 2009
W. Bradley Knox, Ian Fasel, and Peter Stone, In AAAI Spring 2009 Symposium on Agents that Learn from Human Teachers, March 2009.
Evolving Adaptive Intelligence: Using NeuroEvolution with Temporal Difference Methods in the Game Domain 2009
Nathaniel Tucker, Technical Report HR-09-04, Department of Computer Science, The University of Texas at Austin..
Feature Selection for Value Function Approximation Using Bayesian Model Selection 2009
Tobias Jung and Peter Stone, In The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, September 2009.
Generalized Domains for Empirical Evaluations in Reinforcement Learning 2009
Shimon Whiteson, Brian Tanner, Matthew E. Taylor, and Peter Stone, In ICML Workshop on Evaluation Methods for Machine Learning, June 2009. To appear..
Generalized Model Learning for Reinforcement Learning in Factored Domains 2009
Todd Hester and Peter Stone, In The Eighth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 2009.
Interactively Shaping Agents via Human Reinforcement: The TAMER Framework 2009
W. Bradley Knox and Peter Stone, In The Fifth International Conference on Knowledge Capture, September 2009.
Learning Complementary Multiagent Behaviors: A Case Study 2009
Shivaram Kalyanakrishnan and Peter Stone, In Proceedings of the RoboCup International Symposium 2009 2009. Springer Verlag.
Learning in Fractured Problems for Constructive Neural Network Algorithms 2009
Nate Kohl, PhD Thesis, Department of Computer Sciences, University of Texas at Austin.
Transfer Learning for Reinforcement Learning Domains: A Survey 2009
Matthew E. Taylor and Peter Stone, Journal of Machine Learning Research, Vol. 10, 1 (2009), pp. 1633-1685.
A General Purpose Task Specification Language for Bootstrap Learning 2008
Ian Fasel, Michael Quinlan, and Peter Stone, In AAAI Spring 2009 Symposium on Agents that Learn from Human Teachers, March 2008.
Autonomous Transfer for Reinforcement Learning 2008
Matthew E. Taylor, Gregory Kuhlmann, and Peter Stone, In The Seventh International Joint Conference on Autonomous Agents and Multiagent Systems, May 2008.
Competition Between Reinforcement Learning Methods in a Predator-Prey Grid World 2008
Jacob Schrum, Technical Report AI08-9, The University of Texas at Austin, Department of Computer Sciences.
From pixels to policies: a bootstrapping agent 2008
Jeremy Stober and Benjamin Kuipers, In Proceedings of the IEEE International Conference on Development and Learning 2008.
Instance-Based Action Models for Fast Action Planning 2008
Mazda Ahmadi and Peter Stone, In RoboCup-2007: Robot Soccer World Cup XI, Ubbo Visser and Fernando Ribeiro and Takeshi Ohashi and Frank Dellaert (Eds.), Vol. 5001, pp. 1-16, Berlin 2008. Springer Verlag.
Model-based Reinforcement Learning in a Complex Domain 2008
Shivaram Kalyanakrishnan, Peter Stone, and Yaxin Liu, In RoboCup-2007: Robot Soccer World Cup XI, Ubbo Visser and Fernando Ribeiro and Takeshi Ohashi and Frank Dellaert (Eds.), Vol. 5001, pp. 171-83, Berlin 2008. Springer Verlag.
Online Kernel Selection for Bayesian Reinforcement Learning 2008
Joseph Reisinger, Peter Stone, and Risto Miikkulainen, In Proceedings of the Twenty-Fifth International Conference on Machine Learning, July 2008.
Transferring Instances for Model-Based Reinforcement Learning 2008
Matthew E. Taylor, Nicholas K. Jong, and Peter Stone, In Machine Learning and Knowledge Discovery in Databases, Vol. 5212, pp. 488-505, September 2008.
Acquiring Evolvability through Adaptive Representations 2007
Joseph Reisinger and Risto Miikkulainen, In Proceeedings of the Genetic and Evolutionary Computation Conference, pp. 1045-1052 2007.
Adaptive Tile Coding for Value Function Approximation 2007
Shimon Whiteson, Matthew E. Taylor, and Peter Stone, Technical Report AI-TR-07-339, University of Texas at Austin.
Autonomous Learning of Stable Quadruped Locomotion 2007
Manish Saggar, Thomas D'Silva, Nate Kohl, and Peter Stone, In RoboCup-2006: Robot Soccer World Cup X, Gerhard Lakemeyer and Elizabeth Sklar and Domenico Sorenti and Tomoichi Takahashi (Eds.), Vol. 4434, pp. 98-109, Berlin 2007. Springer Verlag.
Batch Reinforcement Learning in a Complex Domain 2007
Shivaram Kalyanakrishnan and Peter Stone, In The Sixth International Joint Conference on Autonomous Agents and Multiagent Systems, pp. 650-657, New York, NY, USA, May 2007. ACM.
Coevolution of Role-Based Cooperation in Multi-Agent Systems 2007
Chern Han Yong and Risto Miikkulainen, Technical Report AI07-338, Department of Computer Sciences, The University of Texas at Austin.
Cross-Domain Transfer for Reinforcement Learning 2007
Matthew E. Taylor and Peter Stone, In Proceedings of the Twenty-Fourth International Conference on Machine Learning, June 2007.
Empirical Studies in Action Selection for Reinforcement Learning 2007
Shimon Whiteson, Matthew E. Taylor, and Peter Stone, Adaptive Behavior, Vol. 15, 1 (2007), pp. 33-50.
Graph-Based Domain Mapping for Transfer Learning in General Games 2007
Gregory Kuhlmann and Peter Stone, In Proceedings of the 18th European Conference on Machine Learning, September 2007.
Half Field Offense in RoboCup Soccer: A Multiagent Reinforcement Learning Case Study 2007
Shivaram Kalyanakrishnan, Yaxin Liu, and Peter Stone, In RoboCup-2006: Robot Soccer World Cup X, Gerhard Lakemeyer and Elizabeth Sklar and Domenico Sorenti and Tomoichi Takahashi (Eds.), Vol. 4434, pp. 72-85, Berlin 2007. Springer Verlag.
IFSA: Incremental Feature-Set Augmentation for Reinforcement Learning Tasks 2007
Mazda Ahmadi, Matthew E. Taylor, and Peter Stone, In The Sixth International Joint Conference on Autonomous Agents and Multiagent Systems, May 2007.
Model-Based Exploration in Continuous State Spaces 2007
Nicholas K. Jong and Peter Stone, In The Seventh Symposium on Abstraction, Reformulation, and Approximation, July 2007.
Model-Based Function Approximation for Reinforcement Learning 2007
Nicholas K. Jong and Peter Stone, In The Sixth International Joint Conference on Autonomous Agents and Multiagent Systems, May 2007.
Reinforcement Learning in High-Diameter, Continuous Environments 2007
Jefferson Provost, PhD Thesis, Computer Sciences Department, University of Texas at Austin.
Representation Transfer for Reinforcement Learning 2007
Matthew E. Taylor and Peter Stone, In AAAI 2007 Fall Symposium on Computational Approaches to Representation Change during Learning and Development, November 2007.
Self-Organizing Distinctive State Abstraction Using Options 2007
Jefferson Provost, Benjamin J. Kuipers, and Risto Miikkulainen, In Proceedings of the 7th International Conference on Epigenetic Robotics 2007.
Temporal Difference and Policy Search Methods for Reinforcement Learning: An Empirical Comparison 2007
Matthew E. Taylor, Shimon Whiteson, and Peter Stone, In Proceedings of the Twenty-Second Conference on Artificial Intelligence, pp. 1675-1678, July 2007. Nectar Track.
The Chin Pinch: A Case Study in Skill Learning on a Legged Robot 2007
Peggy Fidelman and Peter Stone, In RoboCup-2006: Robot Soccer World Cup X, Gerhard Lakemeyer and Elizabeth Sklar and Domenico Sorenti and Tomoichi Takahashi (Eds.), Vol. 4434, pp. 59-71, Berlin 2007. Springer Verlag.
Transfer Learning via Inter-Task Mappings for Temporal Difference Learning 2007
Matthew E. Taylor, Peter Stone, and Yaxin Liu, Journal of Machine Learning Research, Vol. 8, 1 (2007), pp. 2125-2167.
Transfer via Inter-Task Mappings in Policy Search Reinforcement Learning 2007
Matthew E. Taylor, Shimon Whiteson, and Peter Stone, In Proceedings of the 6th International Joint Conference on Autonomous Agents and Multiagent Systems, May 2007.
Comparing Evolutionary and Temporal Difference Methods for Reinforcement Learning 2006
Matthew Taylor, Shimon Whiteson, and Peter Stone, In Proceedings of the Genetic and Evolutionary Computation Conference, pp. 1321-28, July 2006.
Developing navigation behavior through self-organizing distinctive state abstraction 2006
Jefferson Provost, Benjamin J. Kuipers, and Risto Miikkulainen, Connection Science, Vol. 18 (2006), pp. 159-172.
Evolutionary Function Approximation for Reinforcement Learning 2006
Shimon Whiteson and Peter Stone, Journal of Machine Learning Research, Vol. 7 (2006), pp. 877-917.
Integration and Evaluation of Exploration-Based Learning in Games 2006
Igor V. Karpov, Thomas D'Silva, Craig Varrichio, Kenneth O. Stanley, Risto Miikkulainen, In Proceedings of the {IEEE} Symposium on Computational Intelligence and Games, Reno, NV 2006. IEEE.
Keepaway Soccer: From Machine Learning Testbed to Benchmark 2006
Peter Stone, Gregory Kuhlmann, Matthew E. Taylor, and Yaxin Liu, In RoboCup-2005: Robot Soccer World Cup IX, Itsuki Noda and Adam Jacoff and Ansgar Bredenfeld and Yasutake Takahashi (Eds.), Vol. 4020, pp. 93-105, Berlin 2006. Springer Verlag.
Using Active Relocation to Aid Reinforcement Learning 2006
Lilyana Mihalkova and Raymond Mooney, In Prodeedings of the 19th International FLAIRS Conference (FLAIRS-2006), pp. 580-585, Melbourne Beach, FL, May 2006.
Value Function Transfer for General Game Playing 2006
Bikramjit Banerjee, Gregory Kuhlmann, and Peter Stone, In ICML workshop on Structural Knowledge Transfer for Machine Learning, June 2006.
Value-Function-Based Transfer for Reinforcement Learning Using Structure Mapping 2006
Yaxin Liu and Peter Stone, In Proceedings of the Twenty-First National Conference on Artificial Intelligence, pp. 415-20, July 2006.
Academic AI and Video Games: A Case Study of Incorporating Innovative Academic Research into a Video Game Prototype 2005
Aliza Gold, In Proceedings of the IEEE 2005 Symposium on Computational Intelligence and Games (CIG'05) 2005. Piscataway, NJ: IEEE.
Behavior Transfer for Value-Function-Based Reinforcement Learning 2005
Matthew E. Taylor and Peter Stone, In The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems, Frank Dignum and Virginia Dignum and Sven Koenig and Sarit Kraus and Munindar P. Singh and Michael Woo...
Constructing Good Learners Using Evolved Pattern Generators 2005
Vinod K. Valsalam, James A. Bednar, and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference, GECCO-2005, H.-G. Beyer and others (Eds.), pp. 11-18 2005.
Effective Image Compression Using Evolved Wavelets 2005
Uli Grasemann and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference 2005.
Evolving Neural Network Agents in the NERO Video Game 2005
Kenneth O. Stanley, Bobby D. Bryant, and Risto Miikkulainen, In Proceedings of the IEEE 2005 Symposium on Computational Intelligence and Games (CIG'05), Piscataway, NJ 2005. IEEE.
Evolving Neural Network Ensembles for Control Problems 2005
David Pardoe, Michael Ryoo, and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference 2005.
Function Approximation via Tile Coding: Automating Parameter Choice 2005
Alexander A. Sherstov and Peter Stone, In SARA 2005, J.-D. Zucker and I. Saitta (Eds.), Vol. 3607, pp. 194-205, Berlin 2005. Springer Verlag.
Incorporating Advice into Evolution of Neural Networks 2005
Chern Han Yong, Kenneth O. Stanley, and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2005) 2005. Late Breaking Papers.
Learning Basic Navigation for Personal Satellite Assistant Using Neuroevolution 2005
Yiu Fai Sit and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference 2005.
Neuroevolution of an Automobile Crash Warning System 2005
Kenneth Stanley, Nate Kohl, Rini Sherony, and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference 2005.
Real-Time Learning in the NERO Video Game 2005
Kenneth O. Stanley, Ryan Cornelius, Risto Miikkulainen, Thomas D'Silva, and Aliza Gold, In Proceedings of the Artificial Intelligence and Interactive Digital Entertainment Conference (AIIDE 2005) Demo Papers 2005.
Real-time Neuroevolution in the NERO Video Game 2005
Kenneth O. Stanley, Bobby D. Bryant, and Risto Miikkulainen, IEEE Transactions on Evolutionary Computation (2005), pp. 653-668. IEEE.
Reinforcement Learning for RoboCup-Soccer Keepaway 2005
Peter Stone, Richard S. Sutton, and Gregory Kuhlmann, Adaptive Behavior, Vol. 13, 3 (2005), pp. 165-188.
Retaining Learned Behavior During Real-Time Neuroevolution 2005
Thomas D'Silva, Roy Janik, Michael Chrien, Kenneth O. Stanley and Risto Miikkulainen, Artificial Intelligence and Interactive Digital Entertainment (2005). American Association for Artificial Intelligence.
Towards an Empirical Measure of Evolvability 2005
Joseph Reisinger, Kenneth O. Stanley, Risto Miikkulainen, In Genetic and Evolutionary Computation Conference {(GECCO2005)} Workshop Program, pp. 257-264, Washington, D.C. 2005. ACM Press.
Value Functions for RL-Based Behavior Transfer: A Comparative Study 2005
Matthew E. Taylor, Peter Stone, and Yaxin Liu, In Proceedings of the Twentieth National Conference on Artificial Intelligence, July 2005.
Self-Organizing Perceptual and Temporal Abstraction for Robot Reinforcement Learning 2004
Jefferson Provost, Benjamin J. Kuipers and Risto Miikkulainen, In AAAI-04 Workshop on Learning and Planning in Markov Processes 2004.
Adaptive Job Routing and Scheduling 2004
Shimon Whiteson and Peter Stone, Engineering Applications of Artificial Intelligence, Vol. 17(7), 7 (2004), pp. 855-869. Corrected version.
Competitive Coevolution through Evolutionary Complexification 2004
Kenneth O. Stanley and Risto Miikkulainen, Journal of Artificial Intelligence Research, Vol. 21 (2004), pp. 63-100.
Efficient Allele Fitness Assignment with Self-organizing Multi-agent System 2004
Adrian Agogino and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004) Workshop Program, New York, NY 2004. Springer-Verlag.
Efficient Evolution of Neural Networks Through Complexification 2004
Kenneth O. Stanley, PhD Thesis, Department of Computer Sciences, The University of Texas at Austin.
Evolving a Roving Eye for Go 2004
Kenneth O. Stanley and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004), Berlin 2004. Springer Verlag.
Evolving Reusable Neural Modules 2004
Joseph Reisinger, Kenneth O. Stanley, and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference 2004.
Evolving Wavelets using a Coevolutionary Genetic Algorithm and Lifting 2004
Uli Grasemann and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference, pp. 969-980, San Francisco 2004. Kaufmann.
Exploiting Morphological Conventions for Genetic Reuse 2004
Kenneth O. Stanley, Joseph Reisinger, and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference ({GECCO}-2004) Workshop Program, Berlin 2004. Springer Verlag.
Guiding a Reinforcement Learner with Natural Language Advice: Initial Results in RoboCup Soccer 2004
Gregory Kuhlmann, Peter Stone, Raymond J. Mooney, and Jude W. Shavlik, In The AAAI-2004 Workshop on Supervisory Control of Learning and Adaptive Systems, July 2004.
Machine Learning for Fast Quadrupedal Locomotion 2004
Nate Kohl and Peter Stone, In Nineteenth National Conference on Artificial Intelligence, pp. 611-616, July 2004.
Policy Gradient Reinforcement Learning for Fast Quadrupedal Locomotion 2004
Nate Kohl and Peter Stone, In Proceedings of the {IEEE} International Conference on Robotics and Automation, pp. 2619-2624, May 2004.
The Constructivist Learning Architecture: A Model of Cognitive Development for Robust Autonomous Robots 2004
Harold H. Chaput, PhD Thesis, Department of Computer Sciences, The University of Texas at Austin. Also Technical Report TR-04-34.
Towards Learning to Ignore Irrelevant State Variables 2004
Nicholas K. Jong and Peter Stone, In The AAAI-2004 Workshop on Learning and Planning in Markov Processes -- Advances and Challenges 2004.
Transfer of Neuroevolved Controllers in Unstable Domains 2004
Faustino J. Gomez and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference, Berlin 2004. Springer.
A Taxonomy for Artificial Embryogeny 2003
Kenneth O. Stanley and Risto Miikkulainen, Artificial Life, Vol. 9, 2 (2003), pp. 93-130.
Achieving High-Level Functionality through Evolutionary Complexification 2003
Kenneth O. Stanley and Risto Miikkulainen, In Proceedings of the AAAI-2003 Spring Symposium on Computational Synthesis, Stanford, CA 2003. AAAI Press.
Active Guidance for a Finless Rocket Using Neuroevolution 2003
Faustino J. Gomez and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference, pp. 2084-2095, San Francisco 2003. Morgan Kaufmann.
Evolving Adaptive Neural Networks with and Without Adaptive Synapses 2003
Kenneth O. Stanley, Bobby D. Bryant, and Risto Miikkulainen, In Proceedings of the 2003 Congress on Evolutionary Computation, Piscataway, NJ 2003. IEEE.
Neuroevolution for Adaptive Teams 2003
Bobby D. Bryant and Risto Miikkulainen, In Proceedings of the 2003 Congress on Evolutionary Computation (CEC 2003), pp. 2194-2201, Piscataway, NJ 2003. IEEE.
Robust Non-Linear Control through Neuroevolution 2003
Faustino J. Gomez, PhD Thesis, Department of Computer Sciences, The University of Texas at Austin.
Utilizing Domain Knowledge in Neuroevolution 2003
James Fan, Raymond Lau, and Risto Miikkulainen, Proceedings of the Twentieth International Conference on Machine Learning (ICML-03, Washington, DC)
Adaptive Control Utilising Neural Swarming 2002
Alex v. E. Conradie, Risto Miikkulainen, and Christiaan Aldrich, In Proceedings of the Genetic and Evolutionary Computation Conference, William B. Langdon and Erick Cantu-Paz and Keith E. Mathias and Rajkumar Roy and David Davis and Riccardo Poli and Karth...
Continual Coevolution Through Complexification 2002
Kenneth O. Stanley and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2002), William B. Langdon and Erick Cantu-Paz and Keith E. Mathias and Rajkumar Roy and David Davis and Riccardo Pol...
Cultural Enhancement Of Neuroevolution 2002
Paul H. McQuesten, PhD Thesis, Department of Computer Sciences, The University of Texas at Austin. Technical Report AI-02-295.
Efficient Evolution Of Neural Network Topologies 2002
Kenneth O. Stanley and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference, William B. Langdon and Erick Cantu-Paz and Keith E. Mathias and Rajkumar Roy and David Davis and Riccardo Poli and Karthik...
Efficient Reinforcement Learning Through Evolving Neural Network Topologies 2002
Kenneth O. Stanley and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2002), pp. 9, San Francisco 2002. Morgan Kaufmann.
Eugenic Evolution Utilizing A Domain Model 2002
Matthew Alden, Aard-Jan van Kesteren, and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO 2002), pp. 279-286 2002.
Evolving Neural Networks Through Augmenting Topologies 2002
Kenneth O. Stanley and Risto Miikkulainen, Evolutionary Computation, Vol. 10, 2 (2002), pp. 99-127.
Intelligent Process Control Utilizing Symbiotic Memetic Neuro-Evolution 2002
Alex v. E. Conradie, Risto Miikkulainen, and Christiaan Aldrich, In Proceedings of the 2002 Congress on Evolutionary Computation, pp. 6 2002.
Numerical Optimization With Neuroevolution 2002
Brian Greer, Henri Hakonen, Risto Lahdelma, and Risto Miikkulainen, In Proceedings of the 2002 Congress on Evolutionary Computation, pp. 361-401, Piscataway, NJ 2002. IEEE. Undergraduate Thesis, Department of Computer Sciences, The University of Texas at Aust...
The Dominance Tournament Method of Monitoring Progress in Coevolution 2002
Kenneth O. Stanley and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference ({GECCO}-2002) Workshop Program, pp. 7, San Francisco 2002. Morgan Kaufmann.
A Neuroevolution Method For Dynamic Resource Allocation On A Chip Multiprocessor 2001
Faustino J. Gomez, Doug Burger, and Risto Miikkulainen, In Proceedings of the {INNS-IEEE} International Joint Conference on Neural Networks, pp. 2355-2361, Piscataway, NJ 2001. IEEE.
A Social Reinforcement Learning Agent 2001
Charles Lee Isbell, Christian R. Shelton, Michael Kearns, Satinder Singh, and Peter Stone, In Proceedings of the Fifth International Conference on Autonomous Agents, pp. 377--384 2001.
Abrupt And Gradual Sound Change In An Expanding Lexicon 2001
Melissa A. Redford and Risto Miikkulainen, Technical Report AI01-289, Department of Computer Sciences, The University of Texas at Austin.
Applying ESP And Region Specialists To Neuro-Evolution For Go 2001
Andres Santiago Perez-Bergquist, Technical Report TR-01-24, Department of Computer Science, University of Texas at Austin.
Co-Evolving A Go-Playing Neural Network 2001
Alex Lubberts and Risto Miikkulainen, In Coevolution: {T}urning Adaptive Algorithms Upon Themselves, Birds-of-a-Feather Workshop, Genetic and Evolutionary Computation Conference ({GECCO}-2001), pp. 6 2001.
Constrained Emergence Of Universals And Variation In Syllable Systems 2001
Melissa A. Redford, Chun Chi Chen, and Risto Miikkulainen, Language and Speech (2001), pp. 27-56. Manuscript.
Cooperative Coevolution Of Multi-Agent Systems 2001
Chern Han Yong and Risto Miikkulainen, Technical Report AI07-338, Department of Computer Sciences, The University of Texas at Austin.
Creating Melodies With Evolving Recurrent Neural Networks 2001
Chun-Chi J. Chen and Risto Miikkulainen, In Proceedings of the {INNS-IEEE} International Joint Conference on Neural Networks, pp. 2241-2246, Piscataway, NJ 2001. IEEE.
Evolving Populations Of Expert Neural Networks 2001
Joseph Bruce and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference, pp. 251-257, San Francisco, CA 2001. Morgan Kaufmann.
Eugenic Neuro-Evolution For Reinforcement Learning 2000
Daniel Polani and Risto Miikkulainen, In Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2000), pp. 1041-1046, San Francisco 2000. Morgan Kaufmann.
Neuro-Evolution And Natural Deduction 2000
Nirav S. Desai and Risto Miikkulainen, In Proceedings of The First {IEEE} Symposium on Combinations of Evolutionary Computation and Neural Networks, pp. 64-69, Piscataway, NJ 2000. IEEE.
Online Interactive Neuro-Evolution 2000
Adrian Agogino, Kenneth O. Stanley, and Risto Miikkulainen, Neural Processing Letters (2000), pp. 29-38.
TPOT-RL Applied to Network Routing 2000
Peter Stone, In Proceedings of the Seventeenth International Conference on Machine Learning, pp. 935-942 2000.
Confidence Based Dual Reinforcement Q-Routing: An Adaptive On-Line Routing Algorithm 1999
Shailesh Kumar and Risto Miikkulainen, In 16th International Joint Conference on Artificial Intelligence (IJCAI-99), pp. 758--763, Stockholm, Sweden 1999. San Francisco, CA: Kaufmann.
Solving Non-Markovian Control Tasks With Neuroevolution 1999
Faustino J. Gomez and Risto Miikkulainen, unpublished. Dissertation Proposal, Computer Science Department, University of Texas at Austin.
Team-Partitioned, Opaque-Transition Reinforcement Learning 1999
Peter Stone and Manuela Veloso, In RoboCup-98: Robot Soccer World Cup II, Minoru Asada and Hiroaki Kitano (Eds.), Vol. 1604, pp. 261-72, Berlin 1999. Springer Verlag. Also in Proceedings of the Third International Confe...
2-D Pole Balancing With Recurrent Evolutionary Networks 1998
Faustino Gomez and Risto Miikkulainen, In Proceedings of the International Conference on Artificial Neural Networks (ICANN-98), pp. 425-430, Skovde, Sweden 1998. Berlin, New York: Springer.
Confidence Based Dual Reinforcement Q-Routing: An On-Line Adaptive Network Routing Algorithm 1998
Shailesh Kumar, Masters Thesis, Department of Computer Sciences, the University of Texas at Austin.. 108. Technical Report AI-98-267.
Confidence Based Q-Routing: An On-Line Adaptive Network Routing Algorithm 1998
Shailesh Kumar and Risto Miikkulainen, Smart Engineering Systems: Neural Networks, Fuzzy Logic, Data Mining, and Evolutionary ProgrammingC. H. Dagli and M. Akay and O. Ersoy and B. R. Fernandez and A. Smith (Eds.), Vol. 8 (1998).
Eugenic Evolution For Combinatorial Optimization 1998
John W. Prior, Masters Thesis, Department of Computer Sciences, The University of Texas at Austin. 126. Technical Report AI98-268.
Evolving Neural Networks To Play Go 1998
Norman Richards, David Moriarty, and Risto Miikkulainen, Applied IntelligenceThomas B{"a}ck (Eds.) (1998), pp. 768-775. San Francisco, CA: Morgan Kaufmann.
Hierarchical Evolution Of Neural Networks 1998
David E. Moriarty and Risto Miikkulainen, In Proceedings of the 1998 IEEE Conference on Evolutionary Computation (ICEC98), pp. 428-433, Anchorage, AK 1998. Piscataway, NJ: IEEE.
Modeling The Emergence Of Syllable Systems 1998
Melissa A. Redford, Chun Chi Chen, and Risto Miikkulainen, In Proceedings of the 20th Annual Conference of the Cognitive Science Society, Morton Ann Gernsbacher and Sharon J. Derry (Eds.), pp. 882-886 1998. Hillsdale, NJ: Erlbaum.
Culling And Teaching In Neuro-Evolution 1997
Paul McQuesten and Risto Miikkulainen, In Proceedings of the Seventh International Conference on Genetic Algorithms (ICGA-97, East Lansing, MI), Thomas B{"a}ck (Eds.), pp. 760-767 1997. San Francisco, CA: Morgan Kaufmann.
Dual Reinforcement Q-Routing: An On-Line Adaptive Routing Algorithm 1997
Shailesh Kumar and Risto Miikkulainen, Smart Engineering Systems: Neural Networks, Fuzzy Logic, Data Mining, and Evolutionary ProgrammingC. H. Dagli, M. Akay, O. Ersoy, B. R. Fernandez and A. Smith (Eds.), Vol. 7 (1997).
Forming Neural Networks Through Efficient And Adaptive Coevolution 1997
David E. Moriarty and Risto Miikkulainen, Evolutionary Computation, Vol. 5 (1997), pp. 373--399.
Incremental Evolution Of Complex General Behavior 1997
Faustino Gomez and Risto Miikkulainen, Adaptive Behavior, 5 (1997), pp. 317-342.
Symbiotic Evolution Of Neural Networks In Sequential Decision Tasks 1997
David E. Moriarty, PhD Thesis, Department of Computer Sciences, The University of Texas at Austin. 117. Technical Report UT-AI97-257.
Efficient Reinforcement Learning Through Symbiotic Evolution 1996
David E. Moriarty and Risto Miikkulainen, Machine LearningLeslie Pack Kaelbling (Eds.), AI94-224 (1996), pp. 11-32.
Evolving Obstacle Avoidance Behavior In A Robot Arm 1996
David E. Moriarty and Risto Miikkulainen, In From Animals to Animats 4: Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior, Pattie Maes and Maja J. Mataric and Jean-Arcady Meyer and Jordan Pollack an...
On-Line Adaptation Of A Signal Predistorter Through Dual Reinforcement Learning 1996
Patrick Goetz, Shailesh Kumar and Risto Miikkulainen, In Machine Learning: Proceedings of the 13th Annual Conference (Bari, Italy), Lorenza Saitta (Eds.), pp. 175-181 1996. San Francisco, CA: Morgan Kaufmann.
Discovering Complex Othello Strategies Through Evolutionary Neural Networks 1995
David E. Moriarty and Risto Miikkulainen, Connection Science, Vol. 7 (1995), pp. 195--209.
Learning Sequential Decision Tasks 1995
David E. Moriarty and Risto Miikkulainen, Technical Report AI95-229, Department of Computer Sciences, The University of Texas at Austin.
Evolutionary Neural Networks For Value Ordering In Constraint Satisfaction Problems 1994
David E. Moriarty and Risto Miikkulainen, Technical Report AI94-218, Department of Computer Sciences, The University of Texas at Austin.
Evolving Neural Networks To Focus Minimax Search 1994
David E. Moriarty and Risto Miikkulainen, In Proceedings of the Twelfth National Conference on Artificial Intelligence (AAAI-94), pp. 1371-1377, Seattle, WA 1994. Cambridge, MA: MIT Press.
Grounding Robotic Control With Genetic Neural Networks 1994
Diane Law and Risto Miikkulainen, Technical Report AI94-223, Department of Computer Sciences, The University of Texas at Austin.
Searle, Subsymbolic Functionalism And Synthetic Intelligence 1994
Diane Law, Technical Report, Department of Computer Sciences, The University of Texas at Austin. Technical Report AI94-222.
Using Marker-Based Genetic Encoding Of Neural Networks To Evolve Finite-State Behaviour 1991
Brad Fullmer and Risto Miikkulainen, In Toward a Practice of Autonomous Systems: {P}roceedings of the First {E}uropean Conference on Artificial Life, Francisco J. Varela and Paul Bourgine (Eds.), pp. 255-262, Cambridge, MA 1991. ...
MM-NEAT Modular Multiobjective NEAT is a software framework in Java that builds on the basic principles of 2014

BREVE Monsters BREVE is a system for designing Artificial Life simulations available at http://spiderlan... 2010

OpenNERO OpenNERO is a general research and education platform for artificial intelligence. The platform is based on a simulatio... 2010