Publications

Books

Learning Classifier Systems:10th International Workshop, IWLCS 2006, Seattle, MA, USA, July 2006 and 11th International Workshop, IWLCS 2007, London, UK, July 2007 Revised Selected Papers
Bacardit, J., Bernadó-Mansilla, E., Butz, M.V., Kovacs, T., Llorà, X., & Takadama, K. (Eds.)
LNAI 4998. Springer-Verlag, Berlin, Germany (2008)

The Challenge of Anticipation: A Unifying Framework for the Analysis and Design of Artificial Cognitive Systems
Pezzulo, G., Butz, M. V., Castelfranchi, C. & Falcone, R. (Eds.)
LNAI 5225, Spring Verlag, Berlin, Germany (2008)

Anticipatory Behavior in Adaptive Learning Systems: From Brains to Individual and Social Behavior
Butz, M. V., Sigaud, O., Pezzulo, G., & Baldassarre, G. (Eds.)
LNAI 4520, Spring Verlag, Berlin, Germany (2007)

Rule-Based Evolutionary Online Learning Systems: A Principled Approach to LCS Analysis and Design
Butz, M. V.
Studies in Fuzziness and Soft Computing Series, Springer Verlag, Berlin, Germany (2006)

GECCO 2006: Proceedings of the 8th annual conference on genetic and evolutionary computation.
Keijzer, M., Cattolico, M., Arnold, D., Babovic, V., Blum, C., Bosman, P., Butz, M.V., Coello Coello, C., Dasgupta, D., Ficici, S.G., Foster, J., Hernandez-Aguirre, A., Hornby, G., Lipson, H., McMinn, P., Moore, J., Raidl, G., Rothlauf, F., Ryan, C., & Thierens, D. (Eds.).
ACM Press, Seattle, WA (2006)

Anticipatory Behavior in Adaptive Learning Systems: Foundations, Theories, and Systems
Butz, M. V., Sigaud, O., & Gérard, P.
LNAI 2684, Spring Verlag, Berlin (2003)

Anticipatory learning classifier systems
Butz, M. V.
Kluwer Academic Publishers, Boston, MA (2002).

Journals

Anticipatory Planning of Sequential Hand and Finger Movements.
Herbort, O. & Butz, M. V. (submitted).
Journal of Motor Behavior.

Integrating Dynamics into a Human Behavior Model For Highly Flexible Autonomous Manipulator Control.
Pedersen, G., Butz, M. V., & Herbort, O. (submitted).
IEEE Robotics and Automation, Special Issue on Cognitive Robotics.

How and Why the Brain Lays the Foundations for a Conscious Self. Target Article with Commentaries and Response.
Butz, M. V. (2008).
Constructivist Foundations, 4, 1-42.

Intentions and mirror neurons: From the individual to overall sorical reality.
Butz, M.V. (2008).
Constructivist Foundations, 3, 87-89.
(Commentary)

Sensomotorische Raumrepräsentationen
Butz, M.V. (2008).
Informatik-Spektrum, 31, 237-240.

Function Approximation with XCS: Hyperellipsoidal Conditions, Recursive Least Squares, and Compaction
Butz, M. V., Lanzi, Pier-Luca, & Wilson, Stewart W. (2008).
IEEE Transactions on Evolutionary Computation, 12, 355-376.

Problem Solution Sustenance in XCS: Markov Chain Analysis of Niche Support Distributions and Consequent Computational Complexity
Butz, M. V., Goldberg, David E., Lanzi, Pier-Luca, & Sastry, K. (2007).
Genetic Programming and Evolvable Machines, 8, 5-37.

Exploiting Redundancy for Flexible Behavior: Unsupervised Learning in a Modular Sensorimotor Control Architecture
Butz, M. V., Herbort, Oliver, & Hoffmann, Joachim
Psychological Review, 114, 1015-1046.

Explorations of anticipatory behavioral control (ABC): A report from the cognitive psychology unit of the University of Würzburg
Hoffmann, J., Berner, M., Butz, M.V., Herbort, O., Kiesel, A., Kunde, W., Lenhard, A. (2007).
Cognitive Processing, 8, 133-142.

Spekulationen zur Struktur ideo-motorischer Beziehungen
Hoffmann, J., Butz, M.V., Herbort, O., Kiesel, A., & Lenhard, A. (2007).
Zeitschrift für Sportpsychologie, 14, 95-104.

Automated Global Structure Extraction for Effective Local Building Block Processing
Butz, M. V., Pelikan, M., Llorà, Xavier,& Goldberg, David E.
Evolutionary Computation, 14, 345-380 (2006)

Gradient Descent Methods in Learning Classifier Systems: Improving XCS Performance in Multistep Problems
Butz, M.V., Goldberg, D.E.,& Lanzi, P.L.
IEEE Transactions on Evolutionary Computation, 9, 452-473 (2005).

Strong, Stable, and Reliable Fitness Pressure in XCS due to Tournament Selection
Butz, M. V., Sastry, Kumara,& Goldberg, David E.
Genetic Programming and Evolvable Machines, 6, 53-77 (2004).

Anticipation for Learning, Cognition, and Education
Butz, M. V.
On the Horizon, 12, 111-116 (2004).

Toward a Theory of Generalization and Learning in XCS
Butz, M. V., Kovacs, Tim, Lanzi, Pier Luca, & Wilson, Stewart W.
IEEE Transaction on Evolutionary Computation, 8, 28-46 (2004)

Analysis and Improvement of Fitness Exploitation in XCS: Bounding Models, Tournament Selection, and Bilateral Accuracy
Butz, M. V., Goldberg, David E.,& Tharakunnel, K.,
Evolutionary Computation, 11, 239-277 (2003)

Anticipations Control Behavior: Animal Behavior in an Anticipatory Learning Classifier System
Butz, M. V., & Hoffmann, Joachim
Adaptive Behavior, 10, 75-96

The anticipatory classifier system and genetic generalization
Butz, M. V., Goldberg, David E., & Stolzmann, Wolfgang
Natural Computing, 1(4) 427-467

Conferences and selected Workshop Papers and Book Chapters (see my CV for a full list - papers not listed upon request)

accepted, in press

Distinction Between Types of Motivations: Emergent Behavior with a Neural, Model-Based Reinforcement Learning System
Shirinov, E. & Butz, M. V. (in press).
IEEE ALife 2009: 2009 IEEE Symposium on Artificial Life.

2008

Context-dependent predictions and cognitive arm control with XCSF
Butz, M. V. & Herbort, O. (2008).
GECCO 2008: Genetic and Evolutionary Computation Conference, 1357-1364 (best paper award).

An analysis of matching in learning classifier systems.
Butz, M. V., Lanzi, P. L., Llorà, X., & Loiacono, D. (2008).
GECCO 2008: Genetic and Evolutionary Computation Conference, 1349-1356.

Self-adaptive mutation in XCSF.
Butz, M. V., Stalph, P., & Lanzi, P. L. (2008).
GECCO 2008: Genetic and Evolutionary Computation Conference, 1365-1372.

Towards increasing learning speed and robustness of XCSF: Experimenting with larger offspring set sizes.
Stalph, P. & Butz, M. V. (2008).
GECCO 2008: Genetic and Evolutionary Computation Conference, Workshop Proceedings IWLCS 2008.

Bridging the gap: Learning sensorimotor-linked population codes for planning and motor control.
Butz, M.V., Reif, K., & Herbort, O. (2008).
International Conference on Cognitive Systems (CogSys 2008).

Multimodal goal representations and feedback in hierarchical motor control.
Herbort, O., Butz, M. V., & Hoffmann, J. (2008).
International Conference on Cognitive Systems (CogSys 2008).

2007

Data mining in learning classifier systems: Comparing XCS with GAssist
Bacardit, J., & Butz, M.V. (2007).
In Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., & Wilson, S.W. (Eds.), Learning Classifier Systems: International Workshops, IWLCS 2003-2005, LNAI 4399. pp. 282-290, Berlin Heidelberg: Springer Verlag.

Improving the performance of a Pittsburgh learning classifier system using a default rule
Bacardit, J., Goldberg, D.E., & Butz, M.V. (2007).
In Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., & Wilson, S.W. (Eds.), Learning Classifier Systems: International Workshops, IWLCS 2003-2005, LNAI 4399. pp. 291-307, Berlin Heidelberg: Springer Verlag.

Combining Gradient-Based With Evolutionary Online Learning: An Introduction to Learning Classifier Systems
Butz, Martin V. (2007).
Seventh International Conference on Hybrid Intelligent Systems, 12-17.

Effect of pure error-based fitness in XCS
Butz, M.V., Lanzi, P.L., & Goldberg, D.E. (2007).
In Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., & Wilson, S.W. (Eds.), Learning Classifier Systems: International Workshops, IWLCS 2003-2005, LNAI 4399. pp. 104-114, Berlin Heidelberg: Springer Verlag.

Emergent effector-independent internal spaces: Adaptation and intermanual learning transfer in humans and neural networks
Butz, M.V., Lenhard, A., & Herbort, O. (2007).
Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August 12-17, 2007.

Anticipatory Behavior in Adaptive Learning Systems: From Brains to Individual and Social Behavior
Butz, M. V., Sigaud, O., Pezzulo, G., & Baldassarre, G. (Eds., 2007).
LNAI 4520 (State-of-the-Art Survey). Springer Verlag, Berlin-Heidelberg, Germany.

Anticipations, brains, individual and social behavior: An introduction to anticipatory systems
Butz, M.V., Sigaud, O., Pezzullo, G., & Baldassarre, G. (2007).
In Butz M.V., Sigaud O., Pezzulo G., & Baldassarre, G. (Eds.), Anticipatory Behavior in Adaptive Learning Systems: From Brains to Individual and Social Behavior. LNAI 4520, pp. 1-18, Berlin Heidelberg: Springer Verlag.

Encoding complete body models enables task dependent optimal behavior
Herbort, O., & Butz, M.V. (2007).
Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August 12-17, 2007.

Learning to select targets within targets in reaching tasks.
Herbort, O., Ognibene, O., Butz, M.V., & Baldassarre, G. (2007).
6th IEEE International Conference on Development and Learning (ICDL 2007), 7-12.

Empirical Analysis of Generalization and Learning in XCS with Gradient Descent
Lanzi, Pier-Luca, Butz, Martin V., & Goldberg, David E. (2007).
GECCO 2007.

From Actions to Goals and Vice-versa: Theoretical Analysis and Models of the Ideomotor Principle and TOTE
Pezzullo, G., Baldassarre, G., Butz, M. V., Castelfranchi, C., & Hoffmann, J. (2007).
In Butz M.V., Sigaud O., Pezzulo G., & Baldassarre, G. (Eds.), Anticipatory Behavior in Adaptive Learning Systems: From Brains to Individual and Social Behavior. LNAI 4520, pp. 73-93, Berlin Heidelberg: Springer Verlag.

2006

An Analysis of the Ideomotor Principle and TOTE
Pezzulo, G., Baldassarre, G., Butz, M. V., Castelfranchi, C., & Hoffmann, J.
Proceedings of the Third Workshop on Anticipatory Behavior in Adaptive Learning Systems (ABiALS 2006) (2006)

Performance of Evolutionary Algorithms on Random Decomposable Problems
Pelikan, M., Sastry, K., Butz, M., Goldberg, D. E.
Parallel Problem Solving from Nature - PPSN IX (2006)
pp. 788-797

Substructural Neighborhoods for Local Search in the Bayesian Optimization Algorithm
Lima, C. F., Pelikan, M., Sastry, K., Butz, M. V., Goldberg, D. E., & Lobo, F. G.
Parallel Problem Solving from Nature - PPSN IX (2006)
pp. 232-241

Hyper-ellipsoidal conditions in XCS: Rotation, linear approximation, and solution structure
Butz, M. V., Lanzi, Pier-Luca, & Wilson, S. W.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2006) (2006)
pp. 1457-1464

Studying XCS/BOA learning in Boolean functions: Structure encoding and random boolean functions
Butz, M. V., & Pelikan, M.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2006) (2006)
pp. 1449-1456

Unsupervised Learning of Inverse Dynamics Model
Herbort, O., & Butz, M. V.
CogSys II Radboud University Nijmegen, The Netherlands 12-13 April (2006)
p. 45

2005

Towards an Adaptive Hierarchical Anticipatory Behavioral Control System
Herbort, O., Butz, M.V.,& Hoffmann, J. AAAI Fall Symposium
(2005).

Kernel-based, ellipsoidal conditions in the real-valued XCS classifier system
Butz, M.V.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2005)
pp. 1835-1842.

Extracted global structure makes local building block processing effective in XCS
Butz, M.V., Goldberg, David E.,& Lanzi, P.L.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2005)
pp. 655-662.

Computational complexity of the XCS classifier system
Butz, M.V., Goldberg, David E.,& Lanzi, P.L.
In Bull, L., Kovacs, T. (Eds.) Foundations of Learning Classifier Systems
pp. 91-126 (2005).

Towards the Advantages of Hierarchical Anticipatory Behavioral Control
Herbort, O., Butz, M.V.,& Hoffmann, J. Kog Wis 2005, Schwabe, 77-82, (2005).

2004

Toward a Cognitive Sequence Learner: Hierarchy, Self-Organization, and Top-down Bottom-up Interaction
Butz, M. V.
IlliGAL report 2004021, University of Illinois at Urbana-Champaign (2004)

Knowledge extraction and problem structure identification in XCS
Butz, M.V., Lanzi, P.L., Llorà, X., Goldberg, D.E.
Parallel Problem Solving from Nature - PPSN VIII, LNCS 3242
pp. 1051-1060. Springer Verlag, Berlin (2004)

Speeding-Up Pittsburgh Learning Classifier Systems: Modeling Time and Accuracy
Bacardit, J., Goldberg, D.E., Butz, M.V., Llorà, X.,& Garrell, J.M.
Parallel Problem Solving from Nature - PPSN VIII, LNCS 3242
pp. 1021-1031. Springer Verlag, Berlin (2004)

Gradient-based Learning Updates Improve XCS Performance in Multistep Problems
Butz, M. V., Goldberg, David E.,& Lanzi, Pier Luca
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004)
pp. 751-762. Springer Verlag, Berlin (2004)

Bounding Learning Time in XCS
Butz, M. V., Goldberg, David E.,& Lanzi, Pier Luca
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004)
pp. . Springer Verlag, Berlin (2004)

Effective Online Detection of Task-Independent Landmarks
Butz, M. V., Samarth, Swarup ,& Goldberg, David E.
IlliGAL report 2004002, University of Illinois at Urbana-Champaign (2004)

2003

Gradient Descent Methods in Learning Classifier Systems
Butz, M. V., Goldberg, David E.,& Lanzi, Pier Luca
IlliGAL report 2003028, University of Illinois at Urbana-Champaign (2003)

Documentation of XCS+TS C-Code 1.2
Butz, M. V.
IlliGAL report 2003023, University of Illinois at Urbana-Champaign

Anticipatory Behavior: Exploiting Knowledge About the Future to Improve Current Behavior
Butz, M. V., Sigaud, Olivier,& Gérard, Pierre,
In Anticipatory Behavior in Adaptive Learning Systems: Foundations, Theories, and Systems (LNCS 2684), Butz, M. V., Sigaud, Olivier,& Gérard, Pierre (Eds.), 1-10, Springer Verlag, Berlin (2003)

Internal Models and Anticipations in Adaptive Learning Systems
Butz, M. V., Sigaud, Olivier,& Gérard, Pierre,
In Anticipatory Behavior in Adaptive Learning Systems: Foundations, Theories, and Systems (LNCS 2684), Butz, M. V., Sigaud, Olivier,& Gérard, Pierre (Eds.), 86-109, Springer Verlag, Berlin (2003)

Generalized State Values in an Anticipatory Learning Classifier System
Butz, M. V.,& Goldberg, David E.,
In Anticipatory Behavior in Adaptive Learning Systems: Foundations, Theories, and Systems (LNCS 2684), Butz, M. V., Sigaud, Olivier,& Gérard, Pierre (Eds.), 282-301, Springer Verlag, Berlin (2003)

Bidirectional ARTMAP: An artificial mirror neuron system
Butz, M. V. and Ray, Sylvian
Proceedings of the International Joint Conference on Neural Networks (2003), 1417-1422, (2003)

Bounding the Population Size in XCS to Ensure Reproductive Opportunities
Butz, M. V., & Goldberg, David E.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2003)
pp. 1844-56. Springer Verlag, Berlin (2003)

Towards Building Block Propagation in XCS: A Negative Result and Its Implications
Tharakunnel, Kurian, Butz, M. V., & Goldberg, David E.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2003)
pp. 1906-1917. Springer Verlag, Berlin (2003)

Tournament Selection in XCS
Butz, M. V., Sastry, Kumara,& Goldberg, David E.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2003)
pp. 1857-1869. Springer Verlag, Berlin (2003)


2002

Internal Models and Anticipations in Adaptive Learning Systems
Butz, M. V., Sigaud, Olivier, & Gérard, Pierre
From Animals to Animats 7: The seventh international conference on the Simulation of Adaptive Behavior (SAB 2002). Workshop Proceedings Adaptive Behavior in Anticipatory Learning Systems (ABiALS 2002)

Generalized State Values in an Anticipatory Learning Classifier System
Butz, M. V., & Goldberg, David E.
From Animals to Animats 7: The seventh international conference on the Simulation of Adaptive Behavior (SAB 2002). Workshop Proceedings Adaptive Behavior in Anticipatory Learning Systems (ABiALS 2002)

State Value Learning with an Anticipatory Learning Classifier System in a Markov Decision Process
Butz, M. V.
Technical Report 2002018 at the Illinois Genetic Algorithms Laboratory

An Algorithmic Description of ACS2
Butz, M. V., & Stolzmann, Wolfgang
In Lanzi, P. L., Stolzmann, W., and S. W. Wilson (Eds.),
Advances in Learning Classifier Systems: 4th International Workshop, IWLCS 2001
pp. 211-230. Berlin: Springer-Verlag (2002).

Biasing exploration in an anticipatory learning classifier system
Butz, M. V.
Advances in Learning Classifier Systems: 4th International Workshop, IWLCS 2001
pp. 3-22. Berlin: Springer-Verlag (2001).

2001

An implementation of the anticipatory classifier system ACS2 in C++
Butz, M. V.
Technical Report 2001026 at the Illinois Genetic Algorithms Laboratory

Analyzing the evolutionary pressures in XCS
Butz, M. V., & Pelikan, M.
In Spector, Lee and Goodman, Erik D. and Wu, Annie and Langdon, W. B. and Voigt, Hans-Michael and Gen, Mitsuo and Sen, Sandip and Dorigo, Marco and Pezeshk, Shahram and Garzon, Max H. and Burke, Edmund
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001)
pp. 935-942. San Francisco, CA: Morgan Kaufmann (2001)

How XCS evolves accurate classifiers
Butz, M. V., Kovacs, Tim, Lanzi, Pier-Luca, & Wilson, Stewart W.
In Spector, Lee and Goodman, Erik D. and Wu, Annie and Langdon, W. B. and Voigt, Hans-Michael and Gen, Mitsuo and Sen, Sandip and Dorigo, Marco and Pezeshk, Shahram and Garzon, Max H. and Burke, Edmund
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001)
pp. 927-934. San Francisco, CA: Morgan Kaufmann (2001).

An Algorithmic Description of XCS
Butz, M. V., & Wilson, Stewart W.
In Lanzi, P. L., Stolzmann, W., and S. W. Wilson (Eds.), Advances in Learning Classifier Systems, LNAI 1996,
pp. 253-272. Berlin: Springer-Verlag (2001)
also
Soft Computing, 6, 144-153 (2002)

Probability-Enhanced Predictions in the Anticipatory Classifier System
Butz, M. V., Goldberg, David E., & Stolzmann, Wolfgang
In Lanzi, P. L., Stolzmann, W., and S. W. Wilson (Eds.), Advances in Learning Classifier Systems, LNAI 1996,
pp. 37-51. Berlin: Springer-Verlag (2001).

2000

XCSJava1.0: An Implementation of the XCS classifier system in Java
Butz, M. V.
Technical Report 2000027 at the Illinois Genetic Algorithms Laboratory

Investigating Generalization in the Anticipatory Classifier System
Butz, M. V., Goldberg, David E., & Stolzmann, Wolfgang
Proceedings of the sixth international conference on Parallel Problem Solving from Nature (PPSN2000)

First Cognitive Capabilities in the Anticipatory Classifier System
Stolzmann, Wolfgang, Butz, M. V., Hoffmann, Joachim, & Goldberg, David E.
Proceedings of the sixth international conference on the Simulation of Adaptive Behavior (SAB2000)

Introducing a genetic generalization pressure to the Anticipatory Classifier System Part 2: Performance Analysis
Butz, M. V., Goldberg, David E., & Stolzmann, Wolfgang
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2000)

Introducing a genetic generalization pressure to the Anticipatory Classifier System Part 1: Theoretical approach
Butz, M. V., Goldberg, David E., & Stolzmann, Wolfgang
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2000)

1999

An Implementation of the XCS classifier system in C
Butz, M. V.

New Challenges for an Anticipatory Classifier System: Hard Problems and Possible Solutions
Butz, M. V., Goldberg, David E., & Stolzmann, Wolfgang

Action-Planning in Anticipatory Classifier Systems
Butz, M. V., & Stolzmann, Wolfgang


submitted, accepted, in press

Anticipatory Planning of Sequential Hand and Finger Movements.
Herbort, O. & Butz, M. V. (submitted).
Journal of Motor Behavior.

Abstract Hand movements may be anticipatorily planned to reach an immediate target and at the same time facilitate movements to subsequent targets. It has been proposed that in anticipatory planning, information about subsequent targets needs to be processed to engage in the planning of the very next movement. To test this hypothesis, we varied the information 48 participants had about to be executed two-step hand and nger movement sequences prior to a choice reaction signal. Movements were initialized faster if participants had advance information about the second target of the sequence than if participants had no advance information at all. The results imply that information about subsequent targets can be processed independent of the preparation of the forthcoming movement. Keywords: planning, movement sequences, anticipation

Paper upon request.


Integrating Dynamics into a Human Behavior Model For Highly Flexible Autonomous Manipulator Control
Pedersen, G., Butz, M. V., & Herbort, O. (submitted).
IEEE Robotics and Automation Magazine, Special Issue on Cognitive Robotics.

Abstract While industrial robotic systems are usually highly optimized to solve one particular task, we humans can solve various behavioral tasks highly flexibly and adaptively adhering to diverse task constraints. To design autonomous robot control systems that can be flexibly applied to various tasks and under varying task constraints, we propose to apply SURE_REACH---a psychologically and neurophysiologically plausible neural model of human motor behavior in arm reaching tasks. In this paper, we show how to extend SURE_REACH to achieve flexible, dynamic control of the Kuka KR16 industrial robot arm. We first introduce the functionality of SURE_REACH, enhance the architecture for the control of dynamic robot systems, and show promising results on the Kuka arm in a realistic simulation framework. In particular, we show that identical tasks can be flexibly achieved while adhering to particular task constraints such as obstacle avoidance or end-posture priorities. In sum, we propose the application of a neural model of human behavior to autonomous robot control tasks that demand behavioral flexibility and adaptivity.

Paper upon request.


Distinction Between Types of Motivations: Emergent Behavior with a Neural, Model-Based Reinforcement Learning System
Shirinov, E. & Butz, M. V. (in press).
IEEE ALife 2009: 2009 IEEE Symposium on Artificial Life.

Abstract In this paper, we analyze the behavior of a simulated mobile robot, which interacts with an initially unknown maze-environment. The robot is controlled by an interactive system that is based on a model building Time Growing Neural Gas (TGNG) algorithm and a homeostatic motivational system, which activates movement preferences and goals within the emergent model structure for behavioral control. We propose to differentiate two types of drives (if not more), which we call location- and characteristics-based drives. We exemplary implement the two types of drives by -Y¡hunger¡ and ¡fear¡, respectively. Several possible methods of combination of the two drives are investigated through simulation, identifying the combination that lead to the most suitable emergent behavior, such as emergent ¡wall-following¡ and ¡hiding¡. Moreover, we investigate performance in an ALife-like scenario, in which the robot interacts with several food-dispensers. It is shown that additional behavioral concepts, such as ¡curiosity¡ and ¡inhibition of return¡, can maximize the survival chances of the organism, who maintains maximal safety and keeps its belly full. In conclusion, we propose that the concept of motivation needs to be further differentiated to realize autonomous, lifelike robots that are able to optimally satisfy multiple, competing types of motivations by emergent, innovative behavioral patterns.


2008


How and Why the Brain Lays the Foundations for a Conscious Self.
Target Article with commentaries by Marco Bettoni, Ernst von Glasersfeld, Humberto R. Maturana, Günter Neumann, Mathias Osvath, Giovanni Pezzulo & Cristiano Castelfranchi, Martina Rieger, Ricarda Schubotz, John Stewart, Samarth Swarup, Jun Tani, and John G. Taylor
Butz, M. V. (2008).
Constructivist Foundations, 4, 1-42.

Abstract
Purpose: Constructivism postulates that the perceived reality is a complex construct formed during development. Depending on the particular school, these inner constructs take on different forms and structures and affect cognition in different ways. The purpose of this article is to address the questions of how and, even more importantly, why we form such inner constructs. Approach: This article proposes that brain development is controlled by an inherent anticipatory drive, which biases learning towards the formation of forward predictive structures and inverse goal-oriented control structures. This drive, in combination with increasingly complex environmental interactions during cognitive development, enforces the structuring of our conscious self, which is embedded in a constructed inner reality. Essentially, the following questions are addressed: Which basic mechanisms lead us to the construction of inner realities? How are these emergent inner realities structured? How is the self represented within the inner realities? And consequently, which cognitive structures constitute the media for conscious thought and selfconsciousness? Findings: Due to the anticipatory drive, representations in the brain shape themselves predominantly purposefully or intentionally. Taking a developmental, evolutionary perspective, we show how the brain is forced to develop progressively complex and abstract representations of the self embedded in the constructed inner realities. These self representations can evoke different stages of self-consciousness. Implications: The anticipatory drive shapes brain structures and cognition during the development of progressively more complex, competent, and flexible goal-oriented bodyenvironment interactions. Self-consciousness develops because increasingly abstract, individualizing self representations are necessary to realize these progressively more challenging environmental interactions. Key words: Anticipatory drive, self consciousness, mirror neurons, sensorimotor bodyspaces, language, social cognition.


Sensomotorische Raumrepräsentationen
Butz, M. V. (2008).
Informatik-Spektrum, 31, 237-240.

Abstract In der Künstlichen Intelligenz (KI) und Robotik werden verschiedenste Raumrepräsentationen genutzt, um mit der Umwelt zu interagieren, Pläne zu erstellen oder Verhaltensentscheidungen zu treffen. Traditionelle Ansätze kodieren dabei Räume eher abstrakt und allozentrisch. Im Gegensatz dazu suggerieren neurowissenschaftliche und kognitionspsychologische Studien, dass das Gehirn Räume viel verhaltensorientierter und egozentrischer repräsentiert.


Bridging the gap: Learning sensorimotor-linked population codes for planning and motor control. Butz, M.V., Reif, K., & Herbort, O. (2008).
International Conference on Cognitive Systems (CogSys 2008).

Abstract Humans and animals are able to flexibly learn internal, cognitive maps of their environments and are able to use these maps to approach goals efficiently, reliably, and flexibly. Recent neuroscientific evidence suggests that such maps are formed in the hippocampus by means of interconnected place, view, and head direction cell encodings. This paper presents a neural learning architecture that develops an interconnected population code of place cells during random exploration. Connections develop dependent on the experienced sensorimotor contingencies. The learned spatial representation enables the agent to flexibly plan shortest paths to any goal location within the explored environment by means of dynamic programming. It approaches activated goal locations by means of closed loop control. While the algorithm currently relies on the Markov property, it is able to connect the network over radical sensory changes as long as they are close in sensorimotor distance. Moreover, the agent is able to flexibly adjust its behavior dependent on current constraints without further learning. This paper introduces the algorithm and evaluates its robustness and consequent behavioral flexibility.


Function Approximation with XCS: Hyperellipsoidal Conditions, Recursive Least Squares, and Compaction
Butz, M. V., Lanzi, Pier-Luca, & Wilson, Stewart W.
IEEE Transactions on Evolutionary Computation (2008), 12, 355-376.

Abstract An important strength of learning classifier systems (LCSs) lies in the combination ofAn important strength of learning classifier systems (LCSs) lies in the combination of genetic optimization techniques with gradient-based approximation techniques. The chosen approximation technique develops locally optimal approximations, such as accurate classification estimates, Q-value predictions, or linear function approximations. The genetic optimization technique is designed to distribute these local approximations efficiently over the problem space. Together, the two components develop a distributed, locally optimized problem solution in the form of a population of expert rules, often called classifiers. In function approximation problems, the XCSF classifier system develops a problem solution in the form of overlapping, piecewise linear approximations. This paper shows that XCSF performance on function approximation problems additively benefits from (1) improved representations, (2) improved genetic operators, and (3) improved approximation techniques. Additionally, this paper introduces a novel closest classifier matching mechanism for the efficient compaction of XCS's final problem solution. The resulting compaction mechanism can boil the population size down by 90% on average, while decreasing prediction accuracy only marginally. Performance evaluations show that the additional mechanisms enable XCSF to reliably, accurately, and compactly approximate even seven dimensional functions. Performance comparisons with other, heuristic function approximation techniques show that XCSF yields competitive or even superior noise-robust performance.

Paper upon request.


Context-dependent predictions and cognitive arm control with XCSF
Butz, M. V. & Herbort, O.
GECCO 2008: Genetic and Evolutionary Computation Conference, 1357-1364.

Abstract While John Holland has always envisioned learning classifier systems (LCSs) as cognitive systems, most work on LCSs has focused on classification, datamining, and function approximation. In this paper, we show that the XCSF classifier system can be very suitably modified to control a robot system with redundant degrees of freedom, such as a robot arm. Inspired by recent research insights that suggest that sensorimotor codes are nearly ubiquitous in the brain and an essential ingredient for cognition in general, the XCSF system is modified to learn classifiers that encode piecewise linear sensorimotor structures, which are conditioned on prediction-relevant contextual input. In the investigated robot arm problem, we show that XCSF partitions the (contextual) posture space of the arm in such a way that accurate hand movements can be predicted given particular motor commands. Furthermore, we show that the inversion of the sensorimotor predictive structures enables accurate goal-directed closed-loop control of arm reaching movements. Besides the robot arm application, we also investigate performance of the modified XCSF system on a set of artificial functions. All results point out that XCSF is a useful tool to evolve problem space partitions that are maximally effective for the encoding of sensorimotor dependencies. A final discussion elaborates on the relation of the taken approach to actual brain structures and cognitive psychology theories of learning and behavior.


An analysis of matching in learning classifier systems.
Butz, M. V., Lanzi, P. L., Llorà, X., & Loiacono, D.
GECCO 2008: Genetic and Evolutionary Computation Conference, 1349-1356.

Abstract We investigate rule matching in learning classifier systems for problems involving binary and real inputs. We consider three rule encodings: the widely used character-based encoding, a specificity-based encoding, and a binary encoding used in Alecsys. We compare the performance of the three algorithms both on matching alone and on typical test problems. The results on matching alone show that the population generality influences the performance of the matching algorithms based on string representations in different ways. Character-based encoding becomes slower and slower as generality increases, specificity-based encoding becomes faster and faster as generality increases. The results on typical test problems show that the specificity-based representation can halve the time required for matching but also that binary encoding is about ten times faster on the most difficult problems. Moreover, we extend specificity-based encoding to real-inputs and propose an algorithm that can halve the time require for matching real inputs using an interval-based representation.


Self-adaptive mutation in XCSF.
Butz, M. V., Stalph, P., & Lanzi, P. L.
GECCO 2008: Genetic and Evolutionary Computation Conference, 1365-1372.

Abstract Recent advances in XCS technology have shown that self-adaptive mutation can be highly useful to speed-up the evolutionary progress in XCS. Moreover, recent publications have shown that XCS can also be successfully applied to challenging real-valued domains including datamining, function approximation, and clustering. In this paper, we combine these two advances and investigate self-adaptive mutation in the XCS system for function approximation with hyperellipsoidal condition structures, referred to as XCSF in this paper. It has been shown that XCSF solves function approximation problems with an accuracy, noise robustness, and generalization capability comparable to other statistical machine learning techniques and that XCSF outperforms simple clustering techniques to which linear approximations are added. This paper shows that the right type of self-adaptive mutation can further improve XCSF's performance solving problems more parameter independent and more reliably. We analyze various types of self-adaptive mutation and show that XCSF with self-adaptive mutation ranges, differentiated for the separate classifier condition values, yields most robust performance results. Future work may further investigate the properties of the self-adaptive values and may integrate advanced self-adaptation techniques.


Towards increasing learning speed and robustness of XCSF: Experimenting with larger offspring set sizes.
Stalph, P. & Butz, M. V.
GECCO 2008: Genetic and Evolutionary Computation Conference, Workshop Proceedings IWLCS 2008.

Abstract The XCS classifier system has been successfully applied to various problem domains including datamining, boolean classifications, and function approximation. In all these applications just two classifiers were reproduced in a match or action set, given a time-recency threshold was met in the set. In this paper, we investigate the effect of selecting more than two classifiers for reproduction in XCSF. We either increase the number of selected classifiers or select a number of classifiers relative to the current match set size. In the functions investigated, both approaches showed a highly significant increase in initial learning speed. Also, in less challenging approximation tasks, the final accuracy reached is not affected by the approach. However, in harder functions, learning may stall due to over-reproductions of inaccurate, ill-estimated classifiers. Thus, we propose an adaptive offspring size rate that may depend on the current reliability of classifier parameter estimates. First results with a fixed offspring set size decrement show promising results. Future work is needed to speed-up XCS's learning progress and adjust its learning speed to the perceived problem difficulty.


Multimodal goal representations and feedback in hierarchical motor control.
Herbort, O., Butz, M. V., & Hoffmann, J.
International Conference on Cognitive Systems (CogSys 2008).

Abstract The capabilities of human motor behavior build on the integration of multiple sensory modalities in goal representation and feedback processing. Here, we present a hierarchical neural network model of motor control to simulate these capabilities, based on the SURE REACH model. The model is able to integrate visual and proprioceptive goal representations, but, by now, relies only on proprioceptive feedback to control ongoing movements. Here, we extend the model to a neural network that processes both, proprioceptive and visual feedback. In simulated reaching experiments we demonstrate that visual feedback considerably enhances the accuracy of the original controller. Moreover, the ability to combine visual and proprioceptive goal representations, or to adjust behavior to task-specific constraints is not affected. Finally, we discuss the results, propose further enhancements, and outline the model’s relevance for other domains of human cognition.


2007


Exploiting Redundancy for Flexible Behavior: Unsupervised Learning in a Modular Sensorimotor Control Architecture
Butz, M. V., Herbort, Oliver, & Hoffmann, Joachim (2007).
Psychological Review, 114, 1015-1046.

Abstract Autonomously developing learning organisms face several problems when learning reaching movements, such as reaching for a glass. First, motor control has to be learned unsupervised, or self-supervised. Second, sensorimotor contingencies have to be acquired in contexts in which action consequences unfold over time. Third, motor redundancies need to be resolved. The proposed sensorimotor, unsupervised, redundancy-resolving control architecture (SURE_REACH) solves these problems self-supervised, based on the ideomotor principle. SURE_REACH encodes hand end-point coordinate space and angular posture space with population codes. A posture memory solves the inverse kinematics by associating end-point space activities with posture space activities. An inverse sensorimotor model associates posture transitions to motor commands. The population encoding, redundant posture memory, and the inverse sensorimotor model enable the architecture to learn sensorimotor grounded distance measures and to use dynamic programming to reach goals flexibly and efficiently. The architecture does not only solve the redundancy problem, but significantly increases goal approaching flexibility, accounting for additional task constraints or exhibiting obstacle avoidance. While the spatial population codes resemble neurophysiological structures, simulations on a three degree of freedom arm in a 2-D environment confirm the plausibility and flexibility of the model, mimicking various previously published behavior data on arm reaching tasks.

Paper upon request.


Learning to select targets within targets in reaching tasks.
Herbort, O., Ognibene, O., Butz, M.V., & Baldassarre, G. (2007).
6th IEEE International Conference on Development and Learning, ICDL 2007, 7 - 12.

Abstract We present a developmental neural network model of motor learning and control, called RL_SURE_REACH. In a childhood phase, a motor controller for goal directed reaching movements with a redundant arm develops unsupervisedly. In subsequent task-specific learning phases, the neural network acquires goal-modulation skills. These skills enable RL_SURE_REACH to master a task that was used in a psychological experiment by Trommershäuser, Maloney, and Landy (2003). This task required participants to select aimpoints within targets that maximize the likelihood of hitting a rewarded target and minimizes the likelihood of accidentally hitting an adjacent penalty area. The neural network acquires the necessary skills by means of a reinforcement learning based modulation of the mapping from visual representations to the target representation of the motor controller. This mechanism enables the model to closely replicate the data from the original experiment. In conclusion, the effectiveness of learned actions can be significantly enhanced by fine-tuning action selection based on the combination of information about the statistical properties of the motor system with different environmental payoff scenarios.


Combining Gradient-Based With Evolutionary Online Learning: An Introduction to Learning Classifier Systems
Butz, Martin V. (2007).
Seventh International Conference on Hybrid Intelligent Systems (HIS 2007), 12-17.

Abstract Learning Classifier Systems (LCSs), introduced by John H. Holland in the 1970s, are rule-based evolutionary online learning systems that combine gradient-based rule evaluation with evolutionary-based rule structuring techniques. Since the introduction of the accuracy-based XCS classi- fier system by Stewart W. Wilson in 1995, LCSs showed to be flexible, online learning methods that are applicable to datamining, reinforcement learning, and function approximation problems. Comparisons showed that performance is competitive with state-of-the art machine learning algorithms, but the learning algorithms applied are usually more flexible and highly adaptive. Moreover, problem knowledge can be extracted easily. This tutorial provides a gentle introduction to LCSs and their general functioning. It then gives further details on the XCS classifier system and highlights various successful applications. In conclusion, promising future directions of LCS research and applications are discussed.

Paper.


Anticipatory Behavior in Adaptive Learning Systems: From Brains to Individual and Social Behavior
Butz, M. V., Sigaud, O., Pezzulo, G., & Baldassarre, G. (Eds., 2007).
LNAI 4520 (State-of-the-Art Survey). Springer Verlag, Berlin-Heidelberg, Germany.

Preface The matter of anticipatory behavior in adaptive learning systems is steadily gaining interest, although many researchers still do not explicitly consider the actual anticipatory capabilities of their systems. Similarly to the previous two workshops, the third workshop on anticipatory behavior in adaptive learning systems (ABiALS 2006) has shown yet again that the similarities between different anticipatory mechanisms in diverse cognitive systems are striking. The discussions and presentations on the workshop day on the 30th of September 2006 during the Simulation of Adaptive Behavior Conference (SAB 2006) confirmed that the investigations into anticipatory cognitive mechanisms for behavior and learning strongly overlap among researchers from various disciplines, including the whole interdisciplinary cognitive science area.
Thus, further conceptualizations of anticipatory mechanisms seem mandatory. The introductory chapter of this volume therefore does not only provide an overview over the contributions included in this volume but also proposes a taxonomy of how anticipatory mechanisms can improve adaptive behavior and learning in cognitive systems. During the workshop it became clear that anticipations are involved in various cognitive processes that range from individual anticipatory mechanisms to social anticipatory behavior. This book reflects this structure by first providing neuroscientific as well as psychological evidence for anticipatory mechanisms involved in behavior, learning, language, and cognition. Next, individual predictive capabilities and anticipatory behavior capabilities are investigated. Finally, anticipation relevant in social interaction is studied.
Anticipatory behavior research on cognitive, adaptive systems aims at exploiting the insights gained from neuroscience, linguistics, and psychology for the improvement of behavior and learning in artificial cognitive systems. However, this knowledge exchange is expected to become increasingly bidirectional. That is, the insights gained during the design and evaluation of different anticipatory cognitive mechanisms and architectures may also provide insights into how anticipatory mechanisms can actually shape, guide, and control natural brain activity. This book reveals many interesting and thought-provoking connections between distinct cognitive science areas. We strongly hope that these connections do not only lead to a deeper understanding of the functioning of anticipatory processes but also enable a more effective, bidirectional knowledge exchange and consequently more effective scientific progress in the natural and artificial cognitive systems research disciplines.

Book available.


Anticipations, brains, individual and social behavior: An introduction to anticipatory systems
Butz, M.V., Sigaud, O., Pezzullo, G., & Baldassarre, G. (2007).
In Butz M.V., Sigaud O., Pezzulo G., & Baldassarre, G. (Eds.), Anticipatory Behavior in Adaptive Learning Systems: From Brains to Individual and Social Behavior. LNAI 4520, pp. 1-18, Berlin Heidelberg: Springer Verlag.

Abstract Research on anticipatory behavior in adaptive learning systems continues to gain more recognition and appreciation in various research disciplines. This book provides an overarching view on anticipatory mechanisms in cognition, learning, and behavior. It connects the knowledge from cognitive psychology, neuroscience, and linguistics with that of artificial intelligence, machine learning, cognitive robotics, and others. This introduction offers an overview over the contributions in this volume highlighting their interconnections and interrelations from an anticipatory behavior perspective. We first clarify the main foci of anticipatory behavior research. Next, we present a taxonomy of how anticipatory mechanisms may be beneficially applied in cognitive systems. With relation to the taxonomy, we then give an overview over the book contributions. The first chapters provide surveys on currently known anticipatory brain mechanisms, anticipatory mechanisms in increasingly complex natural languages, and an intriguing challenge for artificial cognitive systems. Next, conceptualizations of anticipatory processes inspired by cognitive mechanisms are provided. Subsequent chapters address predictive challenges in vision and the processing of event correlations over time. Next, anticipatory mechanisms in individual decision making and behavioral execution are studied. Finally, the book offers systems and conceptualizations of anticipatory processes involved in social interaction.

Paper


From Actions to Goals and Vice-versa: Theoretical Analysis and Models of the Ideomotor Principle and TOTE.
Pezzullo, G., Baldassarre, G., Butz, M. V., Castelfranchi, C., & Hoffmann, J. (2007).
In Butz M.V., Sigaud O., Pezzulo G., & Baldassarre, G. (Eds.), Anticipatory Behavior in Adaptive Learning Systems: From Brains to Individual and Social Behavior. LNAI 4520, pp. 73-93, Berlin Heidelberg: Springer Verlag.

Abstract How can goals be represented in natural and artificial systems? How can they be learned? How can they trigger actions? This paper describes, analyses and compares two of the most influential models of goal-oriented behavior: the ideomotor principle (IMP), which was introduced in the psychological literature, and the “test, operate, test, exit” model (TOTE), proposed in the field of cybernetics. This analysis indicates that the IMP and the TOTE highlight complementary aspects of goal-orientedness. In order to illustrate this point, the paper reviews three computational architectures that implement various aspects of the IMP and the TOTE, discusses their main peculiarities and limitations, and suggests how some of their features can be translated into specific mechanisms in order to implement them in artificial intelligent systems.

Paper


Encoding complete body models enables task dependent optimal behavior.
Herbort, Oliver, & Butz, Martin V. (2007).
Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August 12-17, 2007, 1424-1429.

Abstract Many neural network models of (human) motor learning focus on the acquisition of direct goal-to-action mappings, thus rendering motor control inflexible. We propose a neural network architecture (SURE_REACH) that acquires complete body models by unsupervised learning. It encodes redundancy on the kinematic and on the motor command level to exert highly flexible, task-dependent optimal control. This paper shows that our approach accounts for two forms of effective human behavior based on exploiting kinematic redundancy. First, depending on the starting posture, hand targets are pursued in different ways optimizing movement efficiency. Second, the arm posture at the end of a movement can be aligned anticipatorily to facilitate a subsequent movement. A discussion of computational implications and relations to behavioral and neurophysiological findings concludes the paper.

Paper


Emergent effector-independent internal spaces: Adaptation and intermanual learning transfer in humans and neural networks.
Butz, M.V., Lenhard, A., & Herbort, O. (2007).
Proceedings of International Joint Conference on Neural Networks, Orlando, Florida, USA, August 12-17, 2007, 1509-1514.

Abstract Psychological studies have shown immense behavioral plasticity in arm reaching tasks. Intermanual learning transfer (ILT) tasks have shown that both reaching movements adapt to distorted spaces rather rapidly and the adaptation generalizes to the behavior of other limbs. In this paper, we present an ILT experiment and replicate it with feedforward neural network (NN) architectures. We show that the NN architecture is the key to successfully replicating the experiments. Moreover, we show that dependent on the architecture and the initial training schedule applied, an internal space representation emerges that enables ILT. The results confirm that internal body spaces, identified in neuroscience and cognitive psychological research, can emerge solely due to an interdependence between different limb movements and the right neural architecture. We hypothesize that, in order to develop internal spatial representations observed in animals and humans, it might be sufficient to enforce the integration of multiple correlated sensory and motor information into one compact internal representation.

Paper


Empirical Analysis of Generalization and Learning in XCS with Gradient Descent.
Lanzi, Pier-Luca, Butz, Martin V., & Goldberg, David E. (2007)
GECCO 2007: Genetic and Evolutionary Computation Conference, 1814-1821.

Abstract We analyze generalization and learning in XCS with gradient descent. At first, we show that the addition of gradient in XCS may slow down learning because it indirectly decreases the learning rate. However, in contrast to what was suggested elsewhere, gradient descent has no effect on the achieved generalization. We also show that when gradient descent is combined with roulette wheel selection, which is known to be sensitive to small values of the learning rate, the learning speed can slow down dramatically. Previous results reported no difference in the performance of XCS with gradient descent when roulette wheel selection or tournament selection were used. In contrast, we suggest that gradient descent should always be combined with tournament selection, which is not sensitive to the value of the learning rate. When gradient descent is used in combination with tournament selection, the results show that (i) the slowdown in learning is limited and (ii) the generalization capabilities of XCS are not affected.

Paper


Data mining in learning classifier systems: Comparing XCS with GAssist
Bacardit, J., & Butz, M.V. (2007).
In Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., & Wilson, S.W. (Eds.), Learning Classifier Systems: International Workshops, IWLCS 2003-2005, LNAI 4399. pp. 282-290, Berlin Heidelberg: Springer Verlag.

Abstract This paper compares performance of the Pittsburgh-style system GAssist with the Michigan-style system XCS on several datamining problems. Our analysis shows that both systems are suitable for datamining but have different advantages and disadvantages. The study does not only reveal important differences between the two systems but also suggests several structural properties of the underlying datasets.

Paper


Improving the performance of a Pittsburgh learning classifier system using a default rule
Bacardit, J., Goldberg, D.E., & Butz, M.V. (2007).
In Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., & Wilson, S.W. (Eds.), Learning Classifier Systems: International Workshops, IWLCS 2003-2005, LNAI 4399. pp. 291-307, Berlin Heidelberg: Springer Verlag.

Abstract An interesting feature of encoding the individuals of a Pittsburgh learning classifier system as a decision list is the emergent generation of a default rule. However, performance of the system is strongly tied to the learning system choosing the correct class for this default rule. In this paper we experimentally study the use of an explicit (static) default rule. We first test simple policies for setting the class of the default rule, such as the majority/minority class of the problem. Next, we introduce some techniques to automatically determine the most suitable class.

Paper


Effect of pure error-based fitness in XCS
Butz, M.V., Lanzi, P.L., & Goldberg, D.E. (2007).
In Kovacs, T., Llorà, X., Takadama, K., Lanzi, P.L., Stolzmann, W., & Wilson, S.W. (Eds.), Learning Classifier Systems: International Workshops, IWLCS 2003-2005, LNAI 4399. pp. 104-114, Berlin Heidelberg: Springer Verlag.

Abstract The accuracy-based fitness approach in XCS is one of the most significant changes in comparison with original learning classifier systems. Nonetheless, neither the scaled accuracy function, nor the importance of the relative fitness approach has been investigated in detail. The recent introduction of tournament selection to XCS has shown to make the system more independent from parameter settings and scaling issues. The question remains if relative accuracy itself is actually necessary in XCS or if the evolutionary process could be based directly on error. This study investigates advantages and disadvantages of pure error-based fitness vs. relative accuracy-based fitness in XCS.

Paper


Explorations of anticipatory behavioral control (ABC): A report from the cognitive psychology unit of the University of Würzburg
Hoffmann, J., Berner, M., Butz, M.V., Herbort, O., Kiesel, A., Kunde, W., Lenhard, A. (2007).
Cognitive Processing, 8, 133-142.

Paper upon request.


Spekulationen zur Struktur ideo-motorischer Beziehungen
Hoffmann, J., Butz, M.V., Herbort, O., Kiesel, A., & Lenhard, A. (2007).
Zeitschrift für Sportpsychologie, 14, 95-104.

Abstract According to the ideo-motor hypothesis, actions become bidirectionally connected to their sensory effects so that anticipations of the effects directly trigger the actions which have been learned to produce them. We discuss i) the role of exteroceptive as well proprioceptive effects, ii) the dependency of action-effect relations on the current situation, iii) the necessity of more abstract effector-unspecific action representations and finally iv) the use of sensory feedback in action control. The discussion led us to a tentative structure of ideo-motor relations by which the "idea" (anticipated goal) determines the "motor activity" (body movements) by a cascade of inverse models.

Paper (in German) upon request.


Problem Solution Sustenance in XCS: Markov Chain Analysis of Niche Support Distributions and the Impact on Computational Complexity
Butz, M. V., Goldberg, David E., Lanzi, Pier-Luca, & Sastry, K.
Genetic Programming and Evolvable Machines, 8, 5-37.

Abstract Michigan-style learning classifier systems iteratively evolve a distributed solution to a problem in the form of potentially overlapping subsolutions Each problem niche is covered by subsolutions that are represented by a set of predictive rules, termed classifiers. The genetic algorithm is designed to evolve classifier structures that together cover the whole problem space and represent a complete problem solution. An obvious challenge for such an online evolving, distributed knowledge representation is to continuously sustain all problem subsolutions covering all problem niches, that is, to ensure niche support. Effective niche support depends both on the probability of reproduction and on the probability of deletion of classifiers in a niche. In XCS, reproduction is occurrence-based whereas deletion is supportbased. In combination, niche support is assured effectively. In this paper we present a Markov chain analysis of the niche support in XCS, which we validate experimentally. Evaluations in diverse Boolean function settings, which require non-overlapping and overlapping solution structures, support the theoretical derivations. We also consider the effects of mutation and crossover on niche support. With respect to computational complexity, the paper shows that XCS is able to maintain (partially overlapping) niches with a computational effort that is linear in the inverse of the niche occurrence frequency.

Download earlier technical report version in pdf format - it is recommended, to get the journal version, though.


2006


Automated Global Structure Extraction for Effective Local Building Block Processing in XCS
Butz, M. V., Pelikan, M., Llorà, Xavier,& Goldberg, David E.
Evolutionary Computation,14, 345-380 (2006)

Abstract Learning Classifier Systems (LCSs), such as the accuracy-based XCS, evolve distributed problem solutions represented by a population of rules. During evolution, features are specialized, propagated, and recombined to provide increasingly accurate subsolutions. Recently, it was shown that, as in conventional genetic algorithms (GAs), some problems require efficient processing of subsets of features to find problem solutions efficiently. In such problems, standard variation operators of genetic and evolutionary algorithms used in LCSs suffer from potential disruption of groups of interacting features, resulting in poor performance. This paper introduces efficient crossover operators to XCS by incorporating techniques derived from competent GAs: the extended compact GA (ECGA) and the Bayesian optimization algorithm (BOA). Instead of simple crossover operators such as uniform crossover or one-point crossover, ECGA or BOA-derived mechanisms are used to build a probabilistic model of the global population and to generate offspring classifiers locally using the model. Several offspring generation variations are introduced and evaluated. The results show that it is possible to achieve performance similar to runs with an informed crossover operator that is specifically designed to yield ideal problem-dependent exploration, exploiting provided problem structure information. Thus, we create the first competent LCSs, XCS/ECGA and XCS/BOA, that detect dependency structures online and propagate corresponding lower-level dependency structures effectively without any information about these structures given in advance.

Download earlier technical report version in pdf format (journal version recommended, though).


An Analysis of the Ideomotor Principle and TOTE
Pezzulo, G., Baldassarre, G., Butz, M. V., Castelfranchi, C., & Hoffmann, J.
Proceedings of the Third Workshop on Anticipatory Behavior in Adaptive Learning Systems (ABiALS 2006) (2006)

Abstract What does it mean for a system to be goal oriented? In this paper we investigate how goals are represented and how they activate actions. We review the main philosophical and psychological assumptions about the ideomotor principle and we compare it with the TOTE model in cybernetics. We also present three computational architectures that implement goal orientedness, discussing their main peculiarities and limitations with respect to the ideomotor principle and TOTE.


Performance of Evolutionary Algorithms on Random Decomposable Problems
Pelikan, M., Sastry, K., Butz, M., Goldberg, D. E.
Parallel Problem Solving from Nature - PPSN IX, 788-797 (2006)

Abstract Recently, studies with the XCS classifier system on Boolean functions have shown that in certain types of functions simple crossover operators can lead to disruption and, consequently, a more effective recombination mechanism is required. Simple crossover operators were replaced by recombination based on estimation of distribution algorithms (EDAs). The combination showed that XCS with such a statistics-based crossover operator can solve challenging hierarchical functions more efficiently. This study elaborates the gained competence further investigating the coding scheme for the EDA component (BOA in our case) of XCS as well as performance in randomly generated Boolean function problems. Results in hierarchical Boolean functions show that the originally used 2-bit coding scheme induces a certain learning bias that stresses additional diversity in the evolving XCS population. A 1-bit coding scheme as well as a restricted 2-bit coding scheme confirm the suspected bias. The alternative encodings decrease the unnecessary bias towards specificity and increase performance robustness. The paper concludes with a discussion on the challenges ahead for XCS in Boolean function problems as well as on the implications of the obtained results for real-valued and multiple-valued classification problems, multi-step problems, and function approximation problems.


Substructural Neighborhoods for Local Search in the Bayesian Optimization Algorithm
Lima, C. F., Pelikan, M., Sastry, K., Butz, M. V., Goldberg, D. E., & Lobo, F. G.
Parallel Problem Solving from Nature - PPSN IX, 232-241 (2006)

Abstract This paper studies the utility of using substructural neigh- borhoods for local search in the Bayesian optimization algorithm (BOA). The probabilistic model of BOA, that automatically identi¯es important problem substructures, is used to de¯ne the structure of the neighbor- hoods to explore in local search. Three different neighborhoods operators are proposed for BOA, while the remaining of the paper focus on one of these operators to perform hillclimbing in the substructural space. Addi- tionally, a surrogate ¯tness model is considered to evaluate the improve- ment of the local search steps. Initial results indicate that performing local search in substructural neighborhoods signi¯catively reduces the number of generations necessary to converge to optimal solutions.


Hyper-ellipsoidal conditions in XCS: Rotation, linear approximation, and solution structure
Butz, M. V., Lanzi, Pier-Luca, & Wilson, S. W.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2006), 1457-1464 (2006)

Abstract The learning classifier system XCS is an iterative rule-learning system that evolves rule structures based on gradient-based prediction and rule quality estimates. Besides classification and reinforcement learning tasks, XCS was applied as an effective function approximator. Hereby, XCS learns space partitions to enable a maximally accurate and general function approximation. Recently, the function approximation approach was improved by replacing (1) hyperrectangular conditions with hyper-ellipsoids and (2) iterative linear approximation with the recursive least squares method. This paper combines the two approaches assessing the usefulness of each. The evolutionary process is further improved by changing the mutation operator implementing an angular mutation that rotates ellipsoidal structures explicitly. Both enhancements improve XCS performance in various non-linear functions. We also analyze the evolving ellipsoidal structures confirming that XCS stretches and rotates the evolving ellipsoids according to the shape of the underlying function. The results confirm that improvements in both the evolutionary approach and the gradient approach can result in significantly better performance.


Studying XCS/BOA learning in Boolean functions: Structure encoding and random boolean functions
Butz, M. V., & Pelikan, M.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2006), 1449-1456 (2006)

Abstract Recently, studies with the XCS classifier system on Boolean functions have shown that in certain types of functions simple crossover operators can lead to disruption and, consequently, a more effective recombination mechanism is required. Simple crossover operators were replaced by recombination based on estimation of distribution algorithms (EDAs). The combination showed that XCS with such a statistics-based crossover operator can solve challenging hierarchical functions more efficiently. This study elaborates the gained competence further investigating the coding scheme for the EDA component (BOA in our case) of XCS as well as performance in randomly generated Boolean function problems. Results in hierarchical Boolean functions show that the originally used 2-bit coding scheme induces a certain learning bias that stresses additional diversity in the evolving XCS population. A 1-bit coding scheme as well as a restricted 2-bit coding scheme confirm the suspected bias. The alternative encodings decrease the unnecessary bias towards specificity and increase performance robustness. The paper concludes with a discussion on the challenges ahead for XCS in Boolean function problems as well as on the implications of the obtained results for real-valued and multiple-valued classification problems, multi-step problems, and function approximation problems.


Unsupervised Learning of Inverse Dynamics Model
Herbort, O., & Butz, M. V.
CogSys II Radboud University Nijmegen, The Netherlands 12-13 April, 45 (2006)

Abstract The ability to interact efficiently and goal-directedly with a complex, dynamic, noisy environment is of paramount importance for any adaptive cognitive organism. To exert goal directed behavior, the organism must be able to convert a desired perceptual state into actual motor commands based on its experienced interactions with the environment. Thereby, the organism faces several problems: (1) Appropriate actions have to be triggered based on current state and selected goals (inverse dynamics). (2) The selected actions should optimize performance, that is, minimize effort (optimality principles). (3) These dynamics should be learned while interacting with the environment, without an actual teacher (unsupervised learning). Insights from psychology and neuroscience suggest that the problem of how to choose the right action given situation and goals is based on an anticipatory, successive decomposition of (perceptual) goals into progressively more action-related sub-goals. Our research focuses on the realization of such an anticipatory, adaptive system. We present an artificial recurrent neural network (ARNN) controller that implements an inverse dynamics model that incorporates optimality principles. The network can be entirely trained by unsupervised associative learning. We propose that a set of interacting modules can be used to control complex behavior by a cascade of ARNN controllers. The controller is implemented in a computational model and applied to a simple motor-arm control task.


2005


Towards an Adaptive Hierarchical Anticipatory Behavioral Control System
Herbort, O., Butz, M.V.,& Hoffmann, J. AAAI Fall Symposium
(2005).

Abstract Despite recent successes in control theoretical programs for limb control, behavior-based cognitive approaches for control are somewhat lacking behind. Insights in psychology and neuroscience suggest that the most important ingredients for a successful developmental approach to control are anticipatory mechanisms and hierarchical structures. Anticipatory mechanisms are beneficial in handling noisy sensors, bridging sensory delays, and directing attention and action processing capacities. Moreover, action selection may be immediate using inverse modeling techniques. Hierarchies enable anticipatory influences on multiple levels of abstraction in time and space. This paper provides an overview over recent insights in anticipatory, hierarchical, cognitive behavioral mechanisms, reviews previous modeling approaches, and introduces a novel model well-suited to study hierarchical anticipatory behavioral control in simulated as well as real robotic control scenarios.

Download Submitted Paper in pdf format


Gradient Descent Methods in Learning Classifier Systems: Improving XCS Performance in Multistep Problems
Butz, M.V., Goldberg, D.E.,& Lanzi, P.L.
IEEE Transactions on Evolutionary Computation, 9, 452-473 (2005).

Abstract The accuracy-based XCS classifier system has been shown to solve typical data mining problems in a machine-learning competitive way. However, successful applications in multistep problems, modeled by a Markov decision process, were restricted to very small problems. Until now, the temporal difference learning technique in XCS was based on deterministic updates. However, since a prediction is actually generated by a set of rules in XCS and Learning Classifier Systems in general, gradient-based update methods are applicable. The extension of XCS to gradient-based update methods results in a classifier system that is more robust and more parameter independent solving large and difficult maze problems reliably. Additionally, the paper highlights the relation of XCS to other function approximation methods in reinforcement learning.

Download earlier and shorter technical report version in compressed postscript format - journal version recommended, though.


Kernel-based, ellipsoidal conditions in the real-valued XCS classifier system
Butz, M.V.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2005)
pp. 1835-1842.

Abstract Many learning classifier system (LCS) implementations are restricted to the binary problem realm. Recently, the XCS classifier system was enhanced to be able to handle real-valued inputs among others. In the real-valued enhancement, XCSF applies as a function approximation system that partitions the input space in hyperrectangular subspaces specified in the classifiers. This paper changes the classifier conditions to hyperspheres and hyperellipsoids and investigates the consequent performance impact. It is shown that the modifications yield improved performance in continuous functions. Even in discontinuous functions with parallel boundaries, XCS's performance does not degrade. Thus, for the real-valued problem domain, ellipsoidal condition structures can improve XCS's performance. From a more general perspective, this paper shows that XCS is readily applicable in diverse problem domains. To apply the system even more successfully, suitable kernel-based bases need to be found and used as classifier conditions. XCS distributes the available structures over the problem space evolving more specialized structures in more complex problem subspaces.

Download submitted version in pdf format.


Extracted global structure makes local building block processing effective in XCS
Butz, M.V., Goldberg, David E.,& Lanzi, P.L.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2005)
pp. 655-662.

Abstract Michigan-style learning classifier systems (LCSs), such as the accuracy-based XCS system, evolve distributed problem solutions represented by a population of rules. Recently, it was shown that decomposable problems may require effective processing of subsets of problem attributes, which cannot be generally assured with standard crossover operators. A number of competent crossover operators capable of effective identification and processing of arbitrary subsets of variables or string positions were proposed for genetic and evolutionary algorithms. This paper effectively introduces two competent crossover operators to XCS by incorporating techniques from competent genetic algorithms (GAs): the extended compact GA (ECGA) and the Bayesian optimization algorithm (BOA). Instead of applying standard crossover operators, here a probabilistic model of the global population is built and sampled to generate offspring classifiers locally. Various offspring generation methods are introduced and evaluated. Results indicate that the performance of the proposed learning classifier systems XCS/ECGA and XCS/BOA is similar to that of XCS with informed crossover operators that is given all information about problem structure on input and exploits this knowledge using problem-specific crossover operators.

Download earlier technical report version in pdf format


Computational complexity of the XCS classifier system
Butz, M.V., Goldberg, David E.,& Lanzi, P.L.
In Bull, L., Kovacs, T. (Eds.) Foundations of Learning Classifier Systems
pp. 91-126 (2005).

Abstract Although Learning Classifier Systems date back more than twenty years ago, theory regard- ing issues like convergence or computational effort remained sparse. This paper establishes a PAC learning bound for the accuracy-based classifier system XCS. XCS is a flexible genetic-based learning mechanism applicable in classification problems, reinforcement learning problems, as well as different types of representations. Using a facet-wise analysis focusing on the learning of Boolean functions, we show that given several assumptions, k-DNF and related Boolean func- tions are PAC-learnable by XCS. The facet-wise analysis is modular, flexible, and extendable to other problem types and representations.

Download earlier technical report version in pdf format


Towards the Advantages of Hierarchical Anticipatory Behavioral Control
Herbort, O., Butz, M.V.,& Hoffmann, J. Proceedings of KogWis05. The German Cognitive Science Conference 2005, Schwabe, 2005, 77-82

Abstract Despite recent successes in control theoretical programs for limb control, behavior-based cognitive approaches for control are somewhat lacking behind. Insights in psychology and neuroscience suggest that the most important ingredients for a successful developmental approach to control are anticipatory mechanisms and hierarchical structures. Anticipatory mechanisms are beneficial in handling noisy sensors, bridging sensory delays, and directing attention and action processing capacities. Moreover, action selection may be immediate using inverse modeling techniques. Hierarchies enable anticipatory influences on multiple levels of abstraction in time and space. This paper provides an overview over recent insights in anticipatory, hierarchical, cognitive behavioral mechanisms, reviews previous modeling approaches, and introduces a novel model well-suited to study hierarchical anticipatory behavioral control in simulated as well as real robotic control scenarios.

Download Submitted Paper in pdf format


2004


Strong, Stable, and Reliable Fitness Pressure in XCS due to Tournament Selection
Butz, M. V., Sastry, Kumara,& Goldberg, David E.
Genetic Programming and Evolvable Machines, 6, 53-77.
also : IlliGAL Report No. 2003025

Abstract Recent analysis of the XCS classifier system have shown that successful genetic learning strongly depends on the amount of fitness pressure towards accurate classifiers. Since the traditionally used proportionate selection is dependent on fitness scaling and fitness distribution, the resulting evolutionary fitness pressure may be neither stable nor sufficiently strong. Thus, we apply tournament selection to XCS. In particular, we exhibit the weakness of proportionate selection and suggest tournament selection as a more reliable alternative. We show that tournament selection results in a learning classifier system that is more parameter independent, noise independent, and more efficient in exploiting fitness guidance in single-step problems as well as multistep problems. The evolving population is more focused on promising subregions of the problem space and thus finds the desired accurate, maximally general representation faster and more reliably.

Download Technical Report in ps.Z format


Anticipation for Learning, Cognition, and Education
Butz, M. V.
On the Horizon

2004, 12, 111-116.

Abstract Predictions, desires, or intentions have recently shown to strongly influence behavior, adaptation, and learning. These anticipations influence behavior mediating decision making and action execution as well as attention. Although it is not the future itself that influences the present but the anticipated future states or future properties, the difference to purely stimulus driven behavior and learning is highly significant. Recent analyses investigate under which environmental properties which type of anticipatory mechanism is helpful to improve behavior. Vice versa, since anticipatory mechanisms also bias attention, future sensory processing and thus future learning capabilities are immediately influenced by current anticipations. The impact on the understanding of the world, social systems, human learning and understanding, as well as education principles might be immense. The perspective of expectation and purpose as part of the cause in the general case might have been underestimated and requires further investigations and considerations.

Download in pdf format


Toward a Cognitive Sequence Learner: Hierarchy, Self-Organization, and Top-down Bottom-up Interaction
Butz, M. V.
IlliGAL report 2004021, University of Illinois at Urbana-Champaign (2004)

Abstract This paper introduces a hierarchical learning architecture that grows online an adaptive problem representation from scratch. The representation extracts frequent perceptual patterns representing them in a layered hierarchy where neural activity in higher layers is initiated bottom-up by firing neurons in lower layers and, vice versa, firing neurons in higher layers predispose activity and provide reinforcement feedback to neurons in lower layers. The structure evolves by a mixture of reinforcement learning and a genetic algorithm. Learning is biased towards extracting frequently recurring sequences in an input stream. We evaluate the architecture on a text document, which is presented iteratively---character by character---to the system. The results show that the proposed system reliably evolves representations of most frequent characters, syllables, and words in the document. We also confirm that top-down influences bias the evolution of syllable and character representations.
Download Technical Report in pdf format


Knowledge extraction and problem structure identification in XCS
Butz, M.V., Lanzi, P.L., Llorà, X., Goldberg, D.E.
Parallel Problem Solving from Nature - PPSN VIII, LNCS 3242
pp. 1051-1060. Springer Verlag, Berlin (2004)

Abstract XCS has been shown to solve hard problems in a machine-learning competitive way. Recent theoretical advancements show that the system can scale-up polynomially in the problem complexity and problem size given the problem is a k-DNF with certain properties. This paper addresses two major issues in XCS: (1) knowledge extraction and (2) structure identification. Knowledge extraction addresses the issue of mining problem knowledge from the final solution developed by XCS. The goal is to identify most important features in the problem and the dependencies among those features. The extracted knowledge may not only be used for further data mining, but may actually be re-fed into the system giving it further competence in solving problems in which dependent features, that is, building blocks, need to be processed effectively. This paper proposes to extract a feature dependency tree out of the developed rule-based problem representation of XCS. The investigations herein focus on Boolean function problems. The extension to nominal and real-valued features is discussed.
Download Technical Report in pdf format


Speeding-Up Pittsburgh Learning Classifier Systems: Modeling Time and Accuracy
Bacardit, J., Goldberg, D.E., Butz, M.V., Llorà, X.,& Garrell, J.M.
Parallel Problem Solving from Nature - PPSN VIII, LNCS 3242
pp. 1021-1031. Springer Verlag, Berlin (2004)

Abstract Windowing methods are useful techniques to reduce the computational cost of Pittsburgh style genetic-based machine learning techniques. If used properly, they additionally can be used to improve the classification accuracy of the system. In this paper we develop a theoretical framework for a windowing scheme called ILAS, developed previously by the authors. The framework allows us to approximate the degree of windowing we can apply to a given dataset as well as the gain in run-time. The framework sets the first stage for the development of a larger methodology with several types of learning strategies in which we can apply ILAS, such as maximizing the learning performance of the system, or achieving the maximum run-time reduction without significant accuracy loss.
Download Technical Report in pdf format


Toward a Theory of Generalization and Learning in XCS
Butz, M. V., Kovacs, Tim, Lanzi, Pier Luca, & Wilson, Stewart W.
IEEE Transaction on Evolutionary Computation, 8, 28-46 (2004)
previous version available as IlliGAL technical report 2002011

Abstract In this paper we provide the very first steps toward a theory of generalization and learning in XCS. We start from Wilson's generalization hypothesis which states that in XCS the interaction between niched evolution and panmictic deletion produces an intrinsic pressure toward generality. We analyze the different evolutionary pressures in XCS and derive a simple equation which support Wilson's hypothesis from a theoretical standpoint. The equation is tested with a number of experiments which confirm the basic model of generalization pressure we provide. Then, we focus on the conditions which must be satisfied for the existence of effective fitness or accuracy pressure in XCS. We analyze these conditions, termed ``challenges''. We derive two equations which provide some indications about how to set the population size and the covering probability so as to ensure the development of fitness pressure in XCS. We argue that when the challenges are met, XCS is able to evolve a solution to the problem reliably. When the challenges are not met, we suggest that intrinsic fitness guidance as well as biased reward in a problem might ease the challenges. The equations and the influence of intrinsic fitness guidance and biased reward are tested on Boolean multiplexer. The paper represents an initial contribution to the understanding of XCS functioning and lays the foundation for research on XCS's learning complexity.

Download Technical Report in ps.Z format


Gradient-based Learning Updates Improve XCS Performance in Multistep Problems
Butz, M. V., Goldberg, David E.,& Lanzi, Pier Luca
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004)
pp. 751-762. Springer Verlag, Berlin (2004)

Abstract This paper introduces a gradient-based reward prediction update mechanism to the XCS classifier system as applied in neural-network type learning and function approximation mechanisms. A strong relation of XCS to tabular reinforcement learning and more importantly to neural-based reinforcement learning techniques is drawn. The resulting gradient-based XCS system learns more stable and reliable in previously investigated hard multistep problems. While the investigations are limited to the binary XCS classifier system, the applied gradient-based update mechanism appears also suitable for the real-valued XCS and other learning classifier systems.


Bounding Learning Time in XCS
Butz, M. V., Goldberg, David E.,& Lanzi, Pier Luca
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2004)
pp. 739-750. Springer Verlag, Berlin (2004)

Abstract It has been shown empirically that the XCS classifier system solves typical classification problems in a machine learning competitive way. However, until now, no learning time estimate has been available for the system. This paper introduces a time estimate that bounds the learning time of XCS until maximally accurate classifiers are found. We assume a domino convergence model in which each attribute is successively specialized to the correct value. It is shown that learning time in XCS scales polynomial in problem length and exponential in the order of problem difficulty and thus in a machine learning competitive way.
Download Technical Report in pdf format


Effective Online Detection of Task-Independent Landmarks
Butz, M. V., Samarth, Swarup ,& Goldberg, David E.
IlliGAL report 2004002, University of Illinois at Urbana-Champaign (2004)

Abstract One of the key problems in building adaptive autonomous agents is landmark detection. Landmarks can be used for efficient navigation as well as for developing hierarchical cognitive structures. Previous approaches to landmark detection often simply chose landmarks as the agent's location after fixed intervals of time. Other approaches to landmark detection have focused on the reliability and ease of detection of landmarks. However, systems that use landmarks for hierarchy formations rely on a set of landmarks that provides a means for a concise and effective decomposition of the environment. We believe that such a decomposition is achieved most effectively by identifying transitions that partition the environment into relatively independent sub-regions. Using notions of surprise and consolidation via continued novelty, implemented by relatively simple statistics on the sensory inputs, we introduce an online landmark detection mechanism that reliably identifies landmarks that correspond to such transitions. Since the detected landmarks partition the environment into relatively independent subspaces, the resulting set of landmarks should be useful for the formation of an online adaptive hierarchical problem decomposition enabling efficient hierarchical adaptation and cognition.
Download Technical Report in pdf format


2003


Gradient Descent Methods in Learning Classifier Systems
Butz, M. V., Goldberg, David E.,& Lanzi, Pier Luca
IlliGAL report 2003028, University of Illinois at Urbana-Champaign (2003)

Abstract The accuracy-based XCS classifier system has been shown to solve typical data mining problems in a machine-learning competitive way. However, successful applications in multistep problems, modeled by a Markov decision process, were restricted to very small problems. Until now, the temporal difference learning technique in XCS was based on deterministic updates. However, since a prediction is actually generated by a set of rules in XCS and Learning Classifier Systems in general, gradient-based update methods are applicable. The extension of XCS to gradient-based update methods results in a classifier system that is more robust and more parameter independent solving large and difficult maze problems reliably. Additionally, the paper highlights the relation of XCS to other function approximation methods in reinforcement learning.
Download Technical Report in ps.Z format


Documentation of XCS+TS C-Code 1.2
Butz, M. V.
IlliGAL report 2003023, University of Illinois at Urbana-Champaign

Abstract This is the documentation of the XCS 1.2 C-code released on the IlliGAL web-page. The code includes the option to apply tournament selection as well as several other new features in comparison to the XCS1.1 release. Moreover, XCS parameters as well as experimental settings can be specified in a parameter file so that recompiling is not necessary anymore. Finally, an additional output file is generated that determines means and standard deviations over the run experiments. XCS 1.2 is written in ANSI C. This documentation explains the general outline of the code, all possible parameter manipulations, and how to test this implementation on other problems (environments).
Download Technical Report in ps.Z format
Download the Code in compressed TAR


Anticipatory Behavior: Exploiting Knowledge About the Future to Improve Current Behavior
Butz, M. V., Sigaud, Olivier,& Gérard, Pierre,
In Anticipatory Behavior in Adaptive Learning Systems: Foundations, Theories, and Systems (LNCS 2684), Butz, M. V., Sigaud, Olivier,& Gérard, Pierre (Eds.), 1-10, Springer Verlag, Berlin (2003)

Abstract This chapter is meant to give a concise introduction to the topic of this book. The study of anticipatory behavior is referring to behavior that is dependent on predictions, expectations, or intentions of future states. Hereby, behavior includes actual decision making, internal decision making, internal preparatory mechanisms, as well as learning. Despite several recent theoretical approaches on this topic, until now it remains unclear in which situations anticipatory behavior is useful or even mandatory to achieve competent behavior in adaptive learning systems. This book provides a collection of articles that investigate these questions. We provide an overview for all articles relating them to each other and highlighting their significance to anticipatory behavior research in general.
Anticipatory Behavior in Adaptive Learning Systems


Internal Models and Anticipations in Adaptive Learning Systems
Butz, M. V., Sigaud, Olivier,& Gérard, Pierre,
In Anticipatory Behavior in Adaptive Learning Systems: Foundations, Theories, and Systems (LNCS 2684), Butz, M. V., Sigaud, Olivier,& Gérard, Pierre (Eds.), 86-109, Springer Verlag, Berlin (2003)

Abstract The explicit investigation of anticipations in relation to adaptive behavior is a recent approach. This chapter first provides psychological background that motivates and inspires the study of anticipations in the adaptive behavior field. Next, a basic framework for the study of anticipations in adaptive behavior is suggested. Different anticipatory mechanisms are identified and characterized. First fundamental distinctions are drawn between implicitly anticipatory behavior, payoff anticipatory behavior, sensorial anticipatory behavior, and state anticipatory behavior. A case study allows further insights into the drawn distinctions. Many future research direction are suggested.
Anticipatory Behavior in Adaptive Learning Systems


Generalized State Values in an Anticipatory Learning Classifier System
Butz, M. V.,& Goldberg, David E.,
In Anticipatory Behavior in Adaptive Learning Systems: Foundations, Theories, and Systems (LNCS 2684), Butz, M. V., Sigaud, Olivier,& Gérard, Pierre (Eds.), 282-301, Springer Verlag, Berlin (2003)

Abstract This paper introduces generalized state values to the anticipatory learning classifier system ACS2. Previous studies with ACS2 showed that the system reliably evolves an accurate and generalized predictive model in typical Markov decision process (MDP) environments. The predictive model approximates the state transition function of the MDP in a compact, generalized form. However, it was also shown that the evolving generalized predictive model might be over-general for an accurate representation of reinforcement values. Thus, a function approximation module is added that approximates state values. In combination, actual action choice depends on state values predicted by the means of the predictive model yielding anticipatory behavior. It is shown that the function approximation module accurately generalizes the state value function in the investigated MDP.We also suggest the implementation of task dependent anticipatory attentional mechanisms exploiting the representation of the generalized state-value function. Moreover, improvement of the approach by the means of further anticipatory interaction between predictive model learner and state value learner is suggested.
Anticipatory Behavior in Adaptive Learning Systems


Bidirectional ARTMAP: An artificial mirror neuron system
Butz, M. V. and Ray, Sylvian
Proceedings of the International Joint Conference on Neural Networks, 1417-1422, (2003)

Abstract The recent detection of mirror neurons in monkeys suggests that brains encode parts of an observed action in a similar way they encode own actions. This paper models such a mirror system by means of adaptive resonance theory (ART) artificial neural networks coupling them in a manner similar to ARTMAP systems. Particularly, a bidirectional ARTMAP system (BiARTMAP) is created. The system associates executed actions with consequent action-effects. The associative structure gives the system mirror capabilities: On the one hand, perceived environmental changes cause an action association. On the other hand, activated action patterns cause the expectation of resulting environmental change. We also show that many other proposed cognitive processes relate to the BiARTMAP architecture. Future work includes the incorporation of situational dependencies, the combination of BiARTMAP with vector associative maps (VAMs), and the integration of BiARTMAP in a behavioral module enabling anticipatory behavior. Application wise, BiARTMAP can be applied as a general classifier and/or associative network.
Download ps.gz format


Analysis and Improvement of Fitness Exploitation in XCS: Bounding Models, Tournament Selection, and Bilateral Accuracy
Butz, M. V., Goldberg, David E.,& Tharakunnel, K.
Evolutionary Computation, 11, 239-277 (2003)

Abstract The evolutionary learning mechanism in XCS strongly depends on its accuracy-based fitness approach. The approach is meant to result in an evolutionary drive from classifiers of low accuracy to those of high accuracy. Since, given inaccuracy, lower specificity often corresponds to lower accuracy, fitness pressure most often also results in a pressure towards higher specificity. Moreover, fitness pressure should cause the evolutionary process to be innovative in that it combines low-order building blocks of lower accurate classifiers, to higher-order building blocks with higher accuracy. This paper investigates how, when, and where accuracy-based fitness results in successful rule evolution in XCS. Along the way, a weakness in the current proportionate selection method in XCS is identified. Several problem bounds are derived that need to be obeyed to enable proper evolutionary pressure. Moreover, a fitness dilemma is identified that causes accuracy-based fitness to be misleading. Improvements are introduced to XCS to make fitness pressure more robust and overcome the fitness dilemma. Specifically, (1) tournament selection results in a much better fitness-bias exploitation, and (2) bilateral accuracy prevents the fitness dilemma. While the improvements stand for themselves, we believe they also contribute to the ultimate goal of an evolutionary learning system that is able to solve decomposable machine-learning problems quickly, accurately, and reliably. The paper also contributes to the further understanding of XCS in general and the fitness approach in XCS in particular.

also previous version: IlliGAL Report No. 2002027
Download Technical Report in ps.Z format


Bounding the Population Size in XCS to Ensure Reproductive Opportunities
Butz, M. V.,& Goldberg, David E.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2003)
pp. 1844-56. Springer Verlag, Berlin (2003)

Abstract Recently, the accuracy-based learning classifier system XCS successfully underwent several comparisons with other established machine learning algorithms. Despite these encouraging results, it is hardly understood how crucial parameters should be set in XCS nor how XCS scales up in larger problems. Previous research identified a covering challenge in XCS that needs to be obeyed to enable the genetic algorithm (GA) from taking place. Furthermore, a schema challenge was identified that, once obeyed, ensures the existence of accurate classifiers. This paper departs from these challenges deriving a reproductive opportunity bound. The bound assured that more accurate classifiers get a chance for reproduction. The relation to the previous bounds as well as the general specificity pressure in XCS are discussed as well. The derived bound shows that XCS scales in a machine learning competitive way.

also previous version: IlliGAL Report No. 2003009
Download Technical Report in ps.Z format


Towards Building Block Propagation in XCS: A Negative Result and Its Implications
Tharakunnel, Kurian, Butz, M. V., & Goldberg, David E.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2003)
pp. 1906-1917. Springer Verlag, Berlin (2003)

Abstract The accuracy-based classifier system XCS is currently the most successful learning classifier system. Several recent studies showed that XCS can produce machine-learning competitive results. Nonetheless, until now the evolutionary mechanisms in XCS remained somewhat ill-understood. This study investigates the selectorecombinative capabilities of the current XCS system. We reveal the accuracy dependence of XCS's evolutionary algorithm and identify a fundamental limitation of the accuracy-based fitness approach in certain problems. Implications and future research directions conclude the paper.


Tournament Selection in XCS
Butz, M. V., Sastry, Kumara,& Goldberg, David E.
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2003)
pp. 1857-1869. Springer Verlag, Berlin (2003)

Abstract Selection in the accuracy-based learning classifier system XCS, introduced by Wilson in 1995, has always been done by the means of proportionate selection. Although it is known from GA literature that proportionate selection is subject to many pitfalls, the LCS community adhered to proportionate selection. In XCS, the accuracy-based fitness is scaled which made proportionate selection work in many problems. This paper investigates performance in Boolean functions in which proportionate selection fails to solve the function. Tournament selection with tournament sizes proportionate to the actual set size is shown to outperform proportionate selection in all investigated problems. Moreover, it is shown that tournament selection makes XCS more independent from various parameter settings. The message of this paper is plain and simple: Who works with XCS should use tournament selection with tournament sizes proportionate to the actual set size.

also previous version: IlliGAL Report No. 2002020
Download Technical Report in ps.Z format


2002


Anticipations Control Behavior: Animal Behavior in an Anticipatory Learning Classifier System
Butz, M. V. and Hoffmann, Joachim
Adaptive Behavior, 10, 75-96

Abstract The concept of anticipations controlling behavior is introduced. Background is provided about the importance of anticipations from a psychological perspective. Based on the psychological background wrapped in a framework of anticipatory behavioral control, the anticipatory learning classifier system ACS2 is explained. ACS2 learns and generalizes online a predictive environmental model (a model that allows the prediction of future environmental states). The model is a subjective model, that is, no global state information is available to the agent. It is shown that ACS2 can simulate anticipatory learning processes and anticipatory controlled behavior by the means of the model. The simulations of various rat experiments, previously conducted by Colwill and Rescorla, show that the incorporation of anticipations is indeed crucial for simulating the behavior observed in rats. Despite the simplicity of the tasks, we show that the observed behavior reaches beyond the capabilities of model-free reinforcement learning as well as model-based reinforcement learning without online generalization. Possible future impacts of anticipations in adaptive learning systems are outlined.

Download Paper in pdf format


The anticipatory classifier system and genetic generalization
Butz, M. V., Goldberg, David E., and Stolzmann, Wolfgang
Natural Computing, 1(4) 427-467
[An improved version, with clarifications and corrections, of: Technical Report No. 2000032 at the Illinois Genetic Algorithms Laboratory]

Abstract
The anticipatory classifier system (ACS) combines the learning classifier system framework with the cognitive learning theory of anticipatory behavioral control. The result is an evolutionary system that builds a complete and generalized predictive environmental model. Reinforcement learning techniques are applied to form a behavioral policy represented in the model. After providing some background as well as outlining the objectives of the system, we explain in detail all involved current processes. Furthermore, we analyze the deficiency of over-specialization in the anticipatory learning process (ALP), the main learning mechanism in the ACS. Consequently, we introduce a genetic algorithm (GA) to the ACS that is meant for generalization of over-specialized classifiers. We show that it is possible to form a symbiosis between a directed specialization and a genetic generalization mechanism achieving a learning mechanism that evolves a complete, accurate, and compact description of the perceived environment. Results in three different environmental settings confirm the usefulness of the genetic algorithm in the ACS. Finally, we discuss future research directions with the ACS and anticipatory systems in general.
Download Technical Report in compressed PS


Internal Models and Anticipations in Adaptive Learning Systems
Butz, M. V. and Sigaud, Olivier and Gérard, Pierre
From Animals to Animats 7: The seventh international conference on the Simulation of Adaptive Behavior (SAB 2002). Workshop Proceedings Adaptive Behavior in Anticipatory Learning Systems (ABiALS 2002)

Abstract
The workshop Adaptive Behavior in Anticipatory Learning Systems 2002 (ABiALS 2002) held in association with the seventh international conference on the Simulation of Adaptive Behavior (SAB 2002) in Edinburgh, Scotland, is the first of its kind. The explicit investigation of anticipations in relation to adaptive behavior is a recent approach. In this introduction to the workshop, we first provide psychological background that motivates and inspires the study of anticipations in the adaptive behavior field. Next, we endow the workshop with a basic framework for the study of anticipations in adaptive behavior. Different anticipatory mechanisms are identified and categorized. First fundamental distinctions are drawn between implicitly anticipatory mechanisms, payoff anticipations, sensorial anticipations, and state anticipations. A case study allows further insights into the drawn distinctions. Moreover, an overview of all accepted workshop contributions is provided categorizing the contributions in the light of the outlined distinctions and highlighting the relations to each other. Many future research directions are suggested.

Download Paper in pdf format


Generalized State Values in an Anticipatory Learning Classifier System
Butz, M. V. and Goldberg, David E.
From Animals to Animats 7: The seventh international conference on the Simulation of Adaptive Behavior (SAB 2002). Workshop Proceedings Adaptive Behavior in Anticipatory Learning Systems (ABiALS 2002)

Abstract
This paper introduces generalized state values to the anticipatory learning classifier system ACS2. Previous studies with ACS2 showed that the system reliably evolves a generalized predictive model in typical Markov decision process (MDP). The predictive model approximates the state transition function of the MDP in a compact, generalized form. However, it was also shown that the evolving predictive model might be over-general for an accurate representation of reinforcement values. Thus, a function approximation module is added that approximates state values. In combination, actual action choice depends on state values predicted by the means of the predictive model yielding anticipatory behavior. It is shown that the function approximation module accurately generalizes the state value function in the investigated MDP. We also suggest the implementation of task dependent anticipatory attentional mechanisms exploiting the representation of the generalized state-value function. Moreover, improvement of the approach by the means of further anticipatory interaction between predictive model learner and state value learner is suggested.

Download Paper in pdf format


State Value Learning with an Anticipatory Learning Classifier System in a Markov Decision Process
Butz, M. V.
Technical Report 2002018 at the Illinois Genetic Algorithms Laboratory

Abstract
This paper addresses the combination of an online generalizing model learner with a state value learner in a Markov decision process (MDP). The model learner evolves online a generalized representation of the MDP's state transition function. The learned model is called predictive model. State values are evaluated by the means of the evolving predictive model representation. State values approximate the Bellman equation giving rise to an optimal policy in the MDP. It is proven that if the reinforcement provision in the MDP only depends on resulting states, state values can be redefined to yield an optimal policy independent of any immediate reward representation. Internal temporal difference learning is applied to further speed up learning of an optimal policy.

Download Technical Report in compressed PS


An Algorithmic Description of ACS2
Butz, M. V. and Goldberg, David E.
In Lanzi, P. L., Stolzmann, W., and S. W. Wilson (Eds.),
Advances in Learning Classifier Systems: 4th International Workshop, IWLCS 2001
pp. 211-230. Berlin: Springer-Verlag (2002).

Abstract
The chapter investigates how model and behavioral learning can be improved in an anticipatory learning classifier system by biasing exploration. First, the applied system ACS2 is explained. Next, an overview over the possibilities of applying exploration biases in an anticipatory learning classifier system and specifically ACS2 is provided. In ACS2, a recency bias termed action delay bias as well as an error bias termed knowledge array bias is implemented. The system is applied in a dynamic maze task and an hand-eye coordination task to validate the biases. The experiments exhibit that biased exploration enables ACS2 to evolve and adapt its internal environmental model faster. Also adaptive behavior is improved.


Biasing exploration in an anticipatory learning classifier system
Butz, M. V.
Advances in Learning Classifier Systems: 4th International Workshop, IWLCS 2001
pp. 3-22. Berlin: Springer-Verlag (2001).

Abstract
The chapter investigates how model and behavioral learning can be improved in an anticipatory learning classifier system by biasing exploration. First, the applied system ACS2 is explained. Next, an overview over the possibilities of applying exploration biases in an anticipatory learning classifier system and specifically ACS2 is provided. In ACS2, a recency bias termed action delay bias as well as an error bias termed knowledge array bias is implemented. The system is applied in a dynamic maze task and an hand-eye coordination task to validate the biases. The experiments exhibit that biased exploration enables ACS2 to evolve and adapt its internal environmental model faster. Also adaptive behavior is improved.


Anticipatory learning classifier systems
Butz, M. V.
Boston, MA. Kluwer Academic Publishers.
Book in the book series Genetic Algorithms and Evolutionary Computation
More information about the book can be found here.


2001


An implementation of the anticipatory classifier system ACS2 in C++
Butz, M. V.
Technical Report 2001026 at the Illinois Genetic Algorithms Laboratory

Abstract
A documentation of a C++ implementation of ACS2, the current state-of-the-art of the anticipatory learning classifier system ACS, is provided. The documentation explains how to get started with the code. A detailed overview of the structure of the code and of all possible parameter manipulations are given. Input and Output interfaces are revealed. Finally, the documentation exhibits how to run ACS2 in the provided test environments as well as how to program new environments for further runs with ACS2.

Download Technical Report in compressed PS
Download the Code in compressed TAR


Analyzing the evolutionary pressures in XCS
Butz, M. V. and Pelikan, M.
In Spector, Lee and Goodman, Erik D. and Wu, Annie and Langdon, W. B. and Voigt, Hans-Michael and Gen, Mitsuo and Sen, Sandip and Dorigo, Marco and Pezeshk, Shahram and Garzon, Max H. and Burke, Edmund
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001)
pp. 935-942. San Francisco, CA: Morgan Kaufmann (2001)
(previous version as Technical Report 2001009 at the Illinois Genetic Algorithms Laboratory)

Abstract
After an increasing interest in learning classifier systems and the XCS classifier system in particular, this paper locates and analyzes the distinct evolutionary pressures in XCS. Combining several of the pressures, an equation is derived that validates the generalization hypothesis which was stated by Wilson (1995). A detailed experimental study of the equation exhibits its applicability in predicting the change in specificity in XCS as well as reveals several other specificity influences.

Download Technical Report version in compressed PS


How XCS evolves accurate classifiers
Butz, M. V. and Kovacs, Tim and Lanzi, Pier-Luca and Wilson, Stewart W.
In Spector, Lee and Goodman, Erik D. and Wu, Annie and Langdon, W. B. and Voigt, Hans-Michael and Gen, Mitsuo and Sen, Sandip and Dorigo, Marco and Pezeshk, Shahram and Garzon, Max H. and Burke, Edmund
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2001)
pp. 927-934. San Francisco, CA: Morgan Kaufmann (2001).
(previous version Technical Report 2001008 at the Illinois Genetic Algorithms Laboratory)

Abstract
Due to the accuracy based fitness approach, the ultimate goal for XCS is the evolution of a compact, complete, and accurate payoff mapping of an environment. This paper investigates what causes the XCS classifier system to evolve accurate classifiers. The investigation leads to two challenges for XCS, the covering challenge and the schema challenge. Both challenges are revealed theoretically and experimentally. Furthermore, the paper provides suggestions for overcoming the challenges as well as investigates environmental properties that can help XCS to overcome the challenges autonomously. Along those lines, a deeper insight into how to set the initial parameter values in XCS is provided.

Download Technical Report version in compressed PS


An Algorithmic Description of XCS
Butz, M. V. and Wilson, Stewart W.
In Lanzi, P. L., Stolzmann, W., and S. W. Wilson (Eds.), Advances in Learning Classifier Systems, LNAI 1996,
pp. 253-272. Berlin: Springer-Verlag (2001)
also
Soft Computing, 6, 144-153 (2002)
[An improved version, with clarifications and corrections, of: Technical Report No. 2000017 at the Illinois Genetic Algorithms Laboratory]

Abstract
A concise description of the XCS classifier system's parameters, structures, and algorithms is presented as an aid to research. The algorithms are written in modular structured pseudo code with accompanying explanations.

Download up-to-date version in gzipped PS


Probability-Enhanced Predictions in the Anticipatory Classifier System
Butz, M. V., Goldberg, David E., and Stolzmann, Wolfgang
In Lanzi, P. L., Stolzmann, W., and S. W. Wilson (Eds.), Advances in Learning Classifier Systems, LNAI 1996,
pp. 37-51. Berlin: Springer-Verlag (2001).
[An improved version, with clarifications and corrections, of: Technical Report No. 2000016, Illinois Genetic Algorithms Laboratory]

Abstract
The Anticipatory Classifier System (ACS) recently showed many capabilities new to the Learning Classifier System field. Due to its enhanced rule structure with an effect part, it forms an internal environmental representation, learns latently besides the common reward learning, and can use many cognitive processes. This paper introduces a probability-enhancement in the predictions of the ACS which enables the system to handle different kinds of non-determinism in an environment. Experiments in two different mazes will show that the ACS is now able to handle action-noise and irrelevant random attributes in the perceptions. Furthermore, applications with a recently introduced GA will reveal the general independence of the two new mechanism as well as the ability of the GA to substantially decrease the population size.

Download technical report version in compressed PS


2000


XCSJava 1.0: An Implementation of the XCS classifier system in Java
Butz, M. V.
Technical Report 2000027 at the Illinois Genetic Algorithms Laboratory

Abstract
The XCSJava 1.0 implementation of the XCS classifier system in Java is freely available from the IlliGAL anonymous ftp-site. The implementation covers the basic features of the XCS classifier system and provides a multiplexer and maze environment for testing purposes. This paper explains how to download, compile, and run the code. Moreover, it explains the object oriented approach in the implementation and the possible parameter manipulation as well as the environmental interface to hook in other test environments. Additionally to the source code, an executable package of the version as well as an XCSJava 1.0 API documentation is provided.

Download Technical Report in compressed PS
Download the Code in compressed TAR


Investigating Generalization in the Anticipatory Classifier System
Butz, M. V., Goldberg, David E., and Stolzmann, Wolfgang
Proceedings of the sixth international conference on Parallel Problem Solving from Nature (PPSN2000)
[previous version as Technical Report 2000014 at the Illinois Genetic Algorithms Laboratory]

Abstract
Recently, a genetic algorithm (GA) was introduced to the Anticipatory Classifier System (ACS) which surmounted the occasional problem of over-specification of rules. This paper investigates the resulting generalization capabilities further by monitoring in detail the performance of the ACS in the highly challenging multiplexer task. Moreover, by comparing the ACS to XCS in this task it is shown that the ACS generates accurate, maximally general rules and its population converges to those rules. Besides the observed ability of latent learning and the formation of an internal environmental representation, this ability of generalization adds a new advantage to the ACS in comparison with similar approaches.

Download Technical Report in compressed PS


First Cognitive Capabilities in the Anticipatory Classifier System
Stolzmann, Wolfgang, Butz, M. V., Hoffmann, Joachim, and Goldberg, David E.
Proceedings of the sixth international conference on the Simulation of Adaptive Behavior (SAB2000), 287-296 (2000).
[previous version as Technical Report 2000008 at the Illinois Genetic Algorithms Laboratory]

Abstract
This paper adds a new viewpoint to the Anticipatory Classifier System (ACS). It approaches the system from a psychological perspective and thus provides new insights to the current system. The main learning mechanism in the ACS, the Anticipatory Learning Process (ALP), evolved out of the psychological learning theory of anticipatory behavioral control. The paper compares the ALP directly to this theory and reveals the similarities. Moreover, it investigates the behavior of the ACS. By simulating previously published rat experiments, the paper compares the behavior of the ACS with the behavior of the rats. Finally, two further cognitive mechanisms are introduced to the ACS. These two mechanisms result in an animal-like behavior of the ACS in the simulations. Furthermore, they prove the usability of the internal environmental model for reward-learning tasks for the first time.

Download Technical Report in compressed PS


Introducing a genetic generalization pressure to the Anticipatory Classifier System
Part 2: Performance Analysis
Butz, M. V., Goldberg, David E., and Stolzmann, Wolfgang
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2000)
[previous version as Technical Report 2000006 at the Illinois Genetic Algorithms Laboratory]

Abstract
The Anticipatory Classifier System (ACS) is able to form a complete internal representation of an environment. Unlike most other classifier system and reinforcement learning approaches, it is able to learn latently (i.e. to learn in an environment without getting any reward). Compared to other systems which are also able to form an internal representation of the outside world, the advantage of the ACS is that it is not forming an identical copy of the environment but it is generating a complete but more general model. After the observation that the model is not necessarily maximally general a genetic generalization pressure was introduced to the ACS (Butz:Technical Report 2000005). This paper focuses on the different mechanisms in the anticipatory learning process, which resembles the specification pressure, and in the genetic algorithm, which realizes the genetic generalization pressure. The capability of generating maximally general rules and evolving a completely converged population is investigated in detail. Furthermore, the paper approaches a first comparison with the XCS classifier system in different mazes and the multiplexer problem.

Download Technical Report in compressed PS


Introducing a genetic generalization pressure to the Anticipatory Classifier System
Part 1: Theoretical approach
Butz, M. V., Goldberg, David E., and Stolzmann, Wolfgang
Proceedings of the Genetic and Evolutionary Computation Conference (GECCO-2000)
[previous version as Technical Report 2000005 at the Illinois Genetic Algorithms Laboratory]

Abstract
The Anticipatory Classifier System is a learning classifier system that is based on the cognitive mechanism of anticipatory behavioral control. Besides the common reward learning, the ACS is able to learn latently (i.e. to learn in an environment without getting any reward) which is not possible with reinforcement learning techniques. Furthermore, it is forming a complete internal representation of the environment and thus, it is able to use cognitive processes such as reasoning and planning. Latest research showed that there are problems that challenge the current ACS learning mechanism. It was observed that the ACS is not generating accurate, maximally general rules reliably (i.e. rules which are accurate and in the mean time as general as possible), but it is sometimes generating over-specific rules. This paper shows how a genetic algorithm can be used to overcome this present pressure of over-specification in the ACS mechanism with a genetic generalization pressure. The ACS works then as a hybrid which learns latently, forms a cognitive map, and evolves accurate, maximally general rules.

Download Technical Report in compressed PS


1999


An Implementation of the XCS classifier system in C
Butz, M. V.
Technical Report 99021 at the Illinois Genetic Algorithms Laboratory

Abstract
The XCS classifier system was developed by Wilson (1995). The learning mechanism is based on the accuracy of its reward prediction. This method leads to the formation of accurate most general classifiers. This paper explains how to download, compile and use the XCS code version 1.0 written in ANSI C. It discusses how to select various parameter settings, how to add and remove certain procedures in the XCS, how to apply the XCS in the multiplexer environment and diverse woods environments, and how to add code to apply the XCS in other environments. The code provides the mechanisms introduced by Wilson (1995) and the enhancements published by Wilson (1998).

Download Technical Report in compressed PS


New Challenges for an Anticipatory Classifier System: Hard Problems and Possible Solutions
Butz, M. V., Goldberg, David E., and Stolzmann, Wolfgang
Technical Report 99019 at the Illinois Genetic Algorithms Laboratory

Abstract
An Anticipatory Classifier System (ACS) is a learning mechanism based on learning classifier systems and the cognitive model of ``Anticipatory Behavioral Control''. By comparing perceived consequences with its own expectations (anticipations), an ACS is able to learn in multi-step environments. To date, the ACS has proven its abilities in various problems of that kind. It is able to learn latently (i.e. to learn without getting any reward) and it is able to distinguish between non-Markov states. Additionally, an ACS is capable of incrementally building a cognitive map that can be used to do action-planning. Although the ACS has proven to scale up in suitable environments, it depends on certain environmental properties. It believes itself to be the only agent that can change the perceptions received from an environment. Any environmental change is considered and believed to be caused by the executed actions. The ACS learns from the changes by using fixed mechanisms. This paper reveals the properties of an environment that the current ACS assumes to be given. By investigating the problems of the current ACS when violating these properties we believe that this investigation will immediately serve for a better understanding of the ACS and lead to many ideas to improve the current ACS. We will propose some ideas and discuss the important ones in more detail.

Download Technical Report in compressed PS


Action-Planning in Anticipatory Classifier Systems
Butz, M. V. & Stolzmann, Wolfgang
Presented on the Second International Workshop on Learning Classifier Systems (2.IWLCS'99) during the Genetic and Evolutionary Computation Conference (GECCO 1999)
Also appeared in a slightly different form In Lanzi, Stolzmann, & Wilson (Eds.), Learning Classifier Systems - From Foundations to Applications.
In chapter Latent learning and action-planning in robots with Anticipatory Classifier Systems.

Abstract
Learning consists in the acquisition of knowledge. In Reinforcement Learning this is knowledge about how to reach a maximum of environmental reward. We are interested in the acquisition of knowledge that consists in having expectations of behavioral consequences. Behavioral consequences depend on the current situation, so it is necessary to learn in which situation S which behavior/reaction R leads to which behavioral consequences C. In other words, SRC units are learned. It was the psychologist Edward Tolman (1932) who firstly stated that animals learn SRC units. Seward (1949) proved that rats are able to learn in the absence of reward and confirmed Tolman's assumption. Learning in the absence of reinforcement is called 'latent learning' and cannot be explained by usual reinforcement learning techniques. In the field of Learning Classifier Systems (LCS) latent learning is realized in Riolo's CFSC2 (Riolo, 1991) and Stolzmann's ACS (Stolzmann, 1997, 1998). Both authors prove the performance of their learning algorithms with a simulation of Seward's experiment. This experiment consists in a learning phase without any reward followed by a test phase where the rats have to use the knowledge they acquired during the learning phase to do action-planning. Action-planning and latent learning occur at different times. This paper focuses on the integration of action-planning and latent learning in ACS. Using an example about learning of the hand-eye coordination of a robot arm in conjunction with a camera it will be shown, that a combination of action-planning and latent learning in ACS induces a substantial reduction of the number of trials which are required to learn a complete model of a prototypically environment.


...last modified: 13 September 2007