Quantifying Emergent Behavior of Autonomous Robots

by Georg Martius, Eckehard Olbrich

Abstract: Quantifying the behavior of robots which was generated autonomously by task-independent objective functions is an important prerequisite for objective comparisons of algorithms with each other or even with human or animal movements. The excess entropy (Shaw 1984), effective measure complexity (Grassberger 1986) or predictive information (Bialek et al.~2001) is usually considered as the most natural complexity measure for temporal sequences. It measures the amount of information that the past contains about the future (predictive information) which is equal to the non-extensive part of the entropy (excess entropy) of the sequence. However, using the excess entropy as a complexity measure for continuous valued time series one has to deal with the fact that its value will be different for different partitions and diverges with increasing resolution. We therefore propose a new decomposition of the excess entropy in an resolution dependent and a resolution independent part and discuss how they depend on the dimensionality of the dynamics, correlations and the noise level. For the practical estimation we propose to use estimates based on the correlation integral instead of the direct estimation of the mutual information using the algorithm by Kraskov et al.~(2004) which is based on next neighbor statistics because the latter allows less control of the scale dependencies. Using our algorithm we are able to show how autonomous learning generates behavior of increasing complexity with increasing learning duration.

Videos

Snake

Video S1: Snake: Side rolling behavior. The segments are connected by universal joints and are actively controlled. So in order to make this rolling all motors have to act accordingly.

Video S2: Snake: Side rolling behavior. The segments are connected by universal joints and are actively controlled. So in order to make this rolling all motors have to act accordingly.

Hexapod

The learning Hexapod developes due to the interaction of the internal parameters dynamics and the environemnt a transition of behaviors is observed that results in a jumping behavior.

Video H1: Hexapod: Behavior 2 min after start. The robot periodically moves up and down with a small amplitude. When freezing the controller parameters this behavior continues.

Video H2: Hexapod: Behavior 4 min after start. We observe a precursor of the jumping behavior.

Video H3: Hexapod: Behavior 8 min after start. A stable jumping behavior was learned.

Data files

The data files can be downloaded from here. An overview of the data format and handling is given here.

Code

Sources are on Github georgmartius/behavior-quant

This document was translated from L^AT_EX by H^EV^EA.