We present an information-theoretic method permitting one to find structure in a problem space (here, in a spatial navigation domain) and cluster it in ways that are convenient to solve different classes of control problems, which include planning a path to a goal from a known or an unknown location, achieving multiple goals and exploring a novel environment. Our generative nonparametric approach, called the generative embedded Chinese restaurant process (geCRP), extends the family of Chinese restaurant process (CRP) models by introducing a parameterizable notion of distance (or kernel) between the states to be clustered together. By using different kernels, such as the the conditional probability or joint probability of two states, the same geCRP method clusters the environment in ways that are more sensitive to different control-related information, such as goal, sub-goal and path information. We perform a series of simulations in three scenariosan open space, a grid world with four rooms and a maze having the same structure as the Hanoi Towerin order to illustrate the characteristics of the different clusters (obtained using different kernels) and their relative benefits for solving planning and control problems.
Nonparametric Problem-Space Clustering: Learning Efficient Codes for Cognitive Control Tasks
Publication type:
Articolo
Publisher:
MDPI,, Basel
Source:
Entropy (Basel, Online) 18 (2016). doi:10.3390/e18020061
info:cnr-pdr/source/autori:Maisto, Domenico; Donnarumma, Francesco; Pezzulo, Giovanni/titolo:Nonparametric Problem-Space Clustering: Learning Efficient Codes for Cognitive Control Tasks/doi:10.3390/e18020061/rivista:Entropy (Basel, Online)/anno:2016/pagin
Date:
2016
Resource Identifier:
http://www.cnr.it/prodotto/i/361311
https://dx.doi.org/10.3390/e18020061
info:doi:10.3390/e18020061
http://www.mdpi.com/1099-4300/18/2/61
Language:
Eng