Data mining by evolving agents for clusters discovery and metric learning

02 Publication in a volume
Martino Alessio, Giampieri Mauro, Luzi Massimiliano, Rizzi Antonello
ISSN: 2190-3018

In this paper we propose a novel evolutionary agent-based clustering algorithm in which agents act as individuals of an evolving population, each performing a random walk on a different subset of patterns drawn from the entire dataset. The agents are orchestrated by a customised genetic algorithm and perform clustering and feature selection simultaneously. Unlike standard clustering algorithms, each agent is in charge of discovering well-formed (compact and populated) clusters and, at the same time, a suitable subset of features corresponding to the subspace where such clusters lie. This follows a local metric learning approach, in which each cluster is characterised by its own subset of relevant features. The approach not only leads to a deeper knowledge of the dataset at hand, revealing clusters that are not evident when using the whole set of features, but is also suitable for large datasets, as each agent processes only a small subset of patterns. We show the effectiveness of our algorithm on synthetic datasets and outline some interesting scenarios and extensions for future work.
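
The idea of a cluster being characterised by its own subset of relevant features can be illustrated with a minimal sketch. This is not the algorithm from the paper: the binary feature mask, the function names (`masked_distance`, `compactness`, `agent_fitness`) and the fitness formula are all assumptions made for illustration, and the random walk and the genetic algorithm are not modelled. The sketch only shows why a cluster that is compact in a subspace can look scattered when all features are used.

```python
import numpy as np


def masked_distance(x, y, mask):
    """Euclidean distance restricted to the features selected by a binary mask."""
    diff = (x - y) * mask
    return float(np.sqrt(np.sum(diff ** 2)))


def compactness(points, mask):
    """Mean masked distance of the points from their centroid (lower is more compact)."""
    centroid = points.mean(axis=0)
    return float(np.mean([masked_distance(p, centroid, mask) for p in points]))


def agent_fitness(points, mask, n_sampled):
    """Hypothetical fitness (an assumption, not the paper's formula):
    rewards clusters that are both populated and compact in the masked subspace."""
    cardinality = len(points) / n_sampled
    return cardinality / (1.0 + compactness(points, mask))


rng = np.random.default_rng(0)

# Synthetic cluster: tight in features 0 and 1, pure noise in feature 2.
cluster = np.column_stack([
    rng.normal(0.0, 0.05, 50),
    rng.normal(1.0, 0.05, 50),
    rng.uniform(-10.0, 10.0, 50),
])

full_mask = np.array([1, 1, 1])  # use every feature
sub_mask = np.array([1, 1, 0])   # use only the subspace where the cluster lies

full_score = compactness(cluster, full_mask)
sub_score = compactness(cluster, sub_mask)
full_fitness = agent_fitness(cluster, full_mask, n_sampled=100)
sub_fitness = agent_fitness(cluster, sub_mask, n_sampled=100)

print(f"compactness, all features: {full_score:.3f}")
print(f"compactness, subspace:     {sub_score:.3f}")
```

Under these assumptions, an evolutionary search over masks would favour `sub_mask` for this cluster, since it yields a lower compactness value and therefore a higher fitness than `full_mask`.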
