Complexity vs. performance in granular embedding spaces for graph classification

2020

Proceedings of the 12th International Joint Conference on Computational Intelligence - Volume 1: NCTA

Complexity vs. performance in granular embedding spaces for graph classification

04 Pubblicazione in atti di convegno

Baldini Luca, Martino Alessio, Rizzi Antonello

The most distinctive trait in structural pattern recognition in graph domain is the ability to deal with the organization and relations between the constituent entities of the pattern. Even if this can be convenient and/or necessary in many contexts, most of the state-of the art classiﬁcation techniques can not be deployed directly in the graph domain without ﬁrst embedding graph patterns towards a metric space. Granular Computing is a powerful information processing paradigm that can be employed in order to drive the synthesis of automatic embedding spaces from structured domains. In this paper we investigate several classiﬁcation techniques starting from Granular Computing-based embedding procedures and provide a thorough overview in terms of model complexity, embedding space complexity and performances on several open-access datasets for graph classiﬁcation. We witness that certain classiﬁcation techniques perform poorly both from the point of view of complexity and learning performances as the case of non-linear SVM, suggesting that high dimensionality of the synthesized embedding space can negatively affect the effectiveness of these approaches. On the other hand, linear support vector machines, neuro-fuzzy networks and nearest neighbour classiﬁers have comparable performances in terms of accuracy, with second being the most competitive in terms of structural complexity and the latter being the most competitive in terms of embedding space dimensionality.

embedding spaces granular computing graph edit distances structural pattern recognition