Robust fuzzy clustering of multivariate time trajectories

01 Pubblicazione su rivista
D'Urso Pierpaolo, De Giovanni Livia, Massari Riccardo
ISSN: 0888-613X

The detection of patterns in multivariate time series is a relevant task, especially for large datasets. In this paper, four clustering models for multivariate time series are proposed, with the following characteristics. First, the Partitioning Around Medoids (PAM) framework is considered. Among the different approaches to the clustering of multivariate time series, the observation-based is adopted. To cope with the complexity of the features of each multivariate time series and the associated assignment uncertainty a fuzzy clustering approach is adopted. Finally, to neutralize the effect of possible outliers, a robust metric approach is used, i.e., the exponential transformation of dissimilarity measures. The proposed models are robust extensions of the Fuzzy C-Medoids clustering algorithm for multivariate time series. With respect to the management of the time behaviour, four variants are proposed: the Cross-Sectional Fuzzy C-Medoids clustering model with exponential transformation (CS-Exp-FCMd) classifies the multivariate time series taking into account their respective instantaneous features; the Longitudinal Fuzzy C-Medoids clustering model with exponential transformation (L-Exp-FCMd) takes into account the evolutive (longitudinal) features; the Mixed Fuzzy C-Medoids clustering model with exponential transformation (M-Exp-FCMd) which consider simultaneously both the instantaneous and the longitudinal features in the clustering process; the Dynamic Time Warping-based Fuzzy C-Medoids model with exponential transformation (DTW-Exp-FCMd) uses the Dynamic Time Warping (DTW) distance. Three simulation studies show the clustering performance of the proposed models in presence of outliers, compared to their non-robust counterparts, and to other models proposed in the literature. An application on real-world data on the concentration of three pollutants in nineteen stations in the Metropolitan City of Rome shows the relevance of the robustness to outliers in the identification of the clusters.

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma