Model-based biclustering of multivariate longitudinal trajectories

04 Pubblicazione in atti di convegno
Alfò M., Marino M. F., Martella F.

Model-based clustering represents nowadays a popular tool of analysis thanks to its probabilistic foundations and its great flexibility. To deal with
multivariate longitudinal sequences, standard approaches need to be extended to accommodate the peculiarities of such kind of data. These come
in the form of three-way data: the first dimension identifies individuals, the second dimension identifies variables, the third one identifies time
occasions. A method for simultaneous clustering of subjects and multivariate outcomes repeatedly recorded over time is proposed. In particular,
a finite mixture of generalized linear models is considered to cluster individuals; within each component of the finite mixture, a flexible and
parsimonious parameterization of the corresponding canonical parameter is adopted to identify clusters of outcomes evolving in a similar manner
across time. This allows us to obtain clusters of individuals that share common trajectories for one of more outcomes over time and, consequently,
a dimensionality reduction on the first two dimensions of three-way data structure. Parameter estimates are derived within a maximum likelihood
framework, by considering an indirect approach based on an extended expectation-maximization algorithm.

© Università degli Studi di Roma "La Sapienza" - Piazzale Aldo Moro 5, 00185 Roma