This paper proposes clustering methods for large-scale stationary time series using a fuzzy approach. Adopting partitioning around centroids (PAC) and partitioning around medoids (PAM), and focusing on distributional properties of individual series, we classify a large set of time series by transforming the series into probability density functions via nonparametric density estimation, such as the kernel estimation, and using a proper distance measure, such as the Hellinger distance, between density functions.We use simulations and two real applications to demonstrate the good performance and effectiveness of the proposed clustering methods in finite samples. The proposed methods are also applicable to the spectral density functions if one focuses on the serial dependence of individual series.

D'Urso, Pierpaolo; De Giovanni, Livia; Tsay, Ruey; Vitale, Vincenzina. (2026). Clustering Large-scale Time Series. JOURNAL OF CLASSIFICATION, (ISSN: 0176-4268), 42: 236-274. Doi: 10.1007/s00357-025-09535-0.

Clustering Large-scale Time Series

De Giovanni, Livia;
2026

Abstract

This paper proposes clustering methods for large-scale stationary time series using a fuzzy approach. Adopting partitioning around centroids (PAC) and partitioning around medoids (PAM), and focusing on distributional properties of individual series, we classify a large set of time series by transforming the series into probability density functions via nonparametric density estimation, such as the kernel estimation, and using a proper distance measure, such as the Hellinger distance, between density functions.We use simulations and two real applications to demonstrate the good performance and effectiveness of the proposed clustering methods in finite samples. The proposed methods are also applicable to the spectral density functions if one focuses on the serial dependence of individual series.
2026
Large-scale dependent data · Kernel density estimation · Hellinger distance · Fuzzy clustering · Partitioning around centroids · Partitioning around medoids
D'Urso, Pierpaolo; De Giovanni, Livia; Tsay, Ruey; Vitale, Vincenzina. (2026). Clustering Large-scale Time Series. JOURNAL OF CLASSIFICATION, (ISSN: 0176-4268), 42: 236-274. Doi: 10.1007/s00357-025-09535-0.
File in questo prodotto:
File Dimensione Formato  
s00357-025-09535-0J_Cl_2026.pdf

Open Access

Tipologia: Documento in Post-print
Licenza: Creative commons
Dimensione 2.49 MB
Formato Adobe PDF
2.49 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/262479
Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact