In this paper, we propose a new algorithm for the joint clustering of two sets of statistical units, N and M, which are also equipped with an adjacency structure represented by a bipartite network. Our model is based on the fuzzy Partition Around Medoids and combines it with techniques for community detection in bipartite complex networks based on Barber modularity maximization. The goal is to produce a partition of N ∪ M into clusters, each of which is also identified by two medoids, one in N and one in M, which represent the typical units in the cluster for each set. The clusters are optimized so that units in the same cluster both have similar values on their attributes and are likely to be adjacent. We test the algorithm on both simulated and real data to show how it captures a wide range of interactions between the distribution of the attributes and the network structure.
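To make the network side of the method concrete, the following is a minimal sketch of Barber modularity for a hard partition of a bipartite network, Q = (1/m) Σ_{i∈N, j∈M} (B_ij − k_i d_j / m) δ(c_i, c_j), where B is the biadjacency matrix, k_i and d_j are the degrees of the units in N and M, and m is the number of edges. It is only an illustration: the abstract does not spell out how the paper combines this quantity with the fuzzy medoid-based objective, and the function name `barber_modularity`, its arguments, and the toy network are assumptions introduced here, not the authors' code.

```python
import numpy as np


def barber_modularity(B, row_labels, col_labels):
    """Barber modularity of a hard partition of a bipartite network.

    B          : (n x m) biadjacency matrix; B[i, j] = 1 if unit i of N
                 is adjacent to unit j of M (hypothetical input format).
    row_labels : cluster label of each unit in N.
    col_labels : cluster label of each unit in M.
    """
    B = np.asarray(B, dtype=float)
    row_labels = np.asarray(row_labels)
    col_labels = np.asarray(col_labels)

    n_edges = B.sum()
    if n_edges == 0:
        raise ValueError("the network has no edges")

    k = B.sum(axis=1)  # degrees of the units in N
    d = B.sum(axis=0)  # degrees of the units in M

    # Expected number of edges between i and j under the bipartite null model.
    expected = np.outer(k, d) / n_edges

    # same[i, j] is True when unit i of N and unit j of M share a cluster.
    same = row_labels[:, None] == col_labels[None, :]

    return float(((B - expected) * same).sum() / n_edges)


# Toy network (assumed for illustration): two disconnected 2-by-2 blocks.
B_demo = np.array([
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
])

# Partition that follows the two blocks.
q_blocks = barber_modularity(B_demo, [0, 0, 1, 1], [0, 0, 1, 1])

# Trivial partition that puts every unit in one cluster.
q_single = barber_modularity(B_demo, [0, 0, 0, 0], [0, 0, 0, 0])
```

On this toy network the block-aligned partition scores higher than the single-cluster partition, which is the behavior a modularity term is meant to reward. A fuzzy variant would replace the Boolean `same` matrix with a product of membership degrees, but the exact form used in the paper is not given in the abstract.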
Fuzzy clustering with Barber modularity regularization / De Giovanni, Livia; D'Urso, Pierpaolo; Federico, Lorenzo; Vitale, Vincenzina. - In: STATISTICS AND COMPUTING. - ISSN 0960-3174. - 34:214(2024), pp. 1-34. [10.1007/s11222-024-10495-5]
Fuzzy clustering with Barber modularity regularization
Livia De Giovanni; Pierpaolo D'Urso; Lorenzo Federico; Vincenzina Vitale
2024
File | Size | Format
---|---|---
s11222-024-10495-5.pdf (Open Access; type: publisher's version; license: Creative Commons) | 1.29 MB | Adobe PDF