Listing dense subgraphs is a fundamental task with a variety of network analytics applications. A lot of research has been done focusing on k-cliques, i.e. complete subgraphs on k nodes. However, requiring complete connectivity between the nodes of a subgraph may be too restrictive in many real applications. Hence, in this paper, we consider a natural relaxation of cliques, called k-diamonds and defined as cliques of size with one missing edge. We first provide a sequential algorithm that counts and lists all the k-diamonds in large graphs, for any constant k. A parallel extension of the sequential algorithm is then proposed and analyzed in a MapReduce-style model, achieving the same local and total space usage of the state-of-the-art algorithms for k-cliques. The running time is optimal on dense graphs and sqrt(m) larger than k-clique counting if the graph is sparse. Our algorithms compute induced diamonds by analyzing the structure of directed stars formed by the graph nodes and their neighbors.
Finocchi, Irene; Leon Garcia, Renan; Sinaimeri, Blerina. (2024). From Stars to Diamonds: Counting and Listing Almost Complete Subgraphs in Large Networks. COMPUTER JOURNAL, (ISSN: 1460-2067), 67:6, 2151-2161. Doi: 10.1093/comjnl/bxad129.
From Stars to Diamonds: Counting and Listing Almost Complete Subgraphs in Large Networks
Irene Finocchi
;Blerina Sinaimeri
2024
Abstract
Listing dense subgraphs is a fundamental task with a variety of network analytics applications. A lot of research has been done focusing on k-cliques, i.e. complete subgraphs on k nodes. However, requiring complete connectivity between the nodes of a subgraph may be too restrictive in many real applications. Hence, in this paper, we consider a natural relaxation of cliques, called k-diamonds and defined as cliques of size with one missing edge. We first provide a sequential algorithm that counts and lists all the k-diamonds in large graphs, for any constant k. A parallel extension of the sequential algorithm is then proposed and analyzed in a MapReduce-style model, achieving the same local and total space usage of the state-of-the-art algorithms for k-cliques. The running time is optimal on dense graphs and sqrt(m) larger than k-clique counting if the graph is sparse. Our algorithms compute induced diamonds by analyzing the structure of directed stars formed by the graph nodes and their neighbors.| File | Dimensione | Formato | |
|---|---|---|---|
|
ComputerJournalProof-bxad129.pdf
Solo gestori archivio
Descrizione: Proof (accepted November 2023)
Tipologia:
Documento in Pre-print
Licenza:
Tutti i diritti riservati
Dimensione
1.2 MB
Formato
Adobe PDF
|
1.2 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



