Listing dense subgraphs is a fundamental task with a variety of network analytics applications. A lot of research has been done focusing on k-cliques, i.e. complete subgraphs on k nodes. However, requiring complete connectivity between the nodes of a subgraph may be too restrictive in many real applications. Hence, in this paper, we consider a natural relaxation of cliques, called k-diamonds and defined as cliques of size with one missing edge. We first provide a sequential algorithm that counts and lists all the k-diamonds in large graphs, for any constant k. A parallel extension of the sequential algorithm is then proposed and analyzed in a MapReduce-style model, achieving the same local and total space usage of the state-of-the-art algorithms for k-cliques. The running time is optimal on dense graphs and sqrt(m) larger than k-clique counting if the graph is sparse. Our algorithms compute induced diamonds by analyzing the structure of directed stars formed by the graph nodes and their neighbors.
From Stars to Diamonds: Counting and Listing Almost Complete Subgraphs in Large Networks / Finocchi, Irene; Leon Garcia, Renan; Sinaimeri, Blerina. - In: COMPUTER JOURNAL. - ISSN 1460-2067. - 67:6(2024), pp. 2151-2161. [10.1093/comjnl/bxad129]
From Stars to Diamonds: Counting and Listing Almost Complete Subgraphs in Large Networks
Irene Finocchi
;Blerina Sinaimeri
2024
Abstract
Listing dense subgraphs is a fundamental task with a variety of network analytics applications. A lot of research has been done focusing on k-cliques, i.e. complete subgraphs on k nodes. However, requiring complete connectivity between the nodes of a subgraph may be too restrictive in many real applications. Hence, in this paper, we consider a natural relaxation of cliques, called k-diamonds and defined as cliques of size with one missing edge. We first provide a sequential algorithm that counts and lists all the k-diamonds in large graphs, for any constant k. A parallel extension of the sequential algorithm is then proposed and analyzed in a MapReduce-style model, achieving the same local and total space usage of the state-of-the-art algorithms for k-cliques. The running time is optimal on dense graphs and sqrt(m) larger than k-clique counting if the graph is sparse. Our algorithms compute induced diamonds by analyzing the structure of directed stars formed by the graph nodes and their neighbors.File | Dimensione | Formato | |
---|---|---|---|
ComputerJournalProof-bxad129.pdf
Solo gestori archivio
Descrizione: Proof (accepted November 2023)
Tipologia:
Documento in Pre-print
Licenza:
Tutti i diritti riservati
Dimensione
1.2 MB
Formato
Adobe PDF
|
1.2 MB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.