Finding dense subgraphs in large graphs is a key primitive in a variety of real-world application domains, encompass-ing social network analytics, event detection, biology, and finance. In most such applications, one typically aims at finding several (possibly overlapping) dense subgraphs which might correspond to communities in social networks or in-teresting events. While a large amount of work is devoted to finding a single densest subgraph, perhaps surprisingly, the problem of finding several dense subgraphs with limited overlap has not been studied in a principled way, to the best of our knowledge. In this work we define and study a natural generalization of the densest subgraph problem, where the main goal is to find at most k subgraphs with maximum to-tal aggregate density, while satisfying an upper bound on the pairwise Jaccard coefficient between the sets of nodes of the subgraphs. After showing that such a problem is NP-Hard, we devise an efficient algorithm that comes with provable guarantees in some cases of interest, as well as, an efficient practical heuristic. Our extensive evaluation on large real-world graphs confirms the efficiency and effectiveness of our algorithms.
Balalau, Oana Denisa; Bonchi, Francesco; Chan, T-H. Hubert; Gullo, Francesco; Sozio, Mauro. (2015). Finding Subgraphs with Maximum Total Density and Limited Overlap. In WSDM 2015 - Proceedings of the 8th ACM International Conference on Web Search and Data Mining (pp. 379- 388). Doi: 10.1145/2684822.2685298.
Finding Subgraphs with Maximum Total Density and Limited Overlap
Sozio, Mauro
2015
Abstract
Finding dense subgraphs in large graphs is a key primitive in a variety of real-world application domains, encompass-ing social network analytics, event detection, biology, and finance. In most such applications, one typically aims at finding several (possibly overlapping) dense subgraphs which might correspond to communities in social networks or in-teresting events. While a large amount of work is devoted to finding a single densest subgraph, perhaps surprisingly, the problem of finding several dense subgraphs with limited overlap has not been studied in a principled way, to the best of our knowledge. In this work we define and study a natural generalization of the densest subgraph problem, where the main goal is to find at most k subgraphs with maximum to-tal aggregate density, while satisfying an upper bound on the pairwise Jaccard coefficient between the sets of nodes of the subgraphs. After showing that such a problem is NP-Hard, we devise an efficient algorithm that comes with provable guarantees in some cases of interest, as well as, an efficient practical heuristic. Our extensive evaluation on large real-world graphs confirms the efficiency and effectiveness of our algorithms.| File | Dimensione | Formato | |
|---|---|---|---|
|
density.pdf
Open Access
Tipologia:
Documento in Post-print
Licenza:
Tutti i diritti riservati
Dimensione
687.21 kB
Formato
Adobe PDF
|
687.21 kB | Adobe PDF | Visualizza/Apri |
Pubblicazioni consigliate
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.



