The problem of finding densest subgraphs has received increasing attention in recent years finding applications in biology, finance, as well as social network analysis. The k-clique densest subgraph problem is a generalization of the densest subgraph problem, where the objective is to find a subgraph maximizing the ratio between the number of k-cliques in the subgraph and its number of nodes. It includes as a special case the problem of finding subgraphs with largest average number of triangles (k = 3), which plays an important role in social network analysis. Moreover, algorithms that deal with larger values of k can effectively find quasi-cliques. The densest subgraph problem can be solved in polynomial time with algorithms based on maximum flow, linear programming or a recent approach based on convex optimization. In particular, the latter approach can scale to graphs containing tens of billions of edges. While finding a densest subgraph in large graphs is no longer a bottleneck, the k-clique densest subgraph remains challenging even when k = 3. Our work aims at developing near-optimal and exact algorithms for the k-clique densest subgraph problem on large real-world graphs. We give a surprisingly simple procedure that can be employed to find the maximal k-clique densest subgraph in large-real world graphs. By leveraging appealing properties of existing results, we combine it with a recent approach for listing all k-cliques in a graph and a sampling scheme, obtaining the state-of-the-art approaches for the aforementioned problem. Our theoretical results are complemented with an extensive experimental evaluation showing the effectiveness of our approach in large real-world graphs.

Su, Bintao; Danisch, Maximilien; Hubert Chan, T. -H.; Sozio, Mauro. (2020). KClist++: A Simple Algorithm for Finding k-Clique Densest Subgraphs in Large Graphs. In Proceedings of the VLDB Endowment (pp. 1628- 1640). Doi: 10.14778/3401960.3401962. https://dl.acm.org/doi/pdf/10.14778/3401960.3401962.

KClist++: A Simple Algorithm for Finding k-Clique Densest Subgraphs in Large Graphs

Mauro Sozio
2020

Abstract

The problem of finding densest subgraphs has received increasing attention in recent years finding applications in biology, finance, as well as social network analysis. The k-clique densest subgraph problem is a generalization of the densest subgraph problem, where the objective is to find a subgraph maximizing the ratio between the number of k-cliques in the subgraph and its number of nodes. It includes as a special case the problem of finding subgraphs with largest average number of triangles (k = 3), which plays an important role in social network analysis. Moreover, algorithms that deal with larger values of k can effectively find quasi-cliques. The densest subgraph problem can be solved in polynomial time with algorithms based on maximum flow, linear programming or a recent approach based on convex optimization. In particular, the latter approach can scale to graphs containing tens of billions of edges. While finding a densest subgraph in large graphs is no longer a bottleneck, the k-clique densest subgraph remains challenging even when k = 3. Our work aims at developing near-optimal and exact algorithms for the k-clique densest subgraph problem on large real-world graphs. We give a surprisingly simple procedure that can be employed to find the maximal k-clique densest subgraph in large-real world graphs. By leveraging appealing properties of existing results, we combine it with a recent approach for listing all k-cliques in a graph and a sampling scheme, obtaining the state-of-the-art approaches for the aforementioned problem. Our theoretical results are complemented with an extensive experimental evaluation showing the effectiveness of our approach in large real-world graphs.
2020
Su, Bintao; Danisch, Maximilien; Hubert Chan, T. -H.; Sozio, Mauro. (2020). KClist++: A Simple Algorithm for Finding k-Clique Densest Subgraphs in Large Graphs. In Proceedings of the VLDB Endowment (pp. 1628- 1640). Doi: 10.14778/3401960.3401962. https://dl.acm.org/doi/pdf/10.14778/3401960.3401962.
File in questo prodotto:
File Dimensione Formato  
Sozio_ A Simple Algorithm for Finding k-Clique Densest.pdf

Open Access

Tipologia: Versione dell'editore
Licenza: Creative commons
Dimensione 765.46 kB
Formato Adobe PDF
765.46 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/251382
Citazioni
  • Scopus 43
  • ???jsp.display-item.citation.isi??? 33
  • OpenAlex ND
social impact