Matching problems are ubiquitous. They occur in economic markets, labor markets, internet advertising, and elsewhere. In this paper we focus on an application of matching for social media. Our goal is to distribute content from information suppliers to information consumers. We seek to maximize the overall relevance of the matched content from suppliers to consumers while regulating the overall activity, e.g., ensuring that no consumer is overwhelmed with data and that all suppliers have chances to deliver their content. We propose two matching algorithms, GreedyMR and StackMR, geared for the MapReduce paradigm. Both algorithms have provable approximation guarantees, and in practice they produce high-quality solutions. While both algorithms scale extremely well, we can show that Stack-MR requires only a poly-logarithmic number of MapReduce steps, making it an attractive option for applications with very large datasets. We experimentally show the trade-offs between quality and efficiency of our solutions on two large datasets coming from real-world social-media web sites. © 2011 VLDB Endowment.

Morales, G. F.; Gionis, A.; Sozio, Mauro. (2011). Social content matching in MapReduce. In Social content matching in MapReduce (pp. 460- 469). Doi: 10.14778/1988776.1988782.

Social content matching in MapReduce

Sozio M.
2011

Abstract

Matching problems are ubiquitous. They occur in economic markets, labor markets, internet advertising, and elsewhere. In this paper we focus on an application of matching for social media. Our goal is to distribute content from information suppliers to information consumers. We seek to maximize the overall relevance of the matched content from suppliers to consumers while regulating the overall activity, e.g., ensuring that no consumer is overwhelmed with data and that all suppliers have chances to deliver their content. We propose two matching algorithms, GreedyMR and StackMR, geared for the MapReduce paradigm. Both algorithms have provable approximation guarantees, and in practice they produce high-quality solutions. While both algorithms scale extremely well, we can show that Stack-MR requires only a poly-logarithmic number of MapReduce steps, making it an attractive option for applications with very large datasets. We experimentally show the trade-offs between quality and efficiency of our solutions on two large datasets coming from real-world social-media web sites. © 2011 VLDB Endowment.
2011
Morales, G. F.; Gionis, A.; Sozio, Mauro. (2011). Social content matching in MapReduce. In Social content matching in MapReduce (pp. 460- 469). Doi: 10.14778/1988776.1988782.
File in questo prodotto:
File Dimensione Formato  
p460-morales.pdf

Open Access

Tipologia: Documento in Post-print
Licenza: Tutti i diritti riservati
Dimensione 670.04 kB
Formato Adobe PDF
670.04 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/261225
Citazioni
  • Scopus 34
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex 46
social impact