In this paper we perform a large-scale homophily analysis on Twitter using a hierarchical representation of users' interests which we call a Twixonomy. In order to build a population, community, or single-user Twixonomy we first associate "topical" friends in users' friendship lists (i.e. friends representing an interest rather than a social relation between peers) with Wikipedia categories. A word-sense disambiguation algorithm is used to select the appropriate wikipage for each topical friend. Starting from the set of wikipages representing "primitive" interests, we extract all paths connecting these pages with topmost Wikipedia category nodes, and we then prune the resulting graph G efficiently so as to induce a direct acyclic graph. This graph is the Twixonomy. Then, to analyze homophily, we compare different methods to detect communities in a peer friends Twitter network, and then for each community we compute the degree of homophily on the basis of a measure of pairwise semantic similarity.We show that the Twixonomy provides a means for describing users' interests in a compact and readable way and allows for a fine-grained homophily analysis. Furthermore, we show that mid-low level categories in the Twixonomy represent the best balance between informativeness and compactness of the representation.

Faralli, Stefano; Stilo, Giovanni; Velardi, Paola. (2015). Large scale homophily analysis in twitter using a twixonomy. In IJCAI International Joint Conference on Artificial Intelligence (pp. 2334- 2340). Isbn: 9781577357384. https://www.ijcai.org/Abstract/15/330.

Large scale homophily analysis in twitter using a twixonomy

Stilo Giovanni
Membro del Collaboration Group
;
2015

Abstract

In this paper we perform a large-scale homophily analysis on Twitter using a hierarchical representation of users' interests which we call a Twixonomy. In order to build a population, community, or single-user Twixonomy we first associate "topical" friends in users' friendship lists (i.e. friends representing an interest rather than a social relation between peers) with Wikipedia categories. A word-sense disambiguation algorithm is used to select the appropriate wikipage for each topical friend. Starting from the set of wikipages representing "primitive" interests, we extract all paths connecting these pages with topmost Wikipedia category nodes, and we then prune the resulting graph G efficiently so as to induce a direct acyclic graph. This graph is the Twixonomy. Then, to analyze homophily, we compare different methods to detect communities in a peer friends Twitter network, and then for each community we compute the degree of homophily on the basis of a measure of pairwise semantic similarity.We show that the Twixonomy provides a means for describing users' interests in a compact and readable way and allows for a fine-grained homophily analysis. Furthermore, we show that mid-low level categories in the Twixonomy represent the best balance between informativeness and compactness of the representation.
2015
9781577357384
Artificial Intelligence
Faralli, Stefano; Stilo, Giovanni; Velardi, Paola. (2015). Large scale homophily analysis in twitter using a twixonomy. In IJCAI International Joint Conference on Artificial Intelligence (pp. 2334- 2340). Isbn: 9781577357384. https://www.ijcai.org/Abstract/15/330.
File in questo prodotto:
File Dimensione Formato  
[ c.11 ] - [20] IJCAI 2015.pdf

Open Access

Tipologia: Versione dell'editore
Licenza: Tutti i diritti riservati
Dimensione 1.05 MB
Formato Adobe PDF
1.05 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/253774
Citazioni
  • Scopus 26
  • ???jsp.display-item.citation.isi??? 18
  • OpenAlex ND
social impact