Temporal text mining (TTM) has recently attracted the attention of scientists as a mean to discover and track in real-time discussions in micro-blogs. However current approaches to temporal mining suffer from efficiency problems when applied to large micro-blog streams, like Twitter, now reaching an average of 500 million tweets per day. We propose a technique, named SAX (based on an algorithm named Symbolic Aggregate Approximation) to discretize the temporal series of terms into a small set of levels, leading to a string for each terms. We then define a subset of "interesting" strings, i.e. Those representing patterns of collective attention. Sliding temporal windows are used to detect clusters of terms with the same string. We show that SAX is more efficient (by orders of magnitude) than other approaches to temporal mining in literature. In this paper, we experiment SAX on the task of event discovery over one year 1% world while Twitter stream.

Stilo, Giovanni; Velardi, Paola. (2014). Time makes sense: Event Discovery in Twitter using Temporal Similarity. In WI-IAT '14 Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) (pp. 186- 193). Doi: 10.1109/WI-IAT.2014.97.

Time makes sense: Event Discovery in Twitter using Temporal Similarity

STILO, GIOVANNI;
2014

Abstract

Temporal text mining (TTM) has recently attracted the attention of scientists as a mean to discover and track in real-time discussions in micro-blogs. However current approaches to temporal mining suffer from efficiency problems when applied to large micro-blog streams, like Twitter, now reaching an average of 500 million tweets per day. We propose a technique, named SAX (based on an algorithm named Symbolic Aggregate Approximation) to discretize the temporal series of terms into a small set of levels, leading to a string for each terms. We then define a subset of "interesting" strings, i.e. Those representing patterns of collective attention. Sliding temporal windows are used to detect clusters of terms with the same string. We show that SAX is more efficient (by orders of magnitude) than other approaches to temporal mining in literature. In this paper, we experiment SAX on the task of event discovery over one year 1% world while Twitter stream.
2014
temporal text mining; Symbolic Aggregate approXimation; event discovery; Twitter mining
Stilo, Giovanni; Velardi, Paola. (2014). Time makes sense: Event Discovery in Twitter using Temporal Similarity. In WI-IAT '14 Proceedings of the 2014 IEEE/WIC/ACM International Joint Conferences on Web Intelligence (WI) and Intelligent Agent Technologies (IAT) (pp. 186- 193). Doi: 10.1109/WI-IAT.2014.97.
File in questo prodotto:
File Dimensione Formato  
Time_Makes_Sense_Event_Discovery_in_Twitter_Using_Temporal_Similarity.pdf

Solo gestori archivio

Tipologia: Versione dell'editore
Licenza: Tutti i diritti riservati
Dimensione 1.13 MB
Formato Adobe PDF
1.13 MB Adobe PDF   Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/253769
Citazioni
  • Scopus 29
  • ???jsp.display-item.citation.isi??? 19
  • OpenAlex ND
social impact