Stochastic information granules extraction for graph embedding and classification / Baldini, Luca; Martino, Alessio; Rizzi, Antonello. - Proceedings of the 11th International Joint Conference on Computational Intelligence, (2019), pp. 391-402. (11th International Joint Conference on Computational Intelligence (NCTA), Vienna, Austria, 17-19 September, 2019). [10.5220/0008149403910402].

Stochastic information granules extraction for graph embedding and classification

Baldini, Luca; Martino, Alessio; Rizzi, Antonello
2019

Abstract

Graphs are data structures that can efficiently describe real-world systems and, as such, have been used extensively in recent years across many branches of science, including machine learning engineering. However, the design of efficient graph-based pattern recognition systems is bottlenecked by the intrinsic problem of how to properly match two graphs. In this paper, we investigate a granular computing approach to the design of a general-purpose graph-based classification system. The overall framework relies on the extraction of meaningful pivotal substructures, on top of which an embedding space can be built and in which classification can be performed without limitations. Because this extraction step is central to the framework, we investigate whether information can be preserved by performing a stochastic extraction on the training data instead of an exhaustive extraction procedure, which is likely to be infeasible for large datasets. Tests on benchmark datasets show that stochastic extraction can yield a meaningful set of pivotal substructures with a much lower memory footprint and overall computational burden, making the proposed strategies suitable for big datasets as well.
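The following is a minimal Python sketch of the general idea described in the abstract: sample substructures at random from the training graphs, keep the most frequent ones as pivots, and embed each graph as a vector of pivot counts. It is not the authors' implementation. The function names, the BFS-based sampling, and the `signature` function (a crude exact-match stand-in for inexact graph matching) are all assumptions made for illustration.

```python
import random
from collections import Counter, deque

# A graph is a pair (labels, adj): labels maps node -> label,
# adj maps node -> set of neighbouring nodes (undirected).

def bfs_subgraph(labels, adj, seed, max_nodes):
    """Collect up to max_nodes nodes reachable from seed, in BFS order."""
    visited, seen, queue = [seed], {seed}, deque([seed])
    while queue and len(visited) < max_nodes:
        node = queue.popleft()
        for nb in sorted(adj[node]):
            if nb not in seen and len(visited) < max_nodes:
                seen.add(nb)
                visited.append(nb)
                queue.append(nb)
    return visited

def signature(labels, adj, nodes):
    """Hypothetical stand-in for graph matching: sorted node labels plus edge count."""
    node_set = set(nodes)
    n_edges = sum(1 for u in node_set for v in adj[u] if v in node_set) // 2
    return (tuple(sorted(labels[u] for u in node_set)), n_edges)

def stochastic_extraction(graphs, n_samples, max_order, rng):
    """Draw n_samples substructures: random graph, random seed node, random order."""
    bucket = Counter()
    for _ in range(n_samples):
        labels, adj = rng.choice(graphs)
        seed = rng.choice(sorted(labels))
        order = rng.randint(1, max_order)
        nodes = bfs_subgraph(labels, adj, seed, order)
        bucket[signature(labels, adj, nodes)] += 1
    return bucket

def embed(graph, alphabet, max_order):
    """Count how often each pivot in alphabet matches a BFS substructure of graph."""
    labels, adj = graph
    counts = Counter()
    for seed in labels:
        for order in range(1, max_order + 1):
            nodes = bfs_subgraph(labels, adj, seed, order)
            counts[signature(labels, adj, nodes)] += 1
    return [counts[pivot] for pivot in alphabet]

# Usage: two toy labelled graphs, 50 random draws, 3 most frequent pivots.
g1 = ({0: "a", 1: "b", 2: "a"}, {0: {1}, 1: {0, 2}, 2: {1}})
g2 = ({0: "b", 1: "b"}, {0: {1}, 1: {0}})
bucket = stochastic_extraction([g1, g2], 50, 2, random.Random(0))
alphabet = [sig for sig, _ in bucket.most_common(3)]
vectors = [embed(g, alphabet, 2) for g in (g1, g2)]
```

The resulting `vectors` are fixed-length and could be passed to any vector-based classifier. The memory saving in this sketch comes from storing only `n_samples` draws rather than every substructure of every training graph.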
granular computing
graph embedding
inexact graph matching
pattern recognition
supervised learning
Files in this record:

Baldini_Stochastic-information-granules_2019.pdf
Open Access
Type: Publisher's version
License: Creative Commons
Size: 386.05 kB
Format: Adobe PDF

Baldini_Stochastic_Cover_2019.pdf
Open Access
Description: cover
Type: Other attached material
License: DRM (digital rights management) not defined
Size: 3.21 MB
Format: Adobe PDF

Baldini_Stochastic_TOC_2019.pdf
Open Access
Description: table of contents
Type: Other attached material
License: DRM (digital rights management) not defined
Size: 193.89 kB
Format: Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11385/214579
Citations
  • Scopus 23
  • Web of Science (ISI) 18