(Hyper)Graph embedding and classification via simplicial complexes

Martino, Alessio; Giuliani, Alessandro; Rizzi, Antonello

doi:10.3390/a12110223

This paper investigates a novel graph embedding procedure based on simplicial complexes. Inherited from algebraic topology, simplicial complexes are collections of increasing-order simplices (e.g., points, lines, triangles, tetrahedrons) which can be interpreted as possibly meaningful substructures (i.e., information granules) on the top of which an embedding space can be built by means of symbolic histograms. In the embedding space, any Euclidean pattern recognition system can be used, possibly equipped with feature selection capabilities in order to select the most informative symbols. The selected symbols can be analysed by ﬁeld-experts in order to extract further knowledge about the process to be modelled by the learning system, hence the proposed modelling strategy can be considered as a grey-box. The proposed embedding has been tested on thirty benchmark datasets for graph classiﬁcation and, further, we propose two real-world applications, namely predicting proteins’ enzymatic function and solubility propensity starting from their 3D structure in order to give an example of the knowledge discovery phase which can be carried out starting from the proposed embedding strategy.

Martino, Alessio; Giuliani, Alessandro; Rizzi, Antonello. (2019). (Hyper)Graph embedding and classification via simplicial complexes. ALGORITHMS, (ISSN: 1999-4893), 12:11, 223-243. Doi: 10.3390/a12110223.

(Hyper)Graph embedding and classification via simplicial complexes

Martino, Alessio;Giuliani, Alessandro;Rizzi, Antonello

2019

Abstract

This paper investigates a novel graph embedding procedure based on simplicial complexes. Inherited from algebraic topology, simplicial complexes are collections of increasing-order simplices (e.g., points, lines, triangles, tetrahedrons) which can be interpreted as possibly meaningful substructures (i.e., information granules) on the top of which an embedding space can be built by means of symbolic histograms. In the embedding space, any Euclidean pattern recognition system can be used, possibly equipped with feature selection capabilities in order to select the most informative symbols. The selected symbols can be analysed by ﬁeld-experts in order to extract further knowledge about the process to be modelled by the learning system, hence the proposed modelling strategy can be considered as a grey-box. The proposed embedding has been tested on thirty benchmark datasets for graph classiﬁcation and, further, we propose two real-world applications, namely predicting proteins’ enzymatic function and solubility propensity starting from their 3D structure in order to give an example of the knowledge discovery phase which can be carried out starting from the proposed embedding strategy.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2019
			
	Parole chiave
	
				granular computing
embedding spaces
graph embedding
topological data analysis
simplicial complexes
computational biology
protein contact networks
complex networks
complex systems
			
	Citazione
	
				Martino, Alessio; Giuliani, Alessandro; Rizzi, Antonello. (2019). (Hyper)Graph embedding and classification via simplicial complexes. ALGORITHMS, (ISSN: 1999-4893), 12:11, 223-243. Doi: 10.3390/a12110223.
			
	Appare nelle tipologie:
	
				01.1 - Articolo su rivista (Article)

File in questo prodotto:

File	Dimensione	Formato
Martino_(Hyper)Graph-embedding_2019.pdf Open Access Tipologia: Versione dell'editore Licenza: Creative commons Dimensione 1.39 MB Formato Adobe PDF Visualizza/Apri	1.39 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/214555

Citazioni

28

23

ND

IRIS - Institutional Research Information System