Modelling and recognition of protein contact networks by multiple kernel learning and dissimilarity representations

Martino, Alessio; Enrico De Santis,; Giuliani, Alessandro; Rizzi, Antonello

doi:10.3390/e22070794

Multiple kernel learning is a paradigm which employs a properly constructed chain of kernel functions able to simultaneously analyse different data or different representations of the same data. In this paper, we propose an hybrid classification system based on a linear combination of multiple kernels defined over multiple dissimilarity spaces. The core of the training procedure is the joint optimisation of kernel weights and representatives selection in the dissimilarity spaces. This equips the system with a two-fold knowledge discovery phase: by analysing the weights, it is possible to check which representations are more suitable for solving the classification problem, whereas the pivotal patterns selected as representatives can give further insights on the modelled system, possibly with the help of field-experts. The proposed classification system is tested on real proteomic data in order to predict proteins' functional role starting from their folded structure: specifically, a set of eight representations are drawn from the graph-based protein folded description. The proposed multiple kernel-based system has also been benchmarked against a clustering-based classification system also able to exploit multiple dissimilarities simultaneously. Computational results show remarkable classification capabilities and the knowledge discovery analysis is in line with current biological knowledge, suggesting the reliability of the proposed system.

Martino, Alessio; De Santis, Enrico; Giuliani, Alessandro; Rizzi, Antonello. (2020). Modelling and recognition of protein contact networks by multiple kernel learning and dissimilarity representations. ENTROPY, (ISSN: 1099-4300), 22:7, 1-32. Doi: 10.3390/e22070794.

Modelling and recognition of protein contact networks by multiple kernel learning and dissimilarity representations

Alessio Martino;Enrico De Santis;Alessandro Giuliani;Antonello Rizzi

2020

Abstract

Multiple kernel learning is a paradigm which employs a properly constructed chain of kernel functions able to simultaneously analyse different data or different representations of the same data. In this paper, we propose an hybrid classification system based on a linear combination of multiple kernels defined over multiple dissimilarity spaces. The core of the training procedure is the joint optimisation of kernel weights and representatives selection in the dissimilarity spaces. This equips the system with a two-fold knowledge discovery phase: by analysing the weights, it is possible to check which representations are more suitable for solving the classification problem, whereas the pivotal patterns selected as representatives can give further insights on the modelled system, possibly with the help of field-experts. The proposed classification system is tested on real proteomic data in order to predict proteins' functional role starting from their folded structure: specifically, a set of eight representations are drawn from the graph-based protein folded description. The proposed multiple kernel-based system has also been benchmarked against a clustering-based classification system also able to exploit multiple dissimilarities simultaneously. Computational results show remarkable classification capabilities and the knowledge discovery analysis is in line with current biological knowledge, suggesting the reliability of the proposed system.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione
	
				2020
			
	Parole chiave
	
				computational biology
dissimilarity spaces
Kernel methods
protein contact networks
support vector machines
systems biology
			
	Citazione
	
				Martino, Alessio; De Santis, Enrico; Giuliani, Alessandro; Rizzi, Antonello. (2020). Modelling and recognition of protein contact networks by multiple kernel learning and dissimilarity representations. ENTROPY, (ISSN: 1099-4300), 22:7, 1-32. Doi: 10.3390/e22070794.
			
	Appare nelle tipologie:
	
				01.1 - Articolo su rivista (Article)

File in questo prodotto:

File	Dimensione	Formato
Martino_Modelling-and-recognition_2020.pdf Open Access Tipologia: Versione dell'editore Licenza: Creative commons Dimensione 1.29 MB Formato Adobe PDF Visualizza/Apri	1.29 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/214543

Citazioni

7

5

ND

IRIS - Institutional Research Information System