We predict disease-genes relations on the human interactome network using a methodology that jointly learns functional and connectivity patterns surrounding proteins. Contrary to other data structures, the interactome is characterised by high incompleteness and absence of explicit negative knowledge, which makes predictive tasks particularly challenging. To exploit at best latent information in the network, we propose an extended version of random walks, named Random Watcher-Walker (RW2), which is shown to perform better than other state-of-the-art algorithms. We also show that the performance of RW2 and other compared state-of-the-art algorithms is extremely sensitive to the interactome used, and to the adopted disease categorisations, since this influences the ability to capture regularities in presence of sparsity and incompleteness.

Madeddu, Lorenzo; Stilo, Giovanni; Velardi, Paola. (2020). A Feature-Learning based method for the disease-gene prediction problem. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, (ISSN: 1748-5673), 24:1, 1-21. Doi: 10.1504/IJDMB.2020.10031422.

A Feature-Learning based method for the disease-gene prediction problem

Lorenzo Madeddu;Giovanni Stilo
Membro del Collaboration Group
;
2020

Abstract

We predict disease-genes relations on the human interactome network using a methodology that jointly learns functional and connectivity patterns surrounding proteins. Contrary to other data structures, the interactome is characterised by high incompleteness and absence of explicit negative knowledge, which makes predictive tasks particularly challenging. To exploit at best latent information in the network, we propose an extended version of random walks, named Random Watcher-Walker (RW2), which is shown to perform better than other state-of-the-art algorithms. We also show that the performance of RW2 and other compared state-of-the-art algorithms is extremely sensitive to the interactome used, and to the adopted disease categorisations, since this influences the ability to capture regularities in presence of sparsity and incompleteness.
2020
network medicine; disease gene prediction; disease gene prioritisation; node embedding; random walks; graph-based methods; biological networks; complex diseases
Madeddu, Lorenzo; Stilo, Giovanni; Velardi, Paola. (2020). A Feature-Learning based method for the disease-gene prediction problem. INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, (ISSN: 1748-5673), 24:1, 1-21. Doi: 10.1504/IJDMB.2020.10031422.
File in questo prodotto:
File Dimensione Formato  
IJDMB_Madeddu et al(preprint).pdf

Solo gestori archivio

Tipologia: Documento in Pre-print
Licenza: Tutti i diritti riservati
Dimensione 483.72 kB
Formato Adobe PDF
483.72 kB Adobe PDF   Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/253813
Citazioni
  • Scopus 14
  • ???jsp.display-item.citation.isi??? 10
  • OpenAlex ND
social impact