Efficient pruning of large knowledge graphs

Faralli, Stefano; Finocchi, Irene; Simone Paolo Ponzetto,; Velardi, Paola

doi:10.24963/ijcai.2018/564

In this paper we present an efficient and highly accurate algorithm to prune noisy or over-ambiguous knowledge graphs given as input an extensional definition of a domain of interest, namely as a set of instances or concepts. Our method climbs the graph in a bottom-up fashion, iteratively layering the graph and pruning nodes and edges in each layer while not compromising the connectivity of the set of input nodes. Iterative layering and protection of pre-defined nodes allow to extract semantically coherent DAG structures from noisy or over-ambiguous cyclic graphs, without loss of information and without incurring in computational bottlenecks, which are the main problem of stateof- the-art methods for cleaning large, i.e., Webscale, knowledge graphs. We apply our algorithm to the tasks of pruning automatically acquired taxonomies using benchmarking data from a SemEval evaluation exercise, as well as the extraction of a domain-adapted taxonomy from theWikipedia category hierarchy. The results show the superiority of our approach over state-of-art algorithms in terms of both output quality and computational efficiency.

Faralli, Stefano; Finocchi, Irene; Paolo Ponzetto, Simone; Velardi, Paola. (2018). Efficient pruning of large knowledge graphs. In Proceedings of 27th International Joint Conference on Artificial Intelligence (pp. 4055- 4063). Doi: 10.24963/ijcai.2018/564. https://www.ijcai.org/Proceedings/2018/564.

Efficient pruning of large knowledge graphs

Stefano Faralli;Irene Finocchi;Simone Paolo Ponzetto;Paola Velardi

2018

Abstract

In this paper we present an efficient and highly accurate algorithm to prune noisy or over-ambiguous knowledge graphs given as input an extensional definition of a domain of interest, namely as a set of instances or concepts. Our method climbs the graph in a bottom-up fashion, iteratively layering the graph and pruning nodes and edges in each layer while not compromising the connectivity of the set of input nodes. Iterative layering and protection of pre-defined nodes allow to extract semantically coherent DAG structures from noisy or over-ambiguous cyclic graphs, without loss of information and without incurring in computational bottlenecks, which are the main problem of stateof- the-art methods for cleaning large, i.e., Webscale, knowledge graphs. We apply our algorithm to the tasks of pruning automatically acquired taxonomies using benchmarking data from a SemEval evaluation exercise, as well as the extraction of a domain-adapted taxonomy from theWikipedia category hierarchy. The results show the superiority of our approach over state-of-art algorithms in terms of both output quality and computational efficiency.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del convegno
	
				2018
			
	Parole chiave
	
				knowledge graphs; Wikipedia category graph; pruning knowledge graphs
			
	Citazione
	
				Faralli, Stefano; Finocchi, Irene; Paolo Ponzetto, Simone; Velardi, Paola. (2018). Efficient pruning of large knowledge graphs. In Proceedings of 27th International Joint Conference on Artificial Intelligence (pp. 4055- 4063). Doi: 10.24963/ijcai.2018/564. https://www.ijcai.org/Proceedings/2018/564.
			
	Appare nelle tipologie:
	
				04.1 - Contributo in Atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
Velardi_Efficient_2018.pdf Open Access Tipologia: Versione dell'editore Licenza: DRM (Digital rights management) non definiti Dimensione 292.83 kB Formato Adobe PDF Visualizza/Apri	292.83 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/192579

Citazioni

18

10

11

IRIS - Institutional Research Information System