Unsupervised Boosting-Based Autoencoder Ensembles for Outlier Detection

Sarvari, H.; Domeniconi, C.; Prenkaj, B.; Stilo, Giovanni

doi:10.1007/978-3-030-75762-5_8

Autoencoders have been recently applied to outlier detection. However, neural networks are known to be vulnerable to overfitting, and therefore have limited potential in the unsupervised outlier detection setting. The majority of existing deep learning methods for anomaly detection is sensitive to contamination of the training data to anomalous instances. To overcome the aforementioned limitations we develop a Boosting-based Autoencoder Ensemble approach (BAE). BAE is an unsupervised ensemble method that, similarly to boosting, builds an adaptive cascade of autoencoders to achieve improved and robust results. BAE trains the autoencoder components sequentially by performing a weighted sampling of the data, aimed at reducing the amount of outliers used during training, and at injecting diversity in the ensemble. We perform extensive experiments and show that the proposed methodology outperforms state-of-the-art approaches under a variety of conditions.

Sarvari, H.; Domeniconi, C.; Prenkaj, B.; Stilo, Giovanni. (2021). Unsupervised Boosting-Based Autoencoder Ensembles for Outlier Detection. In Advances in Knowledge Discovery and Data Mining 25th Pacific-Asia Conference, PAKDD 2021, Virtual Event, May 11–14, 2021, Proceedings, Part I (pp. 91- 103). Isbn: 978-3-030-75761-8. Isbn: 978-3-030-75762-5. Doi: 10.1007/978-3-030-75762-5_8.

Unsupervised Boosting-Based Autoencoder Ensembles for Outlier Detection

Sarvari H.;Domeniconi C.;Prenkaj B.;Stilo G.^{Membro del Collaboration Group}

2021

Abstract

Autoencoders have been recently applied to outlier detection. However, neural networks are known to be vulnerable to overfitting, and therefore have limited potential in the unsupervised outlier detection setting. The majority of existing deep learning methods for anomaly detection is sensitive to contamination of the training data to anomalous instances. To overcome the aforementioned limitations we develop a Boosting-based Autoencoder Ensemble approach (BAE). BAE is an unsupervised ensemble method that, similarly to boosting, builds an adaptive cascade of autoencoders to achieve improved and robust results. BAE trains the autoencoder components sequentially by performing a weighted sampling of the data, aimed at reducing the amount of outliers used during training, and at injecting diversity in the ensemble. We perform extensive experiments and show that the proposed methodology outperforms state-of-the-art approaches under a variety of conditions.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del convegno
	
				2021
			
	Codice ISBN
	
				978-3-030-75761-8
978-3-030-75762-5
			
	Citazione
	
				Sarvari, H.; Domeniconi, C.; Prenkaj, B.; Stilo, Giovanni. (2021). Unsupervised Boosting-Based Autoencoder Ensembles for Outlier Detection. In Advances in Knowledge Discovery and Data Mining 25th Pacific-Asia Conference, PAKDD 2021, Virtual Event, May 11–14, 2021, Proceedings, Part I (pp. 91- 103). Isbn: 978-3-030-75761-8. Isbn: 978-3-030-75762-5. Doi: 10.1007/978-3-030-75762-5_8.
			
	Appare nelle tipologie:
	
				04.1 - Contributo in Atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
978-3-030-75762-5_8.pdf Solo gestori archivio Tipologia: Versione dell'editore Licenza: Tutti i diritti riservati Dimensione 1.39 MB Formato Adobe PDF Visualizza/Apri	1.39 MB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/253738

Citazioni

18

12

ND

IRIS - Institutional Research Information System