Towards a Prediction of Machine Learning Training Time to Support Continuous Learning Systems Development

Marzi, F.; D'Aloisio, G.; Di Marco, A.; Stilo, Giovanni

doi:10.1007/978-3-031-66326-0_11

The problem of predicting the training time of machine learning (ML) models has become extremely relevant in the scientific community. Being able to predict a priori the training time of an ML model would enable the automatic selection of the best model both in terms of energy efficiency and in terms of performance in the context of, for instance, MLOps architectures or learning-enabled architectures. In this paper, we present the work we are conducting towards this direction. In particular, we present an extensive empirical study of the Full Parameter Time Complexity (FPTC) approach by Zheng et al., which is, to the best of our knowledge, the only approach formalizing the training time of ML models as a function of both dataset’s and model’s parameters. We study the formulations proposed for the Logistic Regression and Random Forest classifiers, and we highlight the main strengths and weaknesses of the approach. Finally, we observe how, from the conducted study, the prediction of training time is strictly related to the context (i.e., the involved dataset) and how the FPTC approach is not generalizable.

Marzi, F.; D'Aloisio, G.; Di Marco, A.; Stilo, Giovanni. (2024). Towards a Prediction of Machine Learning Training Time to Support Continuous Learning Systems Development. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 169- 184). Isbn: 9783031663253. Doi: 10.1007/978-3-031-66326-0_11.

Towards a Prediction of Machine Learning Training Time to Support Continuous Learning Systems Development

Marzi F.;d'Aloisio G.;Di Marco A.;Stilo G.^{Membro del Collaboration Group}

2024

Abstract

The problem of predicting the training time of machine learning (ML) models has become extremely relevant in the scientific community. Being able to predict a priori the training time of an ML model would enable the automatic selection of the best model both in terms of energy efficiency and in terms of performance in the context of, for instance, MLOps architectures or learning-enabled architectures. In this paper, we present the work we are conducting towards this direction. In particular, we present an extensive empirical study of the Full Parameter Time Complexity (FPTC) approach by Zheng et al., which is, to the best of our knowledge, the only approach formalizing the training time of ML models as a function of both dataset’s and model’s parameters. We study the formulations proposed for the Logistic Regression and Random Forest classifiers, and we highlight the main strengths and weaknesses of the approach. Finally, we observe how, from the conducted study, the prediction of training time is strictly related to the context (i.e., the involved dataset) and how the FPTC approach is not generalizable.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del convegno
	
				2024
			
	Codice ISBN
	
				9783031663253
			
	Parole chiave
	
				Formal Analysis; Learning-enabled Architectures; Machine Learning; Training Time
			
	Citazione
	
				Marzi, F.; D'Aloisio, G.; Di Marco, A.; Stilo, Giovanni. (2024). Towards a Prediction of Machine Learning Training Time to Support Continuous Learning Systems Development. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 169- 184). Isbn: 9783031663253. Doi: 10.1007/978-3-031-66326-0_11.
			
	Appare nelle tipologie:
	
				04.1 - Contributo in Atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
Towards-aPrediction-ofMachine-Learning-Training-Time-toSupport-Continuous-Learning-Systems-DevelopmentLecture-Notes-in-Computer-Science-including-subseries-Lecture-Notes-in-Artificial-Intelligence-and-Lecture-Notes-in-Bioinformatics.pdf Solo gestori archivio Tipologia: Versione dell'editore Licenza: Tutti i diritti riservati Dimensione 422.63 kB Formato Adobe PDF Visualizza/Apri	422.63 kB	Adobe PDF	Visualizza/Apri

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/252859

Citazioni

0

0

ND

IRIS - Institutional Research Information System