2025: A GPT Odyssey. Deconstructing Intelligence by Gradual Dissolution of a Transformer

De Santis, Enrico; Martino, Alessio; Bruno, Edoardo; Rizzi, Antonello

doi:10.1109/ijcnn64981.2025.11227797

Hierarchical processing and information granulation have proven essential for intelligent systems, as exemplified by Large Language Models (LLMs) backed by Transformer architectures, which leverage stacked attention modules to learn progressively richer semantic features. In this work, we investigate the role of attention layers in the hierarchy through a GPT-2 layer ablation methodology, which recalls the deactivation of the HAL 9000 computer modules in the iconic scene of the film "2001: A Space Odyssey". The methodology measures a set of indices (Dale-Chall Readability, BLEU and Text Flow – a measure of the coherence within the flow of sentences) on the text produced after removing a combination of layers, assisted by a single-way analysis to characterize these combinations. Subsequently, through a "machine-in-the-loop" procedure, we let GPT-4 judge the texts produced by GPT-2. The results are consistent with the basic hypothesis that the hierarchical organization of Transformers underlies their high semantic performance, opening the path to further insights and potential applications such as Explainable AI and the analytical characterization of texts produced by Generative AI models.
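The layer-removal idea described in the abstract can be illustrated with a minimal toy sketch. It is not the authors' implementation: `make_toy_layer`, `forward`, and `ablation_sets` are hypothetical names, the "layers" are simple scalar maps standing in for Transformer blocks, and the assumption that a removed layer is skipped so the residual stream passes through unchanged is ours, not a detail stated in the abstract.

```python
from itertools import combinations
from typing import Callable, Iterable, List, Sequence, Tuple

# A toy "layer" maps a hidden vector to an update of the same length.
Layer = Callable[[List[float]], List[float]]


def make_toy_layer(scale: float) -> Layer:
    """Hypothetical stand-in for a Transformer block (not a real attention layer)."""
    def layer(hidden: List[float]) -> List[float]:
        return [scale * h for h in hidden]
    return layer


def forward(layers: Sequence[Layer],
            hidden: List[float],
            removed: Iterable[int] = ()) -> List[float]:
    """Run a residual stack, skipping every layer whose index is in `removed`.

    Assumption: an ablated layer contributes no update, so the hidden state
    passes through it unchanged.
    """
    skip = set(removed)
    for idx, layer in enumerate(layers):
        if idx in skip:
            continue  # ablated layer: identity on the residual stream
        update = layer(hidden)
        hidden = [h + u for h, u in zip(hidden, update)]
    return hidden


def ablation_sets(num_layers: int, k: int) -> List[Tuple[int, ...]]:
    """Enumerate every combination of k layer indices that could be removed."""
    return list(combinations(range(num_layers), k))


# Usage: a 3-layer toy stack, evaluated whole and with the middle layer removed.
layers = [make_toy_layer(1.0) for _ in range(3)]
full_output = forward(layers, [1.0])
ablated_output = forward(layers, [1.0], removed=[1])
```

In an actual study, each combination returned by `ablation_sets` would be applied to the model, and the generated text would then be scored with the indices named in the abstract. The toy above only shows the skip-and-compare mechanics.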

De Santis, Enrico; Martino, Alessio; Bruno, Edoardo; Rizzi, Antonello. (2025). 2025: A GPT Odyssey. Deconstructing Intelligence by Gradual Dissolution of a Transformer. In 2025 International Joint Conference on Neural Networks (IJCNN) (pp. 1-10). Institute of Electrical and Electronics Engineers (IEEE). ISBN: 979-8-3315-1042-8. DOI: 10.1109/ijcnn64981.2025.11227797.

Conference year: 2025
ISBN: 979-8-3315-1042-8
Keywords: Large Language Models, Explainable AI, Text Modeling, Natural Language Processing, Text Embedding
Appears in types: 04.1 - Conference proceedings contribution (Paper in Proceedings)

Files in this item:

File	Size	Format
2025_A_GPT_Odyssey._Deconstructing_Intelligence_by_Gradual_Dissolution_of_a_Transformer.pdf (Access: archive managers only; Type: publisher's version; License: all rights reserved)	1.87 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11385/255319
