Mining Hot Calling Contexts in Small Space

Daniele Cono D'Elia,; Demetrescu, Camil; Finocchi, Irene

doi:10.1145/1993498.1993559

Calling context trees (CCTs) associate performance metrics with paths through a program's call graph, providing valuable information for program understanding and performance analysis. Although CCTs are typically much smaller than call trees, in real applications they might easily consist of tens of millions of distinct calling contexts: this sheer size makes them difficult to analyze and might hurt execution times due to poor access locality. For performance analysis, accurately collecting information about hot calling contexts may be more useful than constructing an entire CCT that includes millions of uninteresting paths. As we show for a variety of prominent Linux applications, the distribution of calling context frequencies is typically very skewed. In this paper we show how to exploit this property to reduce the CCT size considerably. We introduce a novel run-time data structure, called Hot Calling Context Tree (HCCT), that offers an additional intermediate point in the spectrum of data structures for representing interprocedural control flow. The HCCT is a subtree of the CCT that includes only hot nodes and their ancestors. We show how to compute the HCCT without storing the exact frequency of all calling contexts, by using fast and space-efficient algorithms for mining frequent items in data streams. With this approach, we can distinguish between hot and cold contexts on the fly, while obtaining very accurate frequency counts. We show both theoretically and experimentally that the HCCT achieves a similar precision as the CCT in a much smaller space, roughly proportional to the number of distinct hot contexts: this is typically several orders of magnitude smaller than the total number of calling contexts encountered during a program's execution. Our space-efficient approach can be effectively combined with previous context-sensitive profiling techniques, such as sampling and bursting.

Cono D'Elia, Daniele; Demetrescu, Camil; Finocchi, Irene. (2011). Mining Hot Calling Contexts in Small Space. In Proceedings of the 32nd ACM SIGPLAN conference on Programming Language Design and Implementation (pp. 516- 527). Isbn: 9781450306638. Doi: 10.1145/1993498.1993559. http://dl.acm.org/citation.cfm?doid=1993498.1993559.

Mining Hot Calling Contexts in Small Space

Daniele Cono D'Elia;Camil Demetrescu;Irene Finocchi

2011

Abstract

Calling context trees (CCTs) associate performance metrics with paths through a program's call graph, providing valuable information for program understanding and performance analysis. Although CCTs are typically much smaller than call trees, in real applications they might easily consist of tens of millions of distinct calling contexts: this sheer size makes them difficult to analyze and might hurt execution times due to poor access locality. For performance analysis, accurately collecting information about hot calling contexts may be more useful than constructing an entire CCT that includes millions of uninteresting paths. As we show for a variety of prominent Linux applications, the distribution of calling context frequencies is typically very skewed. In this paper we show how to exploit this property to reduce the CCT size considerably. We introduce a novel run-time data structure, called Hot Calling Context Tree (HCCT), that offers an additional intermediate point in the spectrum of data structures for representing interprocedural control flow. The HCCT is a subtree of the CCT that includes only hot nodes and their ancestors. We show how to compute the HCCT without storing the exact frequency of all calling contexts, by using fast and space-efficient algorithms for mining frequent items in data streams. With this approach, we can distinguish between hot and cold contexts on the fly, while obtaining very accurate frequency counts. We show both theoretically and experimentally that the HCCT achieves a similar precision as the CCT in a much smaller space, roughly proportional to the number of distinct hot contexts: this is typically several orders of magnitude smaller than the total number of calling contexts encountered during a program's execution. Our space-efficient approach can be effectively combined with previous context-sensitive profiling techniques, such as sampling and bursting.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno del convegno
	
				2011
			
	Codice ISBN
	
				9781450306638
			
	Parole chiave
	
				program instrumentation; performance profiling; data streaming; dynamic program analysis; frequent items; data streaming algorithms; calling contexts
			
	Citazione
	
				Cono D'Elia, Daniele; Demetrescu, Camil; Finocchi, Irene. (2011). Mining Hot Calling Contexts in Small Space. In Proceedings of the 32nd ACM SIGPLAN conference on Programming Language Design and Implementation (pp. 516- 527). Isbn: 9781450306638. Doi: 10.1145/1993498.1993559. http://dl.acm.org/citation.cfm?doid=1993498.1993559.
			
	Appare nelle tipologie:
	
				04.1 - Contributo in Atti di convegno (Paper in Proceedings)

File in questo prodotto:

Non ci sono file associati a questo prodotto.

Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11385/192671

Citazioni

26

3

27

IRIS - Institutional Research Information System