TY - GEN
T1 - Optimization opportunities created by global data reordering
AU - Haber, G.
AU - Klausner, M.
AU - Eisenberg, V.
AU - Mendelson, B.
AU - Gurevich, M.
N1 - Publisher Copyright: © 2003 IEEE.
PY - 2003
Y1 - 2003
N2 - Memory access has proven to be one of the bottlenecks in modern architectures. Improving memory locality and reducing the number of memory accesses can help relieve this bottleneck. We present a method for link-time, profile-based optimization that reorders the global data of the program and modifies its code accordingly. The proposed optimization reorders all of the program's global data according to a representative execution rate of each instruction (or basic block) in the code. The data reordering is done in a way that enables the replacement of frequently executed Load instructions, which reference the global data, with fast Add Immediate instructions. In addition, it tries to improve the global data locality and to reduce the total size of the global data area. The optimization was implemented in FDPR (Feedback Directed Program Restructuring), a post-link optimizer, which is part of the IBM AIX operating system for the IBM pSeries servers. Our results on SPECint2000 show a significant improvement of up to 11% (average 3%) in execution time, along with up to 97.9% (average 83%) reduction in memory references to the global variables via the global data access mechanism of the program.
KW - Feedback
KW - Frequency
KW - Interference
KW - Libraries
KW - Optimization methods
KW - Program processors
KW - Reduced instruction set computing
KW - Registers
KW - Sun
UR - http://www.scopus.com/inward/record.url?scp=84943416062&partnerID=8YFLogxK
U2 - 10.1109/CGO.2003.1191548
DO - 10.1109/CGO.2003.1191548
M3 - Conference contribution
AN - SCOPUS:84943416062
T3 - International Symposium on Code Generation and Optimization, CGO 2003
SP - 228
EP - 237
BT - International Symposium on Code Generation and Optimization, CGO 2003
T2 - International Symposium on Code Generation and Optimization, CGO 2003
Y2 - 23 March 2003 through 26 March 2003
ER -