NVIDIA Working On "-flto-partition=locality" GCC Option To Boost Performance For Some CPU Workloads
[NVIDIA] 6 January 10:05 AM EST
NVIDIA compiler engineers have spent the past several months working on a proposed GCC option, -flto-partition=locality, that has the compiler optimize code layout for locality between callers and callees as part of the link-time optimization (LTO) process. For some workloads, NVIDIA is finding that this option significantly improves CPU performance.