News: 0001484707

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

Mesa 24.3 Sees "Substantial Improvement" To AMD Clear/Copy-Buffer Compute Shader

([Mesa] 6 Hours Ago Optimized Clear / Copy Buffer Shader)


Well known AMD Mesa developer Marek Olšák continues relentlessly optimizing the RadeonSI Gallium3D driver and related code for ensuring the AMD graphics stack can reach peak performance.

Recently Marek has been working to optimize the clear/copy_buffer compute shader into AMD common code and as part of it adding support for unaligned copies.

In the merge request opened a few weeks ago Marek describes this as a "substantial improvement" that since overnight has been merged for Mesa 24.3. Marek notes in the [1]merge request :

"This is a substantial improvement of the clear/copy_buffer compute shader in radeonsi, which is also moved to src/amd/common.

This adds support for unaligned buffer clears and copies while maintaining the same performance as aligned clears and copies. The optimal alignment for buffer offsets is 256, not 4.

More chip-specific tuning will follow, but this is already optimal for Navi31."

Great to see more of Marek's optimizations ready for Mesa Git. It will be interesting to see what more tuning Marek achieves in time for Mesa 24.3 stable due out in Q4.



[1] https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30208



peterdk

Hop along my little friends, up the Withywindle!
Tom's going on ahead candles for to kindle.
Down west sinks the Sun; soon you will be groping.
When the night-shadows fall, then the door will open,
Out of the winfow-panes light will twinkle yellow.
Fear no alder black! Heed no hoary willow!
Fear neither root nor bough! Tom goes on before you.
Hey now! merry dol! We'll be waiting for you!
-- J. R. R. Tolkien