News: 0001481265

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

AMD Working On More Precise GPU Reset Capabilities For Their Linux Driver

([Radeon] 41 Minutes Ago Per Ring Resets)


While the hope remains that GPU resets are a very infrequent task, AMD Linux driver engineers have recently been working on the ability to support a per-queue GC reset capability for more precise reset capabilities when needed.

With the latest patches for the AMDGPU kernel driver code, the driver would be able to reset the graphics/compute ring where an error occurs and to just lose the jobs pending in that queue as opposed to carrying out a full GPU reset. But if the per-queue/ring reset doesn't work, the AMDGPU driver would continue to perform a full GPU reset as needed.

An updated set of 53 patches were posted today for introducing reset capabilities on a per-ring basis in the AMDGPU driver. This is then wired up across the recent generations of AMD graphics processors, including the upcoming GFX12 (RDNA4) hardware.

The hope is that this per-ring reset capability is more pleasant when encountering a specific job that's hung rather than going the route of a full "legacy" GPU reset.

Those interested in learning more can do so via [1]this patch series now under review for GC per-queue resets.



[1] https://lists.freedesktop.org/archives/amd-gfx/2024-July/111481.html



phoronix

parallel processors running perpendicular today