News: 0001563801

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

Vulkan + Mesa Drivers For AI Inferencing? It's Already Showing Potential On Radeon RADV

([Mesa] 24 July 07:42 PM EDT Mesa Vulkan Drivers)


Following the Vulkanised 2025 presentation how [1]NVIDIA is finding great success with Vulkan for AI / machine learning and already competitive to CUDA in some areas, Red Hat engineer and DRM subsystem lead maintainer David Airlie began exploring the potential of Mesa Vulkan drivers for AI inferencing. He was successful in using the Intel ANV, NVIDIA NVK, and Radeon RADV drivers for Vulkan-based AI inferencing while for the Radeon hardware tested is where it's showing the most potential (performance) at the moment and for even competing with the ROCm compute stack.

David Airlie shared a blog post today outlining his experiences exploring the Mesa Vulkan drivers for AI inferencing. At the same time he and others like Karol Herbst of Red Hat are working on addressing feature gaps in the Mesa Vulkan drivers to make then more suitable for handling AI workloads.

Using the ramalama wrapper to Llama.cpp, Airlie has tested the different open-source and closed-source driver options. With the Mesa NVK driver the performance is a lot slower still than the official NVIDIA closed-source driver stack. No real surprise, especially given our recent NVK graphics benchmarks in [2]Mesa 25.2 NVK vs. NVIDIA R575 Linux Graphics Performance For GeForce RTX 40 Series .

On the Intel side he was able to get the Vulkan AI inferencing working with the ANV driver but unable to get the oneAPI/SYCL stack working nicely. On the AMD side is where it currently shows the most potential. Airlie's results show that the RADV Vulkan performance with ramalama/llama.cpp for token generation could be faster than using the official AMD ROCm compute stack. With prompt processing is where ROCm came out ahead, at least for now but there is hope some Mesa/RADV optimizations could put it ahead of ROCm.

Here is the data shared from Airlie with his GPU driver/hardware comparison:

Read more about his Vulkan AI comparison adventure via [3]his blog . It will be interesting to run some benchmarks on our side once the Mesa Vulkan drivers further mature for AI workloads.



[1] https://www.phoronix.com/news/NVIDIA-Vulkan-AI-ML-Success

[2] https://www.phoronix.com/review/mesa-252-nvk-nvidia

[3] https://airlied.blogspot.com/2025/07/ramalamamesa-benchmarks-on-my-hardware.html



shmerl

JEBjames

Michael

QwertyChouskie

Jumbotron

JellyDonutz

ahrs

smitty3268

airlied

Well, we're big rock singers, we've got golden fingers,
And we're loved everywhere we go.
We sing about beauty, and we sing about truth,
At ten thousand dollars a show.
We take all kind of pills to give us all kind of thrills,
But the thrill we've never known,
Is the thrill that'll get'cha, when you get your picture,
On the cover of the Rolling Stone.

I got a freaky old lady, name of Cole King Katie,
Who embroiders on my jeans.
I got my poor old gray-haired daddy,
Drivin' my limousine.
Now it's all designed, to blow our minds,
But our minds won't be really be blown;
Like the blow that'll get'cha, when you get your picture,
On the cover of the Rolling Stone.

We got a lot of little, teen-aged, blue-eyed groupies,
Who'll do anything we say.
We got a genuine Indian guru, that's teachin' us a better way.
We got all the friends that money can buy,
So we never have to be alone.
And we keep gettin' richer, but we can't get our picture,
On the cover of the Rolling Stone.
-- Dr. Hook and the Medicine Show
[As a note, they eventually DID make the cover of RS. Ed.]