News: 0001531786

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

AMD ZenDNN 5.0.1 Released To Help With EPYC Inferencing For Recommender Systems & LLMs

([AMD] 86 Minutes Ago AMD ZenDNN 5.0.1)


Released last year shortly after the [1]EPYC 9005 "Turin" processor launch was [2]ZenDNN 5.0 for Zen 5 optimized CPU inferencing with the likes of PyTorch and TensorFlow. ZenDNN 5.0 delivers [3]up to a 400% performance uplift according to AMD engineers. Out today is ZenDNN 5.0.1 with further optimizations, particularly around recommendation engines and large language models (LLMs).

ZenDNN 5.0.1 is a small update focused on further pushing the performance envelope for recommender systems and large language models on AMD EPYC CPUs. ZenDNN 5.0.1 is a small update focused on further pushing the performance envelope for recommender systems and large language models on AMD EPYC CPUs. There is now support for INT8 and INT4 quantized deep learning recommendation models (DLRM) for faster inferencing and lower memory use compared to the previously preferred BF16 precision by ZenDNN.

[4]

That's it in terms of the listed changes for ZenDNN 5.0.1. There weren't any published benchmark numbers either from AMD of the impact they are seeing out of this ZenDNN point release but going from BF16 to INT4/INT8 can yield a significant boost. In any event those wanting to make use of ZenDNN 5.0.1 for optimized CPU-based inferencing with PyTorch and TensorFlow on AMD Zen processors can find the new version via [5]GitHub . The [6]ZenDNN library remains available under an Apache 2.0 open-source license.



[1] https://www.phoronix.com/search/EPYC+9005

[2] https://www.phoronix.com/news/AMD-ZenDNN-5.0-Released

[3] https://www.phoronix.com/news/AMD-ZenDNN-5.0-400p-Performance

[4] https://www.phoronix.com/image-viewer.php?id=2025&image=amd_epyc_zendnn_lrg

[5] https://github.com/amd/ZenDNN/releases/tag/v5.0.1

[6] https://www.phoronix.com/search/ZenDNN



phoronix

A program should be light and agile, its subroutines connected like a
strings of pearls. The spirit and intent of the program should be retained
throughout. There should be neither too little nor too much, neither needless
loops nor useless variables, neither lack of structure nor overwhelming
rigidity.
A program should follow the 'Law of Least Astonishment'. What is this
law? It is simply that the program should always respond to the user in the
way that astonishes him least.
A program, no matter how complex, should act as a single unit. The
program should be directed by the logic within rather than by outward
appearances.
If the program fails in these requirements, it will be in a state of
disorder and confusion. The only way to correct this is to rewrite the
program.
-- Geoffrey James, "The Tao of Programming"