News: 0001629284

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

OpenBLAS 0.3.33 Released With Automatic "BIGNUMA" For More Than 256 CPU Cores

([Programming] 4 Hours Ago OpenBLAS 0.3.33)


OpenBLAS 0.3.33 is out today as the latest update to this vendor-neutral, optimized Basic Linear Algebra Subprograms "BLAS" library.

For a while now OpenBLAS has supported a "BIGNUMA" setting if wanting to enable support for more than 256 CPU cores. This experimental option has allowed supporting up to 1024 CPU cores and 128 NUMA nodes in the "BIGNUMA=1" configuration. Now with OpenBLAS 0.3.33, the BIGNUMA setting is automatic when the number of CPU cores on the system exceeds 256. Great to see especially with the next-gen Intel Xeon and AMD EPYC processors pushing system core counts even higher.

OpenBLAS 0.3.33 has also made various adjustments from the reference LAPACK implementation to realign with its upcoming v3.13 release.

OpenBLAS 0.3.33 also brings a number of bug fixes, automatic detection for Arm Neoverse V3, some optimizations for Neoverse N2, x86_64 build fixes, and optimized STRSM and DTRSM kernels for WebAssembly.

Downloads and more details on the OpenBLAS 0.3.33 release can be found via [1]GitHub .



[1] https://github.com/OpenMathLib/OpenBLAS/releases/tag/v0.3.33



I put the shotgun in an Adidas bag and padded it out with four pairs of
tennis socks, not my style at all, but that was what I was aiming for: If
they think you're crude, go technical; if they think you're technical, go
crude. I'm a very technical boy. So I decided to get as crude as possible.
These days, though, you have to be pretty technical before you can even
aspire to crudeness.
-- William Gibson, "Johnny Mnemonic"