News: 0001504254

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

Intel oneCCL 2021.14 Brings New Performance & Scalability Optimizations

([Intel] 6 Hours Ago oneCCL)


Intel's oneCCL open-source software that is the oneAPI Collective Communications Library focused on providing an efficient implementation of deep learning communication patterns is out with a new release.

The oneAPI Collective Communications Library can be used with the likes of the PyTorch and Horovod frameworks and has been seeing renewed development now as part of the Unified Acceleration (UXL) Foundation efforts. With the oneCCL 2021.14 release there are optimizations for their key-value store to scale up to 3,000 nodes plus an assortment of other new performance optimizations:

- Optimizations on key-value store support to scale up to 3000 nodes

- New APIs for Allgather, Broadcast and group API calls

- Performance Optimizations for scaleup for Allgather, Allreduce, and Reduce-scatter for scaleup and scaleout

- Performance Optimizations for CPU single node

- Optimizations to reuse Level Zero events.

- Change of the default mechanism for IPC exchange to pidfd

Downloads and more details on the oneCCL 2021.14 release via [1]GitHub .



[1] https://github.com/oneapi-src/oneCCL/releases/tag/2021.14



phoronix

Some people have told me they don't think a fat penguin really embodies the
grace of Linux, which just tells me they have never seen a angry penguin
charging at them in excess of 100mph. They'd be a lot more careful about what
they say if they had.
-- Linus Torvalds, announcing Linux v2.0