VVenC H.266 Encoder Rolls Out More ARM Optimizations For Nice Performance Gains

([Multimedia] 53 Minutes Ago Benchmarks On NVIDIA GB10)


Fraunhofer HHI this week released a new version of VVenC, their open-source H.266 video encoder. Among the changes in this release are more performance optimizations for ARM, and I have run some comparison benchmarks on an NVIDIA GB10 SoC in the Dell Pro Max GB10.

This isn't the first release of VVenC to feature targeted ARM performance optimizations; it is simply the latest step in the ongoing push for better ARM64 video encode performance. The VVenC 1.14 release brings "many" ARM SIMD optimizations around NEON and SVE, and it enables the Scalable Vector Extension (SVE) by default on capable ARM processors.

[1]

The NVIDIA GB10, with its ten Arm Cortex-X925 and ten Arm Cortex-A725 cores, does support SVE/SVE2. That makes it a very interesting test target, and thanks to Dell having provided the [2]Dell Pro Max GB10 review sample it's possible to benchmark these latest VVenC improvements.
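For readers who want to confirm that their own ARM system exposes SVE before trying VVenC 1.14, here is a minimal sketch. It assumes a Linux AArch64 system, where `/proc/cpuinfo` carries a `Features` line listing flags such as `sve` and `sve2`. The helper names `cpu_features` and `sve_support` are made up for this illustration and are not part of VVenC.

```python
def cpu_features(cpuinfo_text):
    """Return the flags from the first 'Features' line of /proc/cpuinfo text.

    Assumption: Linux on AArch64, which reports CPU capabilities as a
    space-separated list on a line starting with 'Features'.
    """
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "Features":
            return set(value.split())
    return set()


def sve_support(cpuinfo_text):
    """Report whether the 'sve' and 'sve2' flags are present."""
    flags = cpu_features(cpuinfo_text)
    return {"sve": "sve" in flags, "sve2": "sve2" in flags}
```

On a real system the input would come from reading `/proc/cpuinfo`, for example `sve_support(open("/proc/cpuinfo").read())`. This only reports what the kernel advertises; how VVenC itself detects and selects SVE code paths is not covered here.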

For today's testing I built and benchmarked the VVenC 1.13 and VVenC 1.14 H.266 encoders on the Dell Pro Max GB10 using the same software stack and default compiler/flags, changing only the version of VVenC being tested.

VVenC 1.14 delivered around 35% higher performance with 4K content at the fast preset. And that's on top of VVenC 1.13 already having some ARM optimizations: this isn't their first release tuning for AArch64 performance.

The "faster" preset with 4K content was still yielding around 7% better performance for this new release.
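For anyone reproducing this comparison, the percentage gains quoted here can be computed as below. This is a small sketch that assumes the benchmark reports a higher-is-better metric such as frames per second. The function name `percent_gain` is hypothetical, and the numbers in the comment are placeholders chosen to illustrate the arithmetic, not measurements from this article.

```python
def percent_gain(old_fps, new_fps):
    """Relative improvement of new_fps over old_fps, in percent.

    Assumes a higher-is-better metric (e.g. encoded frames per second).
    """
    if old_fps <= 0:
        raise ValueError("old_fps must be positive")
    return (new_fps - old_fps) / old_fps * 100.0


# Placeholder values only: going from 10.0 to 13.5 frames per second
# is a gain of about 35 percent.
gain = percent_gain(10.0, 13.5)
```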

Even for 1080p content there were some nice VVenC 1.14 performance gains for H.266 video encoding on the Dell Pro Max GB10.

More [3]VVenC benchmarks to come for those curious how the performance stacks up between AArch64 and x86_64 processors. Those wanting to try out this open-source H.266 encoder can find it on [4]GitHub.



[1] https://www.phoronix.com/image-viewer.php?id=2026&image=vvenc_gb10_lrg

[2] https://www.phoronix.com/search/Dell+Pro+Max+GB10

[3] https://openbenchmarking.org/test/pts/vvenc

[4] https://github.com/fraunhoferhhi/vvenc/releases/tag/v1.14.0


