News: 0001573716

  ARM Give a man a fire and he's warm for a day, but set fire to him and he's warm for the rest of his life (Terry Pratchett, Jingo)

AMD "sbtsi_temp" Driver Being Updated For Linux 6.18 To Handle Freezing CPU Temperatures

([AMD] 3 Hours Ago sbtsi_temp)


Two years ago [1]the AMD Linux CPU temperature driver was updated to handle negative temperature reporting . That's for some users with exotic cooling systems and then also use within some industrial applications where the systems may be subject to sub-zero temperatures. The AMD sbtsi_temp driver is also now being similarly updated for handling freezing CPU temperatures.

The sbtsi_temp Linux driver is for the Sideband Interface (SBI) temperature sensor interface (SB-TSI). The AMD Sideband Interface is predominantly used for EPYC server processors and other select AMD SoCs. While the AMD k10temp CPU driver has already supported the expanded/negative temperature range, sbtsi_temp is only seeing that similar support now and is set to be merged for the upcoming Linux 6.18 cycle.

[2]

Cisco engineer Chuande Chen was the one working out this AMD CPU extended temperature range support for sbtsi_temp. Chuande explained with [3]the driver patch :

"Many AMD CPUs can support this feature now. We would get a wrong CPU DIE temperature if don't consider this. In low-temperature environments, the CPU die temperature can drop below zero. So many platforms would like to make extended temperature range as their default configuration.

Default temperature range (0C to 255.875C).

Extended temperature range (-49C to +206.875C).

Ref Doc: AMD V3000 PPR (Doc ID #56558)."

The engineer didn't elaborate on Cisco's low-temperature deployments of EPYC hardware. In any case this patch is queued up in the hardware monitoring "HWMON" subsystem's "hwmon-next" branch. Now that the patch is there it should be submitted as part of the Linux 6.18 kernel merge window in October and in turn will be out in stable form come December.



[1] https://www.phoronix.com/news/AMD-k10temp-Negative-Temps

[2] https://www.phoronix.com/image-viewer.php?id=2025&image=amd_ryzen_frozen_lrg

[3] https://git.kernel.org/pub/scm/linux/kernel/git/groeck/linux-staging.git/commit/?h=hwmon-next&id=612ecee07edcf235627b7c82a67a0fe4a2f61d13



Michael

zexelon

Vorpal

Type44Q

An architect's first work is apt to be spare and clean. He knows
he doesn't know what he's doing, so he does it carefully and with great
restraint.
As he designs the first work, frill after frill and embellishment
after embellishment occur to him. These get stored away to be used "next
time." Sooner or later the first system is finished, and the architect,
with firm confidence and a demonstrated mastery of that class of systems,
is ready to build a second system.
This second is the most dangerous system a man ever designs.
When he does his third and later ones, his prior experiences will
confirm each other as to the general characteristics of such systems,
and their differences will identify those parts of his experience that
are particular and not generalizable.
The general tendency is to over-design the second system, using
all the ideas and frills that were cautiously sidetracked on the first
one. The result, as Ovid says, is a "big pile."
-- Frederick Brooks, "The Mythical Man Month"