[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Towards testing all Navi 3x architectures



Hi Cory,

On 2024-09-26 19:41, Cordell Bloor wrote:
> While I do support you in striving towards understanding and addressing
> these failures, there are many ways a GPU can be put into a bad state
> where a power cycle is the only fix. That goes double when initializing
> the GPU in unsupported ways for PCIe passthrough. We might be able to
> improve the reliability of using passthrough, but we will always have to
> be prepared for this sort of failure mode.

it's less about understanding the issue, more about finding data points
to detect it as early as possible, so that

>> gfx1102 also completed all its tests on ci-test.rocm.debian.net, but
>> because of the issue above, and the W7500 being only passively cooled,
>> I'm not going to move it to endeavour just yet.
> 
> The Radeon PRO W7500 has a fan in its picture on the AMD product page
> [1] and in reviews [2]. Yours doesn't have one?

it... does. What it doesn't have is a extra PCIe power connector, as its
draw is low enough to not need one.

I must have mixed these two up mentally a long time ago, and funnily
enough, never noticed my mistake. The brain can be funny. In German, we
call this "betriebsblind" (operational blindness).

Best,
Christian

> [1]: https://www.amd.com/en/products/graphics/workstations/radeon-pro/w7500.html
> [2]: https://www.pcmag.com/reviews/amd-radeon-pro-w7500



Reply to: