[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Debian 11: Nvidia NVS 310 with nvidia driver freezes after two days



On Sun, 19 Sep 2021, Andrew M.A. Cater wrote:

On Sun, Sep 19, 2021 at 01:22:30PM +0200, Roger Price wrote:
My Nvidia NVS 310 card with the nvidia 390.144 driver starts off perfectly,
but after two days freezes: no reaction to keyboard or mouse action.

This comes down, perhaps, to having both nouveau and nididia drivers
on the same system.

I use synaptic, which wants to remove far too many packages if I remove nouveau
so I didn't insist.

On Sun, 19 Sep 2021, Alexander V. Makartsev wrote:

It looks like a hardware problem to me, even if you say it takes two days to freeze. Can you tell us more about your system. Is it laptop or is it stationary workstation?

The machine is a Dell 6-Core Precision WorkStation T7500, Intel Xeon E5645, with Bios dated 2013. The internal temperatures are: cpu: 39.0 C mobo: 26.0 C. The RAM total is 47.04 GiByte. I use Xfce 4.16.0.

Was it working just fine before you upgraded to 'bullseye' (ver 5.10 kernel)?

Ran perfectly for 2 years with opensuse 42.3, a Quadro 4000 card and the nvidia 384.69 driver. Rock solid. Quadro 4000 temperature typically 85C.

Have you tried to run some benchmarks to force the issue? By doing that you could reveal some potential problem with inadequate cooling or problems of...

Here is a short summary of my notes following my attempts to find a working setup on this workstation

    Driver: nouveau
    ---------------

Card Quadro 4000, GF100GL. Freezes after 11 minutes with monitors lit. Firmware issue: failed to load nvc0_fuc084. After freeze, journalctl shows

 fifo: INTR 0100 0000: 0...05   many repetitions
 fifo: INTR 0080 0000           once

Card NVS 310.  Freezes after 6 minutes with monitors lit. Card temperature 46C.

Card Quadro P400, GP107GL. Left monitor spontaneously rotates after 30 min. Other random reconfigurations. Firmware issue: gp107/nvdec/scrubber and acr/DL.bin not loaded. Message: Failed to create kernel channel -22. Card temperature 45C.

    Nvidia drivers
    --------------

Card Quadro 4000, GF100GL. 390.144. Freezes with blank monitors after 15 minutes. Card temperature 85C.

Card NVS 310. 390.144.  Freezes with monitors lit after 15 mins - 3 hours.

Card Quadro P400, GP107GL. 460.91. Card temperature 46C. Freezes with blank monitors before 30 mins.

    Current situation
    -----------------

I wondered if a common feature of all the freezing was the automatic screen saver failing, so I installed xscreensaver and configured it to start saving my screen after 10 mins inactivity with the Quadro P400 card + 490.91 driver. This has so far held up for 22 hours. If it holds up for a week, I will report it as a candidate workaround. If it doesn't, I will still be looking for a solution.

Roger


Reply to: