
Re: Machine freezes and crashes with the message "soft lockup - CPU#0 stuck for 22s!"



Hi

For a while now I've been seeing what looks like the same problem on
Jessie. Typically, after a few hours of using X, the system freezes with
a "BUG: soft lockup". For a few minutes I can still log in through ssh,
but then that stops working as well and the system stops responding to
pings. I'm also using a Radeon graphics card and see a similar stack
trace in the log (exact details below).

This seems to be a regression in the kernel. The workaround I found was
to freeze kernel updates. The last kernel that works reliably for me is
3.16.43-2+deb8u5. With all later kernels the system hangs within a day.

In Aptitude I held the packages "linux-image-3.16.0-4-amd64" (version
3.16.43-2+deb8u5) and "linux-image-amd64" (version 3.16+63) to prevent
updates. This is not ideal, especially given the recent security fixes,
but I plan to upgrade this system to Stretch eventually.
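
A command-line sketch of the same hold, assuming apt-mark is used instead
of the Aptitude interface. The package names are the ones given above;
the function names and the RUN variable are hypothetical helpers added
only for illustration (RUN=echo prints the commands instead of running
them):

```shell
#!/bin/sh
# Sketch: hold the known-good kernel image and its metapackage.
# RUN is a hypothetical dry-run switch: set RUN=echo to print the
# commands instead of executing them.
PKGS="linux-image-3.16.0-4-amd64 linux-image-amd64"

hold_kernel() {
    # Needs root. A held package is skipped by later upgrades.
    ${RUN:-} apt-mark hold $PKGS
}

show_holds() {
    # Lists all packages currently on hold, to verify the result.
    ${RUN:-} apt-mark showhold
}
```

Running hold_kernel as root and then show_holds should list both
packages. "apt-mark unhold" with the same names would release them again
before a dist-upgrade.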

Best regards
Tomaž


Details from the crash yesterday with linux-image-3.16.0-5-amd64
(3.16.51-3+deb8u1):

Excerpt from /var/log/syslog:

BUG: soft lockup - CPU#3 stuck for 22s! [Xorg:1175]

Modules linked in: usb_storage sha256_ssse3 sha256_generic ecb cbc
algif_skcipher af_alg cpufreq_stats cpufreq_powersave
cpufreq_conservative cpufreq_userspace binfmt_misc cfg80211 rfkill usblp
snd_hda_codec_realtek snd_hda_codec_generic joydev radeon ttm
drm_kms_helper iTCO_wdt iTCO_vendor_support drm i2c_algo_bit evdev
coretemp kvm_intel snd_hda_codec_hdmi kvm snd_hda_intel
snd_hda_controller snd_hda_codec shpchp snd_hwdep snd_pcm_oss
i7core_edac edac_core pcspkr serio_raw snd_mixer_oss lpc_ich mfd_core
snd_pcm snd_timer snd soundcore acpi_cpufreq processor button
thermal_sys raid1 md_mod it87 hwmon_vid dm_crypt dm_mod fuse parport_pc
ppdev lp parport autofs4 ext4 crc16 mbcache jbd2 sg sd_mod crc_t10dif
crct10dif_generic crct10dif_common sr_mod cdrom ata_generic hid_generic
usbhid hid crc32c_intel ahci pata_jmicron libahci i2c_i801 ehci_pci
i2c_core uhci_hcd libata ehci_hcd r8169 mii scsi_mod usbcore usb_common
floppy
CPU: 3 PID: 1175 Comm: Xorg Not tainted 3.16.0-5-amd64 #1 Debian
3.16.51-3+deb8u1
Hardware name: Gigabyte Technology Co., Ltd. P55-UD3/P55-UD3, BIOS F5
11/20/2009

Call Trace:
 [<ffffffffa049df86>] ? ttm_bo_vm_fault+0x4c6/0x560 [ttm]
 [<ffffffffa04d601d>] ? radeon_bo_create+0x16d/0x220 [radeon]
 [<ffffffff812b5a27>] ? idr_mark_full+0x57/0x60
 [<ffffffff812b5aa5>] ? idr_alloc+0x75/0xd0
 [<ffffffffa044991a>] ? drm_gem_handle_create_tail+0xba/0x160 [drm]
 [<ffffffffa0499e78>] ? ttm_bo_del_sub_from_lru+0x18/0xb0 [ttm]
 [<ffffffffa04d4907>] ? radeon_ttm_fault+0x47/0x60 [radeon]
 [<ffffffff8116c81a>] ? __do_fault+0x3a/0xa0
 [<ffffffff8117715c>] ? mmap_region+0x19c/0x650
 [<ffffffff8116f93f>] ? do_shared_fault.isra.55+0x2f/0x1d0
 [<ffffffff81170d25>] ? handle_mm_fault+0x6c5/0x1140
 [<ffffffffa04b8069>] ? radeon_drm_ioctl+0x69/0x80 [radeon]
 [<ffffffff810594f7>] ? __do_page_fault+0x177/0x410
 [<ffffffff81527e68>] ? page_fault+0x28/0x30

(there are several other stack traces in the log as well, some not
apparently related to radeon)
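
To get an overview of how many lockups were logged and which processes
were involved, something like the following could be used. This is only
a sketch: the function name is made up for illustration, and the default
path assumes the messages end up in /var/log/syslog as they do here.

```shell
#!/bin/sh
# Count "soft lockup" reports in a log file, grouped by CPU and process.
# The log file defaults to /var/log/syslog; pass another path as $1.
soft_lockups() {
    log="${1:-/var/log/syslog}"
    # Extract only the lockup message, drop the varying "stuck for Ns!"
    # part so identical CPU/process pairs collapse, then count them.
    grep -o 'soft lockup - CPU#[0-9]* stuck for [0-9]*s! \[[^]]*\]' "$log" \
        | sed 's/ stuck for [0-9]*s!//' \
        | sort | uniq -c
}
```

Calling soft_lockups with no argument reads the current syslog; rotated
logs can be passed explicitly, for example soft_lockups /var/log/syslog.1.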

$ cat /etc/debian_version
8.10

$ cat /proc/version
Linux version 3.16.0-5-amd64 (debian-kernel@lists.debian.org) (gcc
version 4.8.4 (Debian 4.8.4-1) ) #1 SMP Debian 3.16.51-3+deb8u1 (2018-01-08)

$ lspci|grep VGA
01:00.0 VGA compatible controller: Advanced Micro Devices, Inc.
[AMD/ATI] RV710 [Radeon HD 4350/4550]

I don't have the "firmware-amd-graphics" package installed.

"xserver-xorg-core" installed version: 2:1.16.4-1+deb8u2
