Some updates on testing I'm doing:
$ init 1
also hangs
Booting from grub using init 3 gave me a single chance to reboot without hanging, other variables being acpi=off. But somehow I was not able to reproduce the behaviour.
Messages appearing on screen when init 3 was running, or in kernel.log:
---
TTM Buffer eviction failed
nouveau DRM failed idle channel 0
---
Now the messages below appear while the shutting down (or rebooting) is running its course, so I have no terminal at hand. Just the notifications scrolling:
(I took a picture with a mobile and wrote them by hand. Forgive any typos or abbreviations)
after init 1 or reboot or shutdown actually
INFO: rcu_sched detected stalls on CPUs/tasks:
rcu: $3-...0: (0 ticks this GP) idle=51a/1/0x40000
rcu: $(detected by 6, t=5252 jiffies, g...)
NMI watchdog: Watchdog detected hard LOCKUP on cpu 3
INFO: rcu_sched detected expedited stalls on CPUs/tasks:
rcu: blocking rcu_node structures:
watchdog: BUG: soft lockup - CPU#5 stuck for 23s! [systemd:1]
...
repeats in an endless loop
...
INFO: task irq/87-ELANO611:556 blocked for more than 120 seconds
Tainted: G W OEL 4.19.0-6-amd64 #1 Debian 4.19.67-2+deb10u1
echo 0 > /proc/sys/kernel/hung_task_timeout_secs disables this message
...
repeats for other tasks (haveged, systemd-logind, wpa_supplicant, dhclient, QQm1Thread, GlobalQueue, gdbus, ...)
...
i2c_transfer+0x51...
elan_i2c_get_report+0x1c...
? __switch_to+0x8c/0x440...
elan_isr+0x4b...
? __schedule+0x2aa...
? __wake_up_common_lock+...
? irq_finalize_oneshot....
irq_thread_fn+0x1f...
kthread+0x112
ret_from_fork+...
watchdog: BUG: soft lockup...
Modules linked in: ufs qnx4 hfsplus hfs minix vfat msdos fat jfs xfs dm_mod pc sst_ssp snd_hda_ext_core snd_soc_acpi_intel_match x86_pkg_temp_thermal btbcm irqbypass videobuf2_vmalloc btintel ideapad_l... pcspkr xor btrfs ecb zstd_decompre... xtables autofs4 media pcc_cpufreq...
CPU: 5 PID:: 1 Comm: systemd Tainted: G
Hardware name: LENOVO 81G3/LNVNB161216, BIOS 6JCN23WW 01/23/2018
RIP: 0010:smp_call_function_many+0x1f8/0x250
Code: c7 e8 6c 3f 5f 00...
RAX: 0000000000000003 RBX: ffff9e.... RCX: fff...
RDX: 0000000000000001 RSI: 0000.... RDI: fff9e...
RBP: .... R08: .... R09:....
R10:... R11:... R12:...
R13:... R14:.... R15:...
FS:... GS:... knlGS:...
CS: ... DS: ... ES: ... CRO:....
CR2: .... CR3:... CR4:....
Call Trace:
? tcp_v6_pre_connect...
? add_nops...
on_each_cpu...
text_poke_bp
__jump_label_transform...
arch_jump_label...
__jump_label_update...
__static_key_slow_dec_cpu...
__cgroup_bpf_detach...
__cgroup_bpf_prog_detach...
__x64_sys_bpf...
do_syscall_64+0x53/0x110
entry_SYSCALL_64_after_hwframe+0x44/0xa9
RIP: 0033:....
...
repeat, different memory dump
...
Task dump for CPU 3:
Xorg R running task
Call Trace:
? nvif_object_fini...
?nouveaus_vmm_fini
?nouveau_cli_fini
?nouveau_drm_postcl...
?drm_file_free.part....
?drm_release+...
?__fput
?task_work_run...
?do_exit
?handle_mm_fault
?do_group_exit
?__x64_sys_exit_group...
do_syscall_64+0x53/0x110
entry_SYSCALL_64_after_hwframe+0x44/0xa9
--------------
I would prefer to read that on logs instead of a mobile picture... Not sure yet what logs to look (or maybe turn on)
dmesg shows only the starting process, not the hanging avenue when trying to shutdown.
Thanks for any help or tip.
Att.,
Beco