Bug#1012100: linux-image-5.17-1: KVM LIBVIRT fails to start, slow disk access, and a kernel thread goes wild on Intel Xeon X3430
Dear Diederik,
yes, it works with the kernel 5.16.0-6, but disk access is still slow.
For example, virt-manager/viewer sometimes needs a minute to connect to
the KVM instances on localhost. But not all applications are this slow;
for example the E-Mail client Sylpheed starts as fast as before and is
operating at fast speed.
I assume there is also another bug now in the system, not only due to
the new kernel. There is also another bug in GDM3, which I also
reported: Loading GDM3 after bootup and logging in as normal user is
also very, very slow.
As you suggested, I installed the kernel 5.17.11 from Debian/unstable
and booted into this kernel.
virt-manager and my KVM VM instances do work again, but one VM instance
failed to load after bootup. I restarted the VM instance, and it is now
also operating fine.
When opening the virt-viewer instance from virt-manager, connecting to
the VM is still very slow with kernel 5.17.11. Something must be wrong
I/O wise.
I attached the dmesg output, you requested, as TXT file to this E-mail.
Thank you very much for your answer!
Sincerely,
Adrian Kiess
On Mon, 30 May 2022 11:45:29 +0200 Diederik de Haas
<didi.debian@cknow.org> wrote:
> On Monday, 30 May 2022 09:59:06 CEST Adrian Immanuel Kiess wrote:
> > Package: src:linux
> > Version: 5.17.3-1
> > Debian Release: bookworm/sid
> > APT policy: (990, 'testing')
> >
> > Kernel: Linux 5.16.0-6-amd64 (SMP w/4 CPU threads; PREEMPT)
>
> Does everything work correctly with kernel 5.16.0-6 ?
> Sid/Unstable currently has and it would be useful to know if
> the issue is still present in that version. Can you test that?
>
> If it is, then hopefully `dmesg` can give some clues. After you've noticed the
> described symptoms again, can you do `dmesg --level emerg,alert,crit,err,warn`
> and send that to this bug report?
[ 0.022299] ACPI: SPCR: Unexpected SPCR Access Width. Defaulting to byte size
[ 0.233929] core: CPUID marked event: 'bus cycles' unavailable
[ 0.281581] pmd_set_huge: Cannot satisfy [mem 0xe0000000-0xe0200000] with a huge-page mapping due to MTRR override.
[ 0.285753] [Firmware Warn]: HEST: Duplicated hardware error source ID: 9.
[ 8.897715] ERST: Can not request [mem 0xbf7ff000-0xbf7fffff] for ERST.
[ 9.537343] ACPI Warning: SystemIO range 0x0000000000001028-0x000000000000102F conflicts with OpRegion 0x0000000000001000-0x000000000000102F (\_SB.PCI0.LPC0.PMIO) (20211217/utaddress-204)
[ 9.549414] ACPI Warning: SystemIO range 0x0000000000001180-0x00000000000011AF conflicts with OpRegion 0x0000000000001180-0x00000000000011AF (\_SB.PCI0.LPC0.GPOX) (20211217/utaddress-204)
[ 9.556866] lpc_ich: Resource conflict(s) found affecting gpio_ich
[ 9.678746] resource sanity check: requesting [mem 0x000c0000-0x000dffff], which spans more than PCI Bus 0000:00 [mem 0x000d4000-0x000dbfff window]
[ 9.678756] caller pci_map_rom+0x79/0x1d0 mapping multiple BARs
[ 13.853298] IPMI Watchdog: Unable to register misc device
[ 14.329981] kvm: VM_EXIT_LOAD_IA32_PERF_GLOBAL_CTRL does not work properly. Using workaround
[ 24.691518] kauditd_printk_skb: 85 callbacks suppressed
[ 26.412447] kvm: KVM_SET_TSS_ADDR need to be called before entering vcpu
[ 90.991726] device-mapper: core: CONFIG_IMA_DISABLE_HTABLE is disabled. Duplicate IMA measurements will not be recorded in the IMA log.
[ 95.092388] BUG: kernel NULL pointer dereference, address: 000000000000000b
[ 95.092396] #PF: supervisor write access in kernel mode
[ 95.092398] #PF: error_code(0x0002) - not-present page
[ 95.092404] Oops: 0002 [#1] PREEMPT SMP PTI
[ 95.092407] CPU: 2 PID: 4379 Comm: CPU 0/KVM Not tainted 5.17.0-3-amd64 #1 Debian 5.17.11-1
[ 95.092411] Hardware name: HP ProLiant ML110 G6/ProLiant ML110 G6, BIOS O27 08/26/2011
[ 95.092413] RIP: 0010:kvm_replace_memslot+0xcf/0x390 [kvm]
[ 95.092481] Code: 44 24 08 48 85 db 0f 84 3b 02 00 00 48 89 ea 48 c1 e2 04 48 01 da 48 8b 4a 08 48 85 c9 74 1e 48 8b 32 48 89 31 48 85 f6 74 04 <48> 89 4e 08 48 c7 02 00 00 00 00 48 c7 42 08 00 00 00 00 48 8d 54
[ 95.092484] RSP: 0018:ffffa1fd87ffbd70 EFLAGS: 00010206
[ 95.092487] RAX: ffffa1fd87fb5058 RBX: ffff955e10f13200 RCX: ffffa1fd87fb5388
[ 95.092489] RDX: ffff955e10f13200 RSI: 0000000000000003 RDI: ffffa1fd87fb5000
[ 95.092491] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[ 95.092493] R10: 0000000000000003 R11: 0000000000000004 R12: 0000000000000000
[ 95.092495] R13: 0000000000000000 R14: 0000000000000000 R15: ffffa1fd87fb5000
[ 95.092497] FS: 00007f0e39639640(0000) GS:ffff95606fd00000(0000) knlGS:0000000000000000
[ 95.092500] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 95.092503] CR2: 000000000000000b CR3: 00000001b4780000 CR4: 00000000000026e0
[ 95.092505] Call Trace:
[ 95.092509] <TASK>
[ 95.092513] ? _raw_read_unlock+0x18/0x30
[ 95.092519] kvm_set_memslot+0x3c2/0x4a0 [kvm]
[ 95.092564] kvm_vm_ioctl+0x2cb/0xd80 [kvm]
[ 95.092610] ? __seccomp_filter+0x38c/0x5a0
[ 95.092615] __x64_sys_ioctl+0x82/0xb0
[ 95.092620] do_syscall_64+0x3b/0xc0
[ 95.092625] entry_SYSCALL_64_after_hwframe+0x44/0xae
[ 95.092629] RIP: 0033:0x7f0e64d63397
[ 95.092632] Code: 3c 1c e8 1c ff ff ff 85 c0 79 87 49 c7 c4 ff ff ff ff 5b 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a9 da 0d 00 f7 d8 64 89 01 48
[ 95.092635] RSP: 002b:00007f0e39637f98 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
[ 95.092638] RAX: ffffffffffffffda RBX: 000000004020ae46 RCX: 00007f0e64d63397
[ 95.092641] RDX: 00007f0e39638060 RSI: 000000004020ae46 RDI: 0000000000000011
[ 95.092643] RBP: 00005641f1094f50 R08: 0000000000000000 R09: 00000000000c0000
[ 95.092645] R10: 00000000000c0000 R11: 0000000000000246 R12: 00007f0e39638060
[ 95.092647] R13: 0000000000020000 R14: 00005641f0d412e0 R15: 00000000000c0000
[ 95.092651] </TASK>
[ 95.092652] Modules linked in: dm_mod vhost_net vhost vhost_iotlb tap tun qrtr cpufreq_conservative cpufreq_userspace cpufreq_powersave cpufreq_ondemand uinput bridge stp llc binfmt_misc quota_v2 quota_tree intel_powerclamp kvm_intel kvm irqbypass intel_cstate squashfs intel_uncore snd_usb_audio amdgpu coretemp snd_usbmidi_lib pcspkr joydev snd_rawmidi loop snd_seq_device snd_hda_codec_hdmi evdev mc snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec snd_hda_core snd_hwdep ipmi_watchdog snd_pcm_oss ipmi_ssif sg snd_mixer_oss snd_pcm iTCO_wdt intel_pmc_bxt snd_timer snd iTCO_vendor_support watchdog gpu_sched soundcore acpi_cpufreq button acpi_ipmi ipmi_si ipmi_poweroff ipmi_devintf ipmi_msghandler msr i2c_dev parport_pc ppdev nfsd lp parport fuse nfs_acl lockd auth_rpcgss grace configfs sunrpc ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 crc32c_generic btrfs xor raid6_pq zstd_compress libcrc32c uhci_hcd hid_generic usbhid hid sd_mod t10_pi crc_t10dif crct10dif_gen
eric
[ 95.092721] sr_mod cdrom crct10dif_common radeon ahci i2c_algo_bit drm_ttm_helper libahci ttm libata drm_kms_helper tg3 xhci_pci cec scsi_mod rc_core xhci_hcd ehci_pci ehci_hcd libphy usbcore ptp i2c_i801 drm crc32c_intel scsi_common pps_core lpc_ich i2c_smbus usb_common
[ 95.092746] CR2: 000000000000000b
[ 95.092749] ---[ end trace 0000000000000000 ]---
[ 95.092751] RIP: 0010:kvm_replace_memslot+0xcf/0x390 [kvm]
[ 95.092793] Code: 44 24 08 48 85 db 0f 84 3b 02 00 00 48 89 ea 48 c1 e2 04 48 01 da 48 8b 4a 08 48 85 c9 74 1e 48 8b 32 48 89 31 48 85 f6 74 04 <48> 89 4e 08 48 c7 02 00 00 00 00 48 c7 42 08 00 00 00 00 48 8d 54
[ 95.092796] RSP: 0018:ffffa1fd87ffbd70 EFLAGS: 00010206
[ 95.092798] RAX: ffffa1fd87fb5058 RBX: ffff955e10f13200 RCX: ffffa1fd87fb5388
[ 95.092801] RDX: ffff955e10f13200 RSI: 0000000000000003 RDI: ffffa1fd87fb5000
[ 95.092803] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
[ 95.092805] R10: 0000000000000003 R11: 0000000000000004 R12: 0000000000000000
[ 95.092807] R13: 0000000000000000 R14: 0000000000000000 R15: ffffa1fd87fb5000
[ 95.092809] FS: 00007f0e39639640(0000) GS:ffff95606fd00000(0000) knlGS:0000000000000000
[ 95.092812] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 95.092814] CR2: 000000000000000b CR3: 00000001b4780000 CR4: 00000000000026e0
[ 190.869861] hrtimer: interrupt took 10082 ns
Reply to: