[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#782774: marked as done (linux-image-3.16.0-4-amd64: Nested KVM regression: "BUG: soft lockup - CPU#1 stuck for 22s! [qemu-system-x86:1221]")



Your message dated Sat, 08 May 2021 09:09:37 -0700 (PDT)
with message-id <6096b7c1.1c69fb81.149ac.7df9@mx.google.com>
and subject line Closing this bug (BTS maintenance for src:linux bugs)
has caused the Debian Bug report #782774,
regarding linux-image-3.16.0-4-amd64: Nested KVM regression: "BUG: soft lockup - CPU#1 stuck for 22s! [qemu-system-x86:1221]"
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@bugs.debian.org
immediately.)


-- 
782774: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=782774
Debian Bug Tracking System
Contact owner@bugs.debian.org with problems
--- Begin Message ---
Package: src:linux
Version: 3.16.7-ckt9-2
Severity: normal


In a level 1 KVM guest, starting a level 2 KVM guest with QEMU fails and
triggers "soft lockup" messages on the serial console. Then the level 1 KVM
guest becomes unresponsive.

That's a regression since Wheezy: this does not happen (everything works fine,
on the very same level 1 KVM guest, when running Wheezy's kernel.

The host system is running Wheezy + Linux from wheezy-backports.

The level 1 KVM guest is running Jessie, and is using statically allocated
HugePages as its memory backend.

The level 2 KVM guest is started with:

sudo qemu-system-x86_64 -cdrom /srv/tails-i386-1.3.2.iso -m 1280 -nographic -machine pc-0.15,accel=kvm

I could also reproduce this bug with:

   -machine pc-0.15,accel=kvm,mem-merge=off
   -machine pc-q35-2.1,accel=kvm

The various messages I could see on the serial console can be found at
https://labs.riseup.net/code/issues/9157 -- you'll see there various other
tests and their results.

See in particular:
https://labs.riseup.net/code/attachments/download/739/trace1
https://labs.riseup.net/code/attachments/download/740/trace2

If you prefer, I can attach these logs to this bug report.

I can also reproduce the bug with the 3.19.3-1~exp1 kernel.

-- Package-specific info:
** Version:
Linux version 3.16.0-4-amd64 (debian-kernel@lists.debian.org) (gcc version 4.8.4 (Debian 4.8.4-1) ) #1 SMP Debian 3.16.7-ckt9-2 (2015-04-13)

** Command line:
BOOT_IMAGE=/vmlinuz-3.16.0-4-amd64 root=/dev/mapper/vg1-root ro apparmor=1 security=apparmor console=ttyS0,115200n8 quiet

** Not tainted

** Kernel log:
[    0.643097] uhci_hcd: USB Universal Host Controller Interface driver
[    0.643499] uhci_hcd 0000:00:01.2: UHCI Host Controller
[    0.643509] uhci_hcd 0000:00:01.2: new USB bus registered, assigned bus number 1
[    0.643534] uhci_hcd 0000:00:01.2: detected 2 ports
[    0.643655] uhci_hcd 0000:00:01.2: irq 11, io base 0x0000c0c0
[    0.643774] usb usb1: New USB device found, idVendor=1d6b, idProduct=0001
[    0.643777] usb usb1: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[    0.643779] usb usb1: Product: UHCI Host Controller
[    0.643782] usb usb1: Manufacturer: Linux 3.16.0-4-amd64 uhci_hcd
[    0.643784] usb usb1: SerialNumber: 0000:00:01.2
[    0.643934] hub 1-0:1.0: USB hub found
[    0.643942] hub 1-0:1.0: 2 ports detected
[    0.646119] ACPI: PCI Interrupt Link [LNKC] enabled at IRQ 10
[    0.648472] SCSI subsystem initialized
[    0.648633] ACPI: PCI Interrupt Link [LNKA] enabled at IRQ 10
[    0.649339] ACPI: PCI Interrupt Link [LNKB] enabled at IRQ 11
[    0.649946] libata version 3.00 loaded.
[    0.651443] ata_piix 0000:00:01.1: version 2.13
[    0.652063] scsi0 : ata_piix
[    0.652184] scsi1 : ata_piix
[    0.652226] ata1: PATA max MWDMA2 cmd 0x1f0 ctl 0x3f6 bmdma 0xc120 irq 14
[    0.652228] ata2: PATA max MWDMA2 cmd 0x170 ctl 0x376 bmdma 0xc128 irq 15
[    0.664521] FDC 0 is a S82078B
[    0.681997] virtio-pci 0000:00:03.0: irq 40 for MSI/MSI-X
[    0.682018] virtio-pci 0000:00:03.0: irq 41 for MSI/MSI-X
[    0.682037] virtio-pci 0000:00:03.0: irq 42 for MSI/MSI-X
[    0.682676] virtio-pci 0000:00:04.0: irq 43 for MSI/MSI-X
[    0.682801] virtio-pci 0000:00:04.0: irq 44 for MSI/MSI-X
[    0.684580]  vda: vda1 vda2
[    0.685193] virtio-pci 0000:00:06.0: irq 45 for MSI/MSI-X
[    0.685214] virtio-pci 0000:00:06.0: irq 46 for MSI/MSI-X
[    0.686163]  vdb: unknown partition table
[    0.686666] virtio-pci 0000:00:07.0: irq 47 for MSI/MSI-X
[    0.686686] virtio-pci 0000:00:07.0: irq 48 for MSI/MSI-X
[    0.687637]  vdc: unknown partition table
[    0.869798] device-mapper: uevent: version 1.0.3
[    0.869907] device-mapper: ioctl: 4.27.0-ioctl (2013-10-30) initialised: dm-devel@redhat.com
[    0.956079] usb 1-1: new full-speed USB device number 2 using uhci_hcd
[    1.040014] raid6: sse2x1    4759 MB/s
[    1.108009] raid6: sse2x2    7184 MB/s
[    1.176014] raid6: sse2x4    8373 MB/s
[    1.176016] raid6: using algorithm sse2x4 (8373 MB/s)
[    1.176017] raid6: using ssse3x2 recovery algorithm
[    1.176514] xor: automatically using best checksumming function:
[    1.204911] usb 1-1: New USB device found, idVendor=0627, idProduct=0001
[    1.204916] usb 1-1: New USB device strings: Mfr=1, Product=3, SerialNumber=5
[    1.204918] usb 1-1: Product: QEMU USB Tablet
[    1.204921] usb 1-1: Manufacturer: QEMU 1.1.2
[    1.204923] usb 1-1: SerialNumber: 42
[    1.213391] hidraw: raw HID events driver (C) Jiri Kosina
[    1.216009]    avx       : 18136.000 MB/sec
[    1.221592] Btrfs loaded
[    1.222239] usbcore: registered new interface driver usbhid
[    1.222243] usbhid: USB HID core driver
[    1.225889] input: QEMU 1.1.2 QEMU USB Tablet as /devices/pci0000:00/0000:00:01.2/usb1/1-1/1-1:1.0/0003:0627:0001.0001/input/input2
[    1.226109] hid-generic 0003:0627:0001.0001: input,hidraw0: USB HID v0.01 Pointer [QEMU 1.1.2 QEMU USB Tablet] on usb-0000:00:01.2-1/input0
[    1.499054] EXT4-fs (dm-0): mounted filesystem with ordered data mode. Opts: (null)
[    1.540148] tsc: Refined TSC clocksource calibration: 1799.962 MHz
[    3.140909] systemd[1]: Cannot add dependency job for unit display-manager.service, ignoring: Unit display-manager.service failed to load: No such file or directory.
[    3.234219] fuse init (API version 7.23)
[    3.290834] loop: module loaded
[    3.413794] systemd-udevd[195]: starting version 215
[    3.782819] input: Power Button as /devices/LNXSYSTM:00/LNXPWRBN:00/input/input3
[    3.782827] ACPI: Power Button [PWRF]
[    3.874144] input: PC Speaker as /devices/platform/pcspkr/input/input4
[    3.892594] [drm] Initialized drm 1.1.0 20060810
[    3.935690] ppdev: user-space parallel port driver
[    3.942719] piix4_smbus 0000:00:01.3: SMBus Host Controller at 0xb100, revision 0
[    4.037948] AVX version of gcm_enc/dec engaged.
[    4.041123] alg: No test for __gcm-aes-aesni (__driver-gcm-aes-aesni)
[    4.069710] alg: No test for crc32 (crc32-pclmul)
[    4.145293] intel_rapl: no valid rapl domains found in package 0
[    4.158321] intel_rapl: no valid rapl domains found in package 0
[    4.392146] EXT4-fs (dm-0): re-mounted. Opts: errors=remount-ro
[    4.455201] EXT4-fs (vdb): recovery complete
[    4.455208] EXT4-fs (vdb): mounted filesystem with ordered data mode. Opts: acl
[    4.455282] EXT4-fs (vdc): recovery complete
[    4.455289] EXT4-fs (vdc): mounted filesystem with ordered data mode. Opts: acl
[    4.742382] input: ImExPS/2 Generic Explorer Mouse as /devices/platform/i8042/serio1/input/input5
[    5.220431] EXT4-fs (vda1): mounting ext2 file system using the ext4 subsystem
[    5.227651] EXT4-fs (vda1): mounted filesystem without journal. Opts: (null)
[    5.234126] systemd-journald[177]: Received request to flush runtime journal from PID 1
[    5.561185] audit: type=1400 audit(1429286754.484:2): apparmor="STATUS" operation="profile_load" name="/{usr/,}bin/ping" pid=492 comm="apparmor_parser"
[    5.568313] audit: type=1400 audit(1429286754.492:3): apparmor="STATUS" operation="profile_load" name="gst_plugin_scanner" pid=492 comm="apparmor_parser"
[    5.582990] audit: type=1400 audit(1429286754.504:4): apparmor="STATUS" operation="profile_load" name="/sbin/klogd" pid=492 comm="apparmor_parser"
[    5.605687] audit: type=1400 audit(1429286754.528:5): apparmor="STATUS" operation="profile_load" name="/sbin/syslog-ng" pid=492 comm="apparmor_parser"
[    5.667149] audit: type=1400 audit(1429286754.588:6): apparmor="STATUS" operation="profile_load" name="/sbin/syslogd" pid=492 comm="apparmor_parser"
[    5.693422] audit: type=1400 audit(1429286754.616:7): apparmor="STATUS" operation="profile_load" name="system_tor" pid=492 comm="apparmor_parser"
[    5.746398] audit: type=1400 audit(1429286754.668:8): apparmor="STATUS" operation="profile_load" name="/usr/lib/chromium-browser/chromium-browser" pid=492 comm="apparmor_parser"
[    5.746405] audit: type=1400 audit(1429286754.668:9): apparmor="STATUS" operation="profile_load" name="browser_java" pid=492 comm="apparmor_parser"
[    5.746409] audit: type=1400 audit(1429286754.668:10): apparmor="STATUS" operation="profile_load" name="browser_openjdk" pid=492 comm="apparmor_parser"
[    5.746413] audit: type=1400 audit(1429286754.668:11): apparmor="STATUS" operation="profile_load" name="chromium_browser_sandbox" pid=492 comm="apparmor_parser"
[    6.338748] RPC: Registered named UNIX socket transport module.
[    6.338753] RPC: Registered udp transport module.
[    6.338755] RPC: Registered tcp transport module.
[    6.338757] RPC: Registered tcp NFSv4.1 backchannel transport module.
[    6.362961] FS-Cache: Loaded
[    6.393745] FS-Cache: Netfs 'nfs' registered for caching
[    6.435512] Installing knfsd (copyright (C) 1996 okir@monad.swb.de).
[   26.092089] random: nonblocking pool is initialized

** Model information
sys_vendor: Bochs
product_name: Bochs
product_version: 
chassis_vendor: Bochs
chassis_version: 
bios_vendor: Bochs
bios_version: Bochs

** Loaded modules:
nfsd
auth_rpcgss
oid_registry
nfs_acl
nfs
lockd
fscache
sunrpc
kvm_intel
kvm
crc32_pclmul
ghash_clmulni_intel
aesni_intel
aes_x86_64
lrw
gf128mul
glue_helper
ablk_helper
cryptd
i2c_piix4
ttm
ppdev
joydev
drm_kms_helper
psmouse
drm
pcspkr
i2c_core
parport_pc
evdev
virtio_balloon
serio_raw
parport
processor
thermal_sys
button
loop
fuse
autofs4
ext4
crc16
mbcache
jbd2
hid_generic
btrfs
usbhid
hid
xor
raid6_pq
dm_mod
md_mod
ata_generic
virtio_blk
virtio_net
crct10dif_pclmul
crct10dif_common
crc32c_intel
floppy
ata_piix
libata
scsi_mod
virtio_pci
uhci_hcd
virtio_ring
virtio
ehci_hcd
usbcore
usb_common

** PCI devices:
00:00.0 Host bridge [0600]: Intel Corporation 440FX - 82441FX PMC [Natoma] [8086:1237] (rev 02)
	Subsystem: Red Hat, Inc Qemu virtual machine [1af4:1100]
	Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-

00:01.0 ISA bridge [0601]: Intel Corporation 82371SB PIIX3 ISA [Natoma/Triton II] [8086:7000]
	Subsystem: Red Hat, Inc Qemu virtual machine [1af4:1100]
	Physical Slot: 1
	Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-

00:01.1 IDE interface [0101]: Intel Corporation 82371SB PIIX3 IDE [Natoma/Triton II] [8086:7010] (prog-if 80 [Master])
	Subsystem: Red Hat, Inc Qemu virtual machine [1af4:1100]
	Physical Slot: 1
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Region 0: [virtual] Memory at 000001f0 (32-bit, non-prefetchable) [size=8]
	Region 1: [virtual] Memory at 000003f0 (type 3, non-prefetchable)
	Region 2: [virtual] Memory at 00000170 (32-bit, non-prefetchable) [size=8]
	Region 3: [virtual] Memory at 00000370 (type 3, non-prefetchable)
	Region 4: I/O ports at c120 [size=16]
	Kernel driver in use: ata_piix

00:01.2 USB controller [0c03]: Intel Corporation 82371SB PIIX3 USB [Natoma/Triton II] [8086:7020] (rev 01) (prog-if 00 [UHCI])
	Subsystem: Red Hat, Inc QEMU Virtual Machine [1af4:1100]
	Physical Slot: 1
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin D routed to IRQ 11
	Region 4: I/O ports at c0c0 [size=32]
	Kernel driver in use: uhci_hcd

00:01.3 Bridge [0680]: Intel Corporation 82371AB/EB/MB PIIX4 ACPI [8086:7113] (rev 03)
	Subsystem: Red Hat, Inc Qemu virtual machine [1af4:1100]
	Physical Slot: 1
	Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Interrupt: pin A routed to IRQ 9
	Kernel driver in use: piix4_smbus

00:02.0 VGA compatible controller [0300]: Cirrus Logic GD 5446 [1013:00b8] (prog-if 00 [VGA controller])
	Subsystem: Red Hat, Inc QEMU Virtual Machine [1af4:1100]
	Physical Slot: 2
	Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Region 0: Memory at fc000000 (32-bit, prefetchable) [size=32M]
	Region 1: Memory at febf0000 (32-bit, non-prefetchable) [size=4K]
	Expansion ROM at febd0000 [disabled] [size=64K]

00:03.0 Ethernet controller [0200]: Red Hat, Inc Virtio network device [1af4:1000]
	Subsystem: Red Hat, Inc Device [1af4:0001]
	Physical Slot: 3
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 10
	Region 0: I/O ports at c0e0 [size=32]
	Region 1: Memory at febf1000 (32-bit, non-prefetchable) [size=4K]
	Expansion ROM at febe0000 [disabled] [size=64K]
	Capabilities: <access denied>
	Kernel driver in use: virtio-pci

00:04.0 SCSI storage controller [0100]: Red Hat, Inc Virtio block device [1af4:1001]
	Subsystem: Red Hat, Inc Device [1af4:0002]
	Physical Slot: 4
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 11
	Region 0: I/O ports at c000 [size=64]
	Region 1: Memory at febf2000 (32-bit, non-prefetchable) [size=4K]
	Capabilities: <access denied>
	Kernel driver in use: virtio-pci

00:05.0 Unclassified device [00ff]: Red Hat, Inc Virtio memory balloon [1af4:1002]
	Subsystem: Red Hat, Inc Device [1af4:0005]
	Physical Slot: 5
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
	Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 10
	Region 0: I/O ports at c100 [size=32]
	Kernel driver in use: virtio-pci

00:06.0 SCSI storage controller [0100]: Red Hat, Inc Virtio block device [1af4:1001]
	Subsystem: Red Hat, Inc Device [1af4:0002]
	Physical Slot: 6
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 11
	Region 0: I/O ports at c040 [size=64]
	Region 1: Memory at febf3000 (32-bit, non-prefetchable) [size=4K]
	Capabilities: <access denied>
	Kernel driver in use: virtio-pci

00:07.0 SCSI storage controller [0100]: Red Hat, Inc Virtio block device [1af4:1001]
	Subsystem: Red Hat, Inc Device [1af4:0002]
	Physical Slot: 7
	Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
	Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
	Latency: 0
	Interrupt: pin A routed to IRQ 10
	Region 0: I/O ports at c080 [size=64]
	Region 1: Memory at febf4000 (32-bit, non-prefetchable) [size=4K]
	Capabilities: <access denied>
	Kernel driver in use: virtio-pci


** USB devices:
Bus 001 Device 002: ID 0627:0001 Adomax Technology Co., Ltd 
Bus 001 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub


-- System Information:
Debian Release: 8.0
  APT prefers testing
  APT policy: (990, 'testing'), (1, 'unstable')
Architecture: amd64 (x86_64)

Kernel: Linux 3.16.0-4-amd64 (SMP w/3 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages linux-image-3.16.0-4-amd64 depends on:
ii  debconf [debconf-2.0]                   1.5.56
ii  initramfs-tools [linux-initramfs-tool]  0.120
ii  kmod                                    18-3
ii  linux-base                              3.5
ii  module-init-tools                       18-3

Versions of packages linux-image-3.16.0-4-amd64 recommends:
ii  firmware-linux-free  3.3
ii  irqbalance           1.0.6-3

Versions of packages linux-image-3.16.0-4-amd64 suggests:
pn  debian-kernel-handbook  <none>
ii  extlinux                3:6.03+dfsg-5
ii  grub-pc                 2.02~beta2-22
pn  linux-doc-3.16          <none>

Versions of packages linux-image-3.16.0-4-amd64 is related to:
pn  firmware-atheros        <none>
pn  firmware-bnx2           <none>
pn  firmware-bnx2x          <none>
pn  firmware-brcm80211      <none>
pn  firmware-intelwimax     <none>
pn  firmware-ipw2x00        <none>
pn  firmware-ivtv           <none>
pn  firmware-iwlwifi        <none>
pn  firmware-libertas       <none>
pn  firmware-linux          <none>
pn  firmware-linux-nonfree  <none>
pn  firmware-myricom        <none>
pn  firmware-netxen         <none>
pn  firmware-qlogic         <none>
pn  firmware-ralink         <none>
pn  firmware-realtek        <none>
pn  xen-hypervisor          <none>

-- debconf information:
  linux-image-3.16.0-4-amd64/prerm/removing-running-kernel-3.16.0-4-amd64: true
  linux-image-3.16.0-4-amd64/postinst/depmod-error-initrd-3.16.0-4-amd64: false
  linux-image-3.16.0-4-amd64/postinst/mips-initrd-3.16.0-4-amd64:

--- End Message ---
--- Begin Message ---
Hi

This bug was filed for a very old kernel or the bug is old itself
without resolution.

If you can reproduce it with

- the current version in unstable/testing
- the latest kernel from backports

please reopen the bug, see https://www.debian.org/Bugs/server-control
for details.

Regards,
Salvatore

--- End Message ---

Reply to: