
Bug#812416: marked as done (drbd module general protection fault while shutting down system with drbd device)



Your message dated Sun, 09 May 2021 07:32:04 -0700 (PDT)
with message-id <6097f264.1c69fb81.6a5d.dee3@mx.google.com>
and subject line Closing this bug (BTS maintenance for src:linux bugs)
has caused the Debian Bug report #812416,
regarding drbd module general protection fault while shutting down system with drbd device
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@bugs.debian.org
immediately.)


-- 
812416: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=812416
Debian Bug Tracking System
Contact owner@bugs.debian.org with problems
--- Begin Message ---
Package: linux-image-4.3.0-0.bpo.1-amd64
Version: 4.3.3-5~bpo8+1

Kernel:
# uname -a
Linux bac1f6065n 4.3.0-0.bpo.1-amd64 #1 SMP Debian 4.3.3-5~bpo8+1 (2016-01-07) x86_64 GNU/Linux

Libc6:
# dpkg -s libc6 | grep Version
Version: 2.19-18+deb8u1

This issue occasionally happens while shutting down a system with a drbd block device.

I have traced it back to this line in the drbd module:

0x297e8 is in nla_put_status_info (include/linux/list.h:189).
184	 * list_empty - tests whether a list is empty
185	 * @head: the list to test.
186	 */
187	static inline int list_empty(const struct list_head *head)
188	{
189		return head->next == head;
190	}
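
For readers unfamiliar with the kernel list helpers, here is a minimal userspace sketch of what that check does. It is not the kernel header itself: `struct list_head` and `list_empty` mirror the listing above, while `init_list_head` and `list_add_front` are simplified, hypothetical stand-ins written for this illustration. The relevant point is that the test loads `head->next`, so it can only fault if `head` itself does not point at valid memory.

```c
#include <stdbool.h>

/* Simplified userspace copy of the kernel's doubly linked list node. */
struct list_head {
    struct list_head *next;
    struct list_head *prev;
};

/* An empty list is a head whose next and prev point back at itself. */
static inline void init_list_head(struct list_head *head)
{
    head->next = head;
    head->prev = head;
}

/* Same comparison as include/linux/list.h:189; note the load of head->next. */
static inline bool list_empty(const struct list_head *head)
{
    return head->next == head;
}

/* Insert entry directly after head (hypothetical helper for this sketch). */
static inline void list_add_front(struct list_head *entry,
                                  struct list_head *head)
{
    entry->next = head->next;
    entry->prev = head;
    head->next->prev = entry;
    head->next = entry;
}
```

A freshly initialised head reports empty, and stops doing so once an entry is linked in. Nothing in the function validates `head`, so a stale or overwritten pointer passed in is dereferenced as-is.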

+++++++++++++++++++++++++++++++++
Jan 21 14:22:43 bac1f6065n kernel: [ 218.869983] device-mapper: uevent: version 1.0.3
Jan 21 14:22:43 bac1f6065n kernel: [ 218.870394] device-mapper: ioctl: 4.33.0-ioctl (2015-8-18) initialised: dm-devel@redhat.com
Jan 21 14:23:35 bac1f6065n kernel: [ 271.355054] drbd r0: peer( Primary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown )
Jan 21 14:23:35 bac1f6065n kernel: [ 271.355078] drbd r0: asender terminated
Jan 21 14:23:35 bac1f6065n kernel: [ 271.355082] drbd r0: Terminating drbd_a_r0
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363802] drbd r0: Connection closed
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363826] drbd r0: conn( Disconnecting -> StandAlone )
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363835] drbd r0: receiver terminated
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363838] drbd r0: Terminating drbd_r_r0
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363897] block drbd1: disk( UpToDate -> Failed )
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363951] block drbd1: bitmap WRITE of 0 pages took 0 jiffies
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363955] block drbd1: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363962] block drbd1: disk( Failed -> Diskless )
Jan 21 14:23:35 bac1f6065n kernel: [ 271.365999] general protection fault: 0000 1 SMP
Jan 21 14:23:35 bac1f6065n kernel: [ 271.366056] Modules linked in: dm_mod cpuid vmw_vsock_vmci_transport iosf_mbi coretemp crct10dif_pclmul crc32_pclmul sha256_ssse3 sha256_generic hmac drbg ansi_cprng aesni_intel ppdev aes_x86_64 lrw gf128mul glue_helper ablk_helper vmw_balloon evdev psmouse serio_raw cryptd pcspkr battery vmwgfx ttm drm_kms_helper i2c_piix4 drm 8250_fintek shpchp nf_conntrack_tftp parport_pc parport acpi_cpufreq processor ac button nf_conntrack bonding drbd lru_cache libcrc32c autofs4 ext4 crc16 mbcache jbd2 virtio_blk virtio_net virtio_pci virtio_ring virtio kvm vsock vmw_pvscsi sg sd_mod sr_mod cdrom ata_generic crc32c_intel vmxnet3 mptspi scsi_transport_spi mptscsih mptbase ata_piix libata scsi_mod vmw_vmci
Jan 21 14:23:35 bac1f6065n kernel: [ 271.366746] CPU: 0 PID: 1017 Comm: drbd_w_r0 Not tainted 4.3.0-0.bpo.1-amd64 #1 Debian 4.3.3-5~bpo8+1
Jan 21 14:23:35 bac1f6065n kernel: [ 271.366815] Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 09/30/2014
Jan 21 14:23:35 bac1f6065n kernel: [ 271.366894] task: ffff88003c6f8200 ti: ffff880035150000 task.ti: ffff880035150000
Jan 21 14:23:35 bac1f6065n kernel: [ 271.366949] RIP: 0010:[<ffffffffa02ae7b8>] [<ffffffffa02ae7b8>] nla_put_status_info+0x58/0x910 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367085] RSP: 0018:ffff880035153c88 EFLAGS: 00010202
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367126] RAX: 000000000000000e RBX: ffff8800382f6100 RCX: 0000000000000000
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367185] RDX: ffff880035153de8 RSI: ffff88003bb8a000 RDI: ffff8800382f6100
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367237] RBP: ffff88003bb8a000 R08: 000000000000000c R09: 0000000000000004
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367412] R10: ffff88003c35601c R11: 0000000000001ec0 R12: fdfdfdfdfdfdfdfd
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367469] R13: ffff880035153de8 R14: 0000000000000001 R15: 000000000000c002
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367523] FS: 0000000000000000(0000) GS:ffff88003fc00000(0000) knlGS:0000000000000000
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367582] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367630] CR2: 00007fc28daee610 CR3: 0000000035ca5000 CR4: 00000000000006f0
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367778] Stack:
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367802] 000000013c480000 ffffffff8147f127 0000000000000246 ffff8800382f6100
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367868] ffff880035153d17 0000000000000010 0000000000001040 00000000ffffffff
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367933] 0000000000000000 ffffffff8147e31e ffff8800382f6100 0000000000000f00
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367998] Call Trace:
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368070] [<ffffffff8147f127>] ? __alloc_skb+0x57/0x200
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368124] [<ffffffff8147e31e>] ? __kmalloc_reserve.isra.31+0x2e/0x80
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368190] [<ffffffff814bfe58>] ? __nlmsg_put+0x68/0x80
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368244] [<ffffffffa02b5d0a>] ? drbd_bcast_event+0x8a/0x160 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368301] [<ffffffffa02ba896>] ? w_after_state_ch+0x166/0x22b0 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368359] [<ffffffffa028f111>] ? drbd_worker+0x101/0x390 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368413] [<ffffffffa02a4290>] ? drbd_destroy_connection+0xf0/0xf0 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.370060] [<ffffffffa02a42d6>] ? drbd_thread_setup+0x46/0x120 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.371590] [<ffffffffa02a4290>] ? drbd_destroy_connection+0xf0/0xf0 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.373244] [<ffffffff8109222f>] ? kthread+0xdf/0x100
Jan 21 14:23:35 bac1f6065n kernel: [ 271.374983] [<ffffffff81092150>] ? kthread_park+0x50/0x50
Jan 21 14:23:35 bac1f6065n kernel: [ 271.376499] [<ffffffff8158971f>] ? ret_from_fork+0x3f/0x70
Jan 21 14:23:35 bac1f6065n kernel: [ 271.378054] [<ffffffff81092150>] ? kthread_park+0x50/0x50
Jan 21 14:23:35 bac1f6065n kernel: [ 271.379394] Code: 84 24 90 00 00 00 31 c0 48 85 d2 c7 44 24 04 01 00 00 00 0f 84 0b 07 00 00 0f b6 85 1d 02 00 00 a8 1e 0f 85 82 02 00 00 45 31 f6 <49> 8b 44 24 58 49 8d 54 24 58 48 39 c2 0f 84 7b 06 00 00 48 3b
Jan 21 14:23:35 bac1f6065n kernel: [ 271.383803] RIP [<ffffffffa02ae7b8>] nla_put_status_info+0x58/0x910 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.385198] RSP <ffff880035153c88>
Jan 21 14:23:35 bac1f6065n kernel: [ 271.386565] --[ end trace b590fa9c15df0091 ]--
+++++++++++++++++++++++++++++++++
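
One possible reading of the register dump, offered as a sketch rather than a confirmed diagnosis: the bytes at the faulting instruction (`49 8b 44 24 58`) decode to a 64-bit load from `0x58(%r12)`, and R12 holds `fdfdfdfdfdfdfdfd`. Assuming 48-bit virtual addresses (4-level paging), that address is non-canonical, and a non-canonical access raises a general protection fault instead of a page fault. The helper names `is_canonical_48` and `effective_address` are hypothetical and exist only for this illustration.

```c
#include <stdbool.h>
#include <stdint.h>

/*
 * With 48-bit virtual addresses, bits 63..47 must all be copies of
 * bit 47. Assumption: 4-level paging on x86-64.
 */
static inline bool is_canonical_48(uint64_t addr)
{
    uint64_t upper = addr >> 47;    /* the 17 bits 63..47 */
    return upper == 0 || upper == 0x1ffff;
}

/* Effective address for a base register plus a signed displacement. */
static inline uint64_t effective_address(uint64_t base, int32_t disp)
{
    return base + (uint64_t)(int64_t)disp;
}
```

Applied to the values in the log, `effective_address(0xfdfdfdfdfdfdfdfdULL, 0x58)` fails `is_canonical_48`, whereas the RBX value `0xffff8800382f6100` passes. The repeating `0xfd` byte pattern looks like a fill value rather than a real pointer, although the log alone does not show what wrote it.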

Fernando

--- End Message ---
--- Begin Message ---
Hi

This bug was filed against a very old kernel, or the bug itself is old
and has remained without resolution.

If you can reproduce it with

- the current version in unstable/testing
- the latest kernel from backports

please reopen the bug, see https://www.debian.org/Bugs/server-control
for details.

Regards,
Salvatore

--- End Message ---
