
Bug#812416: drbd module general protection fault while shutting down system with drbd device



Package: linux-image-4.3.0-0.bpo.1-amd64
Version: 4.3.3-5~bpo8+1

Kernel:
# uname -a
Linux bac1f6065n 4.3.0-0.bpo.1-amd64 #1 SMP Debian 4.3.3-5~bpo8+1 (2016-01-07) x86_64 GNU/Linux

Libc6:
# dpkg -s libc6 | grep Version
Version: 2.19-18+deb8u1

The general protection fault below occurs intermittently while shutting down a system that has a drbd block device.

I traced the faulting offset back to this line, which is inlined into nla_put_status_info in the drbd module:

0x297e8 is in nla_put_status_info (include/linux/list.h:189).
184	* list_empty - tests whether a list is empty
185	* @head: the list to test.
186	*/
187	static inline int list_empty(const struct list_head *head)
188	{
189		return head->next == head;
190	}

+++++++++++++++++++++++++++++++++
Jan 21 14:22:43 bac1f6065n kernel: [ 218.869983] device-mapper: uevent: version 1.0.3
Jan 21 14:22:43 bac1f6065n kernel: [ 218.870394] device-mapper: ioctl: 4.33.0-ioctl (2015-8-18) initialised: dm-devel@redhat.com
Jan 21 14:23:35 bac1f6065n kernel: [ 271.355054] drbd r0: peer( Primary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate -> DUnknown )
Jan 21 14:23:35 bac1f6065n kernel: [ 271.355078] drbd r0: asender terminated
Jan 21 14:23:35 bac1f6065n kernel: [ 271.355082] drbd r0: Terminating drbd_a_r0
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363802] drbd r0: Connection closed
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363826] drbd r0: conn( Disconnecting -> StandAlone )
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363835] drbd r0: receiver terminated
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363838] drbd r0: Terminating drbd_r_r0
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363897] block drbd1: disk( UpToDate -> Failed )
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363951] block drbd1: bitmap WRITE of 0 pages took 0 jiffies
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363955] block drbd1: 0 KB (0 bits) marked out-of-sync by on disk bit-map.
Jan 21 14:23:35 bac1f6065n kernel: [ 271.363962] block drbd1: disk( Failed -> Diskless )
Jan 21 14:23:35 bac1f6065n kernel: [ 271.365999] general protection fault: 0000 [#1] SMP
Jan 21 14:23:35 bac1f6065n kernel: [ 271.366056] Modules linked in: dm_mod cpuid vmw_vsock_vmci_transport iosf_mbi coretemp crct10dif_pclmul crc32_pclmul sha256_ssse3 sha256_generic hmac drbg ansi_cprng aesni_intel ppdev aes_x86_64 lrw gf128mul glue_helper ablk_helper vmw_balloon evdev psmouse serio_raw cryptd pcspkr battery vmwgfx ttm drm_kms_helper i2c_piix4 drm 8250_fintek shpchp nf_conntrack_tftp parport_pc parport acpi_cpufreq processor ac button nf_conntrack bonding drbd lru_cache libcrc32c autofs4 ext4 crc16 mbcache jbd2 virtio_blk virtio_net virtio_pci virtio_ring virtio kvm vsock vmw_pvscsi sg sd_mod sr_mod cdrom ata_generic crc32c_intel vmxnet3 mptspi scsi_transport_spi mptscsih mptbase ata_piix libata scsi_mod vmw_vmci
Jan 21 14:23:35 bac1f6065n kernel: [ 271.366746] CPU: 0 PID: 1017 Comm: drbd_w_r0 Not tainted 4.3.0-0.bpo.1-amd64 #1 Debian 4.3.3-5~bpo8+1
Jan 21 14:23:35 bac1f6065n kernel: [ 271.366815] Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 09/30/2014
Jan 21 14:23:35 bac1f6065n kernel: [ 271.366894] task: ffff88003c6f8200 ti: ffff880035150000 task.ti: ffff880035150000
Jan 21 14:23:35 bac1f6065n kernel: [ 271.366949] RIP: 0010:[<ffffffffa02ae7b8>] [<ffffffffa02ae7b8>] nla_put_status_info+0x58/0x910 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367085] RSP: 0018:ffff880035153c88 EFLAGS: 00010202
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367126] RAX: 000000000000000e RBX: ffff8800382f6100 RCX: 0000000000000000
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367185] RDX: ffff880035153de8 RSI: ffff88003bb8a000 RDI: ffff8800382f6100
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367237] RBP: ffff88003bb8a000 R08: 000000000000000c R09: 0000000000000004
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367412] R10: ffff88003c35601c R11: 0000000000001ec0 R12: fdfdfdfdfdfdfdfd
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367469] R13: ffff880035153de8 R14: 0000000000000001 R15: 000000000000c002
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367523] FS: 0000000000000000(0000) GS:ffff88003fc00000(0000) knlGS:0000000000000000
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367582] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367630] CR2: 00007fc28daee610 CR3: 0000000035ca5000 CR4: 00000000000006f0
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367778] Stack:
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367802] 000000013c480000 ffffffff8147f127 0000000000000246 ffff8800382f6100
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367868] ffff880035153d17 0000000000000010 0000000000001040 00000000ffffffff
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367933] 0000000000000000 ffffffff8147e31e ffff8800382f6100 0000000000000f00
Jan 21 14:23:35 bac1f6065n kernel: [ 271.367998] Call Trace:
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368070] [<ffffffff8147f127>] ? __alloc_skb+0x57/0x200
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368124] [<ffffffff8147e31e>] ? __kmalloc_reserve.isra.31+0x2e/0x80
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368190] [<ffffffff814bfe58>] ? __nlmsg_put+0x68/0x80
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368244] [<ffffffffa02b5d0a>] ? drbd_bcast_event+0x8a/0x160 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368301] [<ffffffffa02ba896>] ? w_after_state_ch+0x166/0x22b0 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368359] [<ffffffffa028f111>] ? drbd_worker+0x101/0x390 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.368413] [<ffffffffa02a4290>] ? drbd_destroy_connection+0xf0/0xf0 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.370060] [<ffffffffa02a42d6>] ? drbd_thread_setup+0x46/0x120 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.371590] [<ffffffffa02a4290>] ? drbd_destroy_connection+0xf0/0xf0 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.373244] [<ffffffff8109222f>] ? kthread+0xdf/0x100
Jan 21 14:23:35 bac1f6065n kernel: [ 271.374983] [<ffffffff81092150>] ? kthread_park+0x50/0x50
Jan 21 14:23:35 bac1f6065n kernel: [ 271.376499] [<ffffffff8158971f>] ? ret_from_fork+0x3f/0x70
Jan 21 14:23:35 bac1f6065n kernel: [ 271.378054] [<ffffffff81092150>] ? kthread_park+0x50/0x50
Jan 21 14:23:35 bac1f6065n kernel: [ 271.379394] Code: 84 24 90 00 00 00 31 c0 48 85 d2 c7 44 24 04 01 00 00 00 0f 84 0b 07 00 00 0f b6 85 1d 02 00 00 a8 1e 0f 85 82 02 00 00 45 31 f6 <49> 8b 44 24 58 49 8d 54 24 58 48 39 c2 0f 84 7b 06 00 00 48 3b
Jan 21 14:23:35 bac1f6065n kernel: [ 271.383803] RIP [<ffffffffa02ae7b8>] nla_put_status_info+0x58/0x910 [drbd]
Jan 21 14:23:35 bac1f6065n kernel: [ 271.385198] RSP <ffff880035153c88>
Jan 21 14:23:35 bac1f6065n kernel: [ 271.386565] --[ end trace b590fa9c15df0091 ]--
+++++++++++++++++++++++++++++++++
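
One possible reading of the trace above, offered as a sketch and not a confirmed diagnosis: the faulting bytes at RIP (<49> 8b 44 24 58) appear to decode to a load from 0x58(%r12), which would match the head->next read in list_empty, and R12 holds fdfdfdfdfdfdfdfd. That value plus 0x58 is not a canonical x86-64 address, which would be consistent with a general protection fault instead of an ordinary page fault. The userspace C below only illustrates that check. It assumes 48-bit virtual addresses; is_canonical_48 and faulting_address are hypothetical helper names, not kernel or drbd functions, and the struct is a stand-in for the kernel's list head.

```c
#include <stdbool.h>
#include <stdint.h>

/* Minimal userspace stand-in for the kernel's list head, for illustration only. */
struct list_head {
    struct list_head *next, *prev;
};

/* Same test as include/linux/list.h: it reads head->next, so head must be a valid pointer. */
static inline int list_empty(const struct list_head *head)
{
    return head->next == head;
}

/*
 * Hypothetical helper (not a kernel API): true if addr is canonical on
 * x86-64 with 48-bit virtual addresses, i.e. bits 63..47 are all equal.
 */
static inline bool is_canonical_48(uint64_t addr)
{
    uint64_t top = addr >> 47;          /* the upper 17 bits */
    return top == 0 || top == 0x1ffff;
}

/* Address the faulting instruction would read: R12 plus the 0x58 displacement. */
static inline uint64_t faulting_address(uint64_t r12)
{
    return r12 + 0x58;
}
```

Calling is_canonical_48(faulting_address(0xfdfdfdfdfdfdfdfdULL)) returns false, while a normal kernel pointer from the same register dump, such as RBX (ffff8800382f6100), passes the check. Where the fdfd... pattern in R12 comes from is not established by this trace.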

Fernando

