Bug#925918: linux-image-amd64: linux-image-3.16.0-8-amd64 - unpredictable reboots / kernel panics?
Package: linux-image-amd64
Version: linux-image-3.16.0-8-amd64
Severity: important
Dear Maintainer,
after upgrading to the latest linux-image-amd64 on jessie we ran into several issues, which led us
to downgrade to linux-image-3.16.0-7-amd64 and remove linux-image-3.16.0-8-amd64. So far it has happened
on a Corosync/DRBD cluster: the standby node was upgraded, and after the upgrade the system
froze, see [1].
On a second system, a MySQL slave running this kernel, the machine rebooted after running for some time due to
a kernel panic. I wasn't fast enough to catch the panic message on the screen, as the VMware HA feature rebooted
the system immediately. Both systems run in a VMware HA cluster on different ESXi hosts.
So to me linux-image-3.16.0-8-amd64 smells fishy, and I was wondering whether other users are seeing
problems with it.
Cheers,
Werner
[1]
Mar 28 13:35:38 nfs02 kernel: [ 191.925130] block drbd0: helper command: /sbin/drbdadm after-resync-target minor-0
Mar 28 13:35:38 nfs02 kernel: [ 191.927389] block drbd0: helper command: /sbin/drbdadm after-resync-target minor-0 exit code 0 (0x0)
Mar 28 13:35:40 nfs02 kernel: [ 194.306334] drbd r0: Wrong magic value 0x00000000 in protocol version 101
Mar 28 13:35:40 nfs02 kernel: [ 194.306407] drbd r0: peer( Primary -> Unknown ) conn( Connected -> ProtocolError ) pdsk( UpToDate -> DUnknown )
Mar 28 13:35:40 nfs02 kernel: [ 194.306432] drbd r0: asender terminated
Mar 28 13:35:40 nfs02 kernel: [ 194.306436] drbd r0: Terminating drbd_a_r0
Mar 28 13:35:40 nfs02 kernel: [ 194.306828] drbd r0: Connection closed
Mar 28 13:35:40 nfs02 kernel: [ 194.306845] drbd r0: conn( ProtocolError -> Unconnected )
Mar 28 13:35:40 nfs02 kernel: [ 194.306847] drbd r0: receiver terminated
Mar 28 13:35:40 nfs02 kernel: [ 194.306848] drbd r0: Restarting receiver thread
Mar 28 13:35:40 nfs02 kernel: [ 194.306850] drbd r0: receiver (re)started
Mar 28 13:35:40 nfs02 kernel: [ 194.306860] drbd r0: conn( Unconnected -> WFConnection )
Mar 28 13:35:41 nfs02 kernel: [ 194.805238] drbd r0: Handshake successful: Agreed network protocol version 101
Mar 28 13:35:41 nfs02 kernel: [ 194.805243] drbd r0: Agreed to support TRIM on protocol level
Mar 28 13:35:41 nfs02 kernel: [ 194.805274] drbd r0: conn( WFConnection -> WFReportParams )
Mar 28 13:35:41 nfs02 kernel: [ 194.805277] drbd r0: Starting asender thread (from drbd_r_r0 [1367])
Mar 28 13:35:41 nfs02 kernel: [ 194.869215] block drbd0: drbd_sync_handshake:
Mar 28 13:35:41 nfs02 kernel: [ 194.869221] block drbd0: self E2641EEB9E133204:0000000000000000:0DD839919AA45372:0DD739919AA45373 bits:0 flags:0
Mar 28 13:35:41 nfs02 kernel: [ 194.869225] block drbd0: peer 14F96DC2D3D2E20D:E2641EEB9E133205:0DD839919AA45373:0DD739919AA45373 bits:23 flags:0
Mar 28 13:35:41 nfs02 kernel: [ 194.869228] block drbd0: uuid_compare()=-1 by rule 50
Mar 28 13:35:41 nfs02 kernel: [ 194.869236] block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) disk( UpToDate -> Outdated ) pdsk( DUnknown -> UpToDate )
Mar 28 13:35:41 nfs02 kernel: [ 194.876039] block drbd0: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 39(1), total 39; compression: 100.0%
Mar 28 13:35:41 nfs02 kernel: [ 194.882431] block drbd0: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 39(1), total 39; compression: 100.0%
Mar 28 13:35:41 nfs02 kernel: [ 194.882445] block drbd0: conn( WFBitMapT -> WFSyncUUID )
Mar 28 13:35:41 nfs02 kernel: [ 194.887016] block drbd0: updated sync uuid E2651EEB9E133204:0000000000000000:0DD839919AA45372:0DD739919AA45373
Mar 28 13:35:41 nfs02 kernel: [ 194.887489] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0
Mar 28 13:35:41 nfs02 kernel: [ 194.889641] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0 exit code 0 (0x0)
Mar 28 13:35:41 nfs02 kernel: [ 194.889656] block drbd0: conn( WFSyncUUID -> SyncTarget ) disk( Outdated -> Inconsistent )
Mar 28 13:35:41 nfs02 kernel: [ 194.889666] block drbd0: Began resync as SyncTarget (will sync 92 KB [23 bits set]).
Mar 28 13:35:41 nfs02 kernel: [ 194.900324] drbd r0: Wrong magic value 0x84a1785a in protocol version 101
Mar 28 13:35:41 nfs02 kernel: [ 194.900381] drbd r0: peer( Primary -> Unknown ) conn( SyncTarget -> ProtocolError ) pdsk( UpToDate -> DUnknown )
Mar 28 13:35:41 nfs02 kernel: [ 194.900392] drbd r0: asender terminated
Mar 28 13:35:41 nfs02 kernel: [ 194.900394] drbd r0: Terminating drbd_a_r0
Mar 28 13:35:41 nfs02 kernel: [ 194.911438] drbd r0: Connection closed
Mar 28 13:35:41 nfs02 kernel: [ 194.911456] drbd r0: conn( ProtocolError -> Unconnected )
Mar 28 13:35:41 nfs02 kernel: [ 194.911458] drbd r0: receiver terminated
Mar 28 13:35:41 nfs02 kernel: [ 194.911460] drbd r0: Restarting receiver thread
Mar 28 13:35:41 nfs02 kernel: [ 194.911461] drbd r0: receiver (re)started
Mar 28 13:35:41 nfs02 kernel: [ 194.911471] drbd r0: conn( Unconnected -> WFConnection )
Mar 28 13:35:41 nfs02 kernel: [ 195.409791] drbd r0: Handshake successful: Agreed network protocol version 101
Mar 28 13:35:41 nfs02 kernel: [ 195.409796] drbd r0: Agreed to support TRIM on protocol level
Mar 28 13:35:41 nfs02 kernel: [ 195.409849] drbd r0: conn( WFConnection -> WFReportParams )
Mar 28 13:35:41 nfs02 kernel: [ 195.409852] drbd r0: Starting asender thread (from drbd_r_r0 [1367])
Mar 28 13:35:41 nfs02 kernel: [ 195.429466] block drbd0: drbd_sync_handshake:
Mar 28 13:35:41 nfs02 kernel: [ 195.429473] block drbd0: self E2651EEB9E133204:0000000000000000:0DD839919AA45372:0DD739919AA45373 bits:21 flags:0
Mar 28 13:35:41 nfs02 kernel: [ 195.429477] block drbd0: peer 14F96DC2D3D2E20D:E2651EEB9E133205:E2641EEB9E133205:0DD839919AA45373 bits:23 flags:0
Mar 28 13:35:41 nfs02 kernel: [ 195.429480] block drbd0: uuid_compare()=-1 by rule 50
Mar 28 13:35:41 nfs02 kernel: [ 195.429482] block drbd0: Becoming sync target due to disk states.
Mar 28 13:35:41 nfs02 kernel: [ 195.429489] block drbd0: peer( Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) pdsk( DUnknown -> UpToDate )
Mar 28 13:35:41 nfs02 kernel: [ 195.452578] block drbd0: receive bitmap stats [Bytes(packets)]: plain 0(0), RLE 39(1), total 39; compression: 100.0%
Mar 28 13:35:41 nfs02 kernel: [ 195.459661] block drbd0: send bitmap stats [Bytes(packets)]: plain 0(0), RLE 39(1), total 39; compression: 100.0%
Mar 28 13:35:41 nfs02 kernel: [ 195.459677] block drbd0: conn( WFBitMapT -> WFSyncUUID )
Mar 28 13:35:41 nfs02 kernel: [ 195.465210] block drbd0: updated sync uuid E2661EEB9E133204:0000000000000000:0DD839919AA45372:0DD739919AA45373
Mar 28 13:35:41 nfs02 kernel: [ 195.465663] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0
Mar 28 13:35:41 nfs02 kernel: [ 195.467430] block drbd0: helper command: /sbin/drbdadm before-resync-target minor-0 exit code 0 (0x0)
Mar 28 13:35:41 nfs02 kernel: [ 195.467451] block drbd0: conn( WFSyncUUID -> SyncTarget )
Mar 28 13:35:41 nfs02 kernel: [ 195.467464] block drbd0: Began resync as SyncTarget (will sync 92 KB [23 bits set]).
Mar 28 13:35:41 nfs02 kernel: [ 195.486698] block drbd0: Resync done (total 1 sec; paused 0 sec; 92 K/sec)
Mar 28 13:35:41 nfs02 kernel: [ 195.486705] block drbd0: updated UUIDs 14F96DC2D3D2E20C:0000000000000000:E2661EEB9E133204:E2651EEB9E133205
Mar 28 13:35:41 nfs02 kernel: [ 195.486712] block drbd0: conn( SyncTarget -> Connected ) disk( Inconsistent -> UpToDate )
Mar 28 13:35:41 nfs02 kernel: [ 195.486980] block drbd0: helper command: /sbin/drbdadm after-resync-target minor-0
Mar 28 13:35:41 nfs02 kernel: [ 195.488776] block drbd0: helper command: /sbin/drbdadm after-resync-target minor-0 exit code 0 (0x0)
Mar 28 13:35:44 nfs02 kernel: [ 197.913875] BUG: unable to handle kernel NULL pointer dereference at (null)
Mar 28 13:35:44 nfs02 kernel: [ 197.913929] IP: [<ffffffff81157935>] put_page+0x5/0x30
Mar 28 13:35:44 nfs02 kernel: [ 197.913961] PGD 0
Mar 28 13:35:44 nfs02 kernel: [ 197.913975] Oops: 0000 [#1] SMP
Mar 28 13:35:44 nfs02 kernel: [ 197.913997] Modules linked in: binfmt_misc vmw_vsock_vmci_transport vsock crc32_pclmul aesni_intel vmw_balloon ppdev evdev aes_x86_64 lrw serio_raw pcspkr gf128mul glue_helper ablk_helper cryptd vmwgfx processor thermal_sys parport_pc parport shpchp ttm drm_kms_helper battery vmw_vmci drm ac button drbd lru_cache libcrc32c crc32c_generic autofs4 ext4 crc16 mbcache jbd2 dm_mod sr_mod cdrom sg ata_generic sd_mod crc_t10dif crct10dif_generic crct10dif_pclmul crct10dif_common crc32c_intel psmouse floppy ata_piix libata vmxnet3 vmw_pvscsi i2c_piix4 i2c_core scsi_mod
Mar 28 13:35:44 nfs02 kernel: [ 197.914358] CPU: 0 PID: 1367 Comm: drbd_r_r0 Not tainted 3.16.0-8-amd64 #1 Debian 3.16.64-1
We're wondering whether any other users have issues with this kernel release.
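For anyone who wants to check their own logs for the same pattern, here is a minimal sketch in Python. It is not something we run in production: the function name, the regular expression and the marker names are my own assumptions, and the message fragments are simply copied from the log in [1]. It only flags the DRBD "Wrong magic value" / ProtocolError lines and the NULL pointer oops.

```python
import re

# Kernel log line as written by syslog, e.g.
# "Mar 28 13:35:40 nfs02 kernel: [ 194.306334] drbd r0: Wrong magic value ..."
# Captures the kernel uptime stamp and the message text after it.
LINE_RE = re.compile(r"kernel: \[\s*(?P<uptime>\d+\.\d+)\]\s+(?P<msg>.*)$")

# Message fragments taken verbatim from the log in [1].
# The marker names on the left are hypothetical labels, not DRBD terms.
MARKERS = {
    "wrong_magic": "Wrong magic value",
    "protocol_error": "-> ProtocolError",
    "oops": "BUG: unable to handle kernel NULL pointer dereference",
}


def scan_kernel_log(lines):
    """Return (uptime_seconds, marker_name, message) for each matching line."""
    hits = []
    for line in lines:
        m = LINE_RE.search(line.rstrip("\n"))
        if not m:
            continue  # not a kernel line in the expected syslog format
        msg = m.group("msg")
        for name, fragment in MARKERS.items():
            if fragment in msg:
                hits.append((float(m.group("uptime")), name, msg))
    return hits
```

It takes any iterable of lines, so an open file handle on the kernel log works (on our systems that would be /var/log/kern.log, but that path is an assumption about the syslog setup). If the wrong_magic hits cluster shortly before the oops, as in [1], you may be looking at the same problem.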
-- System Information:
Debian Release: 8.11
APT prefers oldstable
APT policy: (500, 'oldstable')
Architecture: amd64 (x86_64)
Kernel: Linux 3.16.0-7-amd64 (SMP w/2 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8) (ignored: LC_ALL set to en_US.UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)