[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

general protection fault on vif50.1-q1-guest (Debian 8/Xen)



Hello,

I had recently a general protection fault on a Debian 8 server with
Xen (debian pacakge: 4.4.4lts4-0+deb8u1) on the vif50.1-q1-guest
kernel proces. I have copied the kernel log below in this mail for
reference. After this GPF the system was still responding but one domU
lost network connectivity and all the others where still working
properly. I decided to power-off and power-on the system as a soft GPF
renders the system in an unstable state.

Now I am trying to find out what is most likely the cause of this
general protection fault in order to avoid that again in the future
and would like your opinion on that:

- is this maybe a bug in the Debian kernel I am using?
- a bug in the Xen package used by Debian 8?
- a hardware issue?
- if it is a hardware issue, what is most likely? RAM? CPU?
- anything else I am missing?

Note that the hardware is enterprise grade hardware and that the BIOS
has been updated to the latest available version.The CPUs (dual CPU)
are Intel Xeon E5-2640 v3 @ 2.60GHz.

Thank you for your input.

Best regards,
John

[Wed May  6 14:48:02 2020] general protection fault: 0000 [#1] SMP
[Wed May  6 14:48:02 2020] Modules linked in: xt_physdev
iptable_filter ip_tables x_tables xen_netback xen_blkback hmac
binfmt_misc xen_gntdev xen_evtchn xenfs xen_privcmd nfsd auth_rpcgss
oid_registry nfs_acl nfs lockd fscache sunrpc bridge bonding iTCO_wdt
iTCO_vendor_support mxm_wmi zfs(PO) zunicode(PO) x86_pkg_temp_thermal
intel_powerclamp zcommon(PO) intel_rapl znvpair(PO) spl(O) coretemp
crc32_pclmul zavl(PO) aesni_intel pcspkr aes_x86_64 lrw gf128mul
glue_helper ablk_helper cryptd ast ttm drm_kms_helper evdev joydev drm
lpc_ich mfd_core i2c_algo_bit mei_me mei shpchp tpm_tis tpm ipmi_si
ipmi_msghandler wmi acpi_power_meter processor thermal_sys button
8021q garp stp mrp llc drbd lru_cache libcrc32c crc32c_generic autofs4
ext4 crc16 mbcache jbd2 dm_mod raid1 md_mod mlx4_en vxlan xen_blkfront
ptp pps_core
[Wed May  6 14:48:02 2020]  hid_generic usbhid hid sg sd_mod
crc_t10dif crct10dif_generic ahci libahci crct10dif_pclmul
crct10dif_common crc32c_intel ehci_pci ehci_hcd mlx4_core libata
i2c_i801 i2c_core usbcore usb_common scsi_mod nvme
[Wed May  6 14:48:02 2020] CPU: 0 PID: 8305 Comm: vif50.1-q1-gues
Tainted: P           O  3.16.0-10-amd64 #1 Debian 3.16.72-1
[Wed May  6 14:48:02 2020] Hardware name: Quanta Computer Inc
QuantaPlex T41S-2U/S2S-MB, BIOS S2S_3B12 05/30/2019
[Wed May  6 14:48:02 2020] task: ffff88003c9f95d0 ti: ffff88004a3ac000
task.ti: ffff88004a3ac000
[Wed May  6 14:48:02 2020] RIP: e030:[<ffffffffa08fcaa2>]
[<ffffffffa08fcaa2>] xenvif_gop_frag_copy+0x22/0x3b0 [xen_netback]
[Wed May  6 14:48:02 2020] RSP: e02b:ffff88004a3afd98  EFLAGS: 00010282
[Wed May  6 14:48:02 2020] RAX: 0000000000001000 RBX: ffff8802e0841800
RCX: 7aec7d18f3f45689
[Wed May  6 14:48:02 2020] RDX: ffff88004a3afe80 RSI: ffff8802e0841800
RDI: 0000000111f703b7
[Wed May  6 14:48:02 2020] RBP: ffffc9002332c258 R08: 000000005ff8d9a9
R09: 00000000b1fe2a0e
[Wed May  6 14:48:02 2020] R10: ffff880000000000 R11: 0000000000000002
R12: 7aec7d18f3f45689
[Wed May  6 14:48:02 2020] R13: ffffc9002332c258 R14: ffff88004a3afe54
R15: 0000000000000001
[Wed May  6 14:48:02 2020] FS:  0000000000000000(0000)
GS:ffff880484000000(0000) knlGS:ffff880484000000
[Wed May  6 14:48:02 2020] CS:  e033 DS: 0000 ES: 0000 CR0: 0000000080050033
[Wed May  6 14:48:02 2020] CR2: 00007f49c8679000 CR3: 0000000074855000
CR4: 0000000000042660
[Wed May  6 14:48:02 2020] Stack:
[Wed May  6 14:48:02 2020]  0000000058f6d400 ffffc90023336c08
00000000000002c0 ffff8802e0841800
[Wed May  6 14:48:02 2020]  ffff88004a3afe80 0000000000000080
ffff8802e0841800 ffffc9002332c258
[Wed May  6 14:48:02 2020]  79eb3472cad61644 0000000000000028
ffff88004a3afe54 0000000000000001
[Wed May  6 14:48:02 2020] Call Trace:
[Wed May  6 14:48:02 2020]  [<ffffffffa08ff2c9>] ?
xenvif_kthread_guest_rx+0x549/0xce0 [xen_netback]
[Wed May  6 14:48:02 2020]  [<ffffffffa08fed80>] ?
xenvif_map_frontend_rings+0xd0/0xd0 [xen_netback]
[Wed May  6 14:48:02 2020]  [<ffffffff810905d1>] ? kthread+0xd1/0xf0
[Wed May  6 14:48:02 2020]  [<ffffffff8153be8f>] ? __schedule+0x22f/0x750
[Wed May  6 14:48:02 2020]  [<ffffffff81090500>] ?
kthread_create_on_node+0x1b0/0x1b0
[Wed May  6 14:48:02 2020]  [<ffffffff8154030e>] ? ret_from_fork+0x6e/0xa0
[Wed May  6 14:48:02 2020]  [<ffffffff81090500>] ?
kthread_create_on_node+0x1b0/0x1b0
[Wed May  6 14:48:02 2020] Code: 2e 0f 1f 84 00 00 00 00 00 0f 1f 44
00 00 41 57 41 56 b8 00 10 00 00 41 55 41 54 49 89 cc 55 53 49 89 fd
4b 8d 3c 08 48 83 ec 30 <48> 8b 09 4c 8b 74 24 68 4c 8b 7c 24 70 80 e5
40 74 08 49 8b 4c
[Wed May  6 14:48:02 2020] RIP  [<ffffffffa08fcaa2>]
xenvif_gop_frag_copy+0x22/0x3b0 [xen_netback]
[Wed May  6 14:48:02 2020]  RSP <ffff88004a3afd98>
[Wed May  6 14:48:33 2020] ---[ end trace 4fb039a0de2de66f ]---


Reply to: