[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: linux: instability on arm64 MP30-AR1 servers



Control: found -1 4.19.28-2

On Wed, May 22, 2019 at 11:58:15 +0200, Julien Cristau wrote:

> Source: linux
> Version: 4.9.168-1
> Severity: important
> X-Debbugs-Cc: debian-arm@lists.debian.org, debian-admin@lists.debian.org
> User: debian-admin@lists.debian.org
> Usertags: needed-by-DSA-Team
> 
> Hi,
> 
> ever since the 9.9 point release conova-node01.debian.org and
> conova-node02.debian.org have been unstable.  They run for an hour or
> three, and then things go bad.  Rebooting back to 4.9.144-3.1 makes them
> stable again.
> 
Still happening after upgrading to the stretch-backports kernel:

[87461.376828] Bad mode in FIQ handler detected on CPU0, code 0x56000000 -- SVC (AArch64)
[87461.376834] Internal error: Oops - bad mode: 0 [#1] SMP
[87461.389907] Modules linked in: openvswitch nsh nf_nat_ipv6 nf_nat_ipv4 nf_conncount nf_nat binfmt_misc nls_ascii nls_cp437 vfat fat dm_mod ip6t_REJECT nf_reject_ipv6 ip6table_filter ip6_tables ipt_REJECT nf_reject_ipv4 nfnetlink_log nfnetlink xt_NFLOG xt_tcpudp xt_hashlimit xt_multiport xt_conntrack efi_pstore nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 iptable_filter ast ttm drm_kms_helper drm xgene_hwmon i2c_algo_bit xgene_edac xgene_dma joydev evdev chaoskey sg xgene_rng mailbox_xgene_slimpro rng_core ipmi_ssif ipmi_devintf ipmi_msghandler efivars tun drbd lru_cache efivarfs ip_tables x_tables autofs4 ext4 crc16 mbcache jbd2 fscrypto raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx hid_generic usbhid hid xor raid6_pq crc32c_generic libcrc32c raid0 multipath linear raid1
[87461.460161]  md_mod sd_mod ahci_xgene libahci_platform libahci xhci_plat_hcd xgene_enet libata xhci_hcd i2c_xgene_slimpro marvell usbcore phy_xgene scsi_mod sdhci_of_arasan mdio_xgene sdhci_pltfm of_mdio cqhci fixed_phy sdhci libphy usb_common gpio_xgene_sb
[87461.482839] CPU: 0 PID: 1557 Comm: ovsdb-server Not tainted 4.19.0-0.bpo.4-arm64 #1 Debian 4.19.28-2~bpo9+1
[87461.492528] Hardware name: GIGABYTE R120-P31/MP30-AR1, BIOS D7b 08/26/2016
[87461.499367] pstate: 00000000 (nzcv daif -PAN -UAO)
[87461.504132] pc : 0000ffff897e2910
[87461.507427] lr : 0000ffff897e2918
[87461.510722] sp : 0000ffffe32d4440
[87461.514016] x29: 0000ffffe32d4440 x28: 000000000000015a 
[87461.519301] x27: 0000ffff89928c20 x26: 0000000000000000 
[87461.524586] x25: 0000ffffe32d44f8 x24: 0000ffffe32d4528 
[87461.529870] x23: 000000000000015a x22: 0000000000000090 
[87461.535154] x21: 0000aaaad73fd286 x20: 0000000000000001 
[87461.540439] x19: 0000aaaad743b560 x18: 0000000000000024 
[87461.545723] x17: 0000ffff897d7fc0 x16: 0000ffff899238e0 
[87461.551007] x15: 0000089e8439a422 x14: 0000000000000001 
[87461.556291] x13: 000000005ce6a4fa x12: 0000000000000018 
[87461.561576] x11: 0000000026295eb7 x10: 00000000000155a6 
[87461.566860] x9 : 0000aaaad741f300 x8 : 0000000000000000 
[87461.572144] x7 : 0000000000000010 x6 : 0000000000000000 
[87461.577429] x5 : 0000ffffe32d42b8 x4 : 0000aaaad73f0410 
[87461.582713] x3 : 0000aaaad743b568 x2 : 0000aaaad7403a20 
[87461.587997] x1 : 0000000000000001 x0 : 0000aaaad743b560 
[87461.593283] Process ovsdb-server (pid: 1557, stack limit = 0x0000000003b97138)
[87461.600468] ---[ end trace 2ab4838ec3817e8e ]---
[87461.606271] Bad mode in FIQ handler detected on CPU0, code 0x56000000 -- SVC (AArch64)
[87482.616230] rcu: INFO: rcu_sched detected stalls on CPUs/tasks:
[87482.622133] rcu:     0-...0: (1 GPs behind) idle=9a6/1/0x4000000000000000 softirq=1153372/1153372 fqs=2456 
[87482.631564] rcu:     (detected by 4, t=5255 jiffies, g=6202645, q=14630)
[87482.637973] Task dump for CPU 0:
[87482.641182] ovsdb-server    R  running task        0  1557   1556 0x00000002
[87482.648197] Call trace:
[87482.650636]  __switch_to+0x8c/0xd0
[87482.654018]            (null)

Cheers,
Julien


Reply to: