
Bug#1060706: linux-image-6.1.0-17-amd64: intel i225 NIC loses PCIe link, network becomes unusable



Here is another instance. This time I remembered to capture the kernel messages from an attempted reload of the igc module.




[Fr Feb 9 13:25:08 2024] igc 0000:0b:00.0 eno1: PCIe link lost, device now detached
[Fr Feb  9 13:25:08 2024] ------------[ cut here ]------------
[Fr Feb  9 13:25:08 2024] igc: Failed to read reg 0xc030!
[Fr Feb  9 13:25:08 2024] WARNING: CPU: 20 PID: 84300 at drivers/net/ethernet/intel/igc/igc_main.c:6583 igc_rd32+0x8d/0xa0 [igc]
[Fr Feb  9 13:25:08 2024] Modules linked in: exfat rfcomm cpufreq_userspace cpufreq_powersave cpufreq_ondemand cpufreq_conservative nfsv3 nfs_acl rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs qrtr overlay cmac algif_hash algif_skcipher af_alg bnep sunrpc binfmt_misc nls_ascii nls_cp437 vfat fat ext4 mbcache jbd2 intel_rapl_msr intel_rapl_common btusb btrtl btbcm btintel btmtk bluetooth snd_hda_codec_hdmi mt7921e mt7921_common edac_mce_amd mt76_connac_lib snd_hda_intel uvcvideo snd_intel_dspcfg mt76 sha3_generic snd_intel_sdw_acpi videobuf2_vmalloc snd_usb_audio kvm_amd snd_hda_codec jitterentropy_rng uvc snd_usbmidi_lib videobuf2_memops mac80211 snd_hda_core drbg videobuf2_v4l2 libarc4 snd_rawmidi eeepc_wmi asus_nb_wmi kvm videodev snd_hwdep snd_seq_device ansi_cprng asus_wmi cfg80211 snd_pcm videobuf2_common battery irqbypass ecdh_generic ledtrig_audio ecc sparse_keymap sp5100_tco mc crc16 ccp snd_timer platform_profile rapl wmi_bmof watchdog pcspkr k10temp snd rfkill soundcore joydev sg evdev msr
[Fr Feb  9 13:25:08 2024]  parport_pc ppdev lp parport fuse loop efi_pstore configfs efivarfs ip_tables x_tables autofs4 xfs libcrc32c crc32c_generic sd_mod dm_crypt dm_mod uas usb_storage hid_generic amdgpu amdxcp drm_buddy gpu_sched i2c_algo_bit drm_suballoc_helper usbhid drm_display_helper hid sr_mod cec cdrom rc_core drm_ttm_helper ttm crc32_pclmul crc32c_intel drm_kms_helper ghash_clmulni_intel ahci sha512_ssse3 libahci xhci_pci sha512_generic libata xhci_hcd nvme drm nvme_core aesni_intel scsi_mod t10_pi usbcore crypto_simd igc cryptd crc64_rocksoft_generic i2c_piix4 crc64_rocksoft crc_t10dif crct10dif_generic crct10dif_pclmul scsi_common crc64 crct10dif_common usb_common video wmi gpio_amdpt gpio_generic button
[Fr Feb  9 13:25:08 2024] CPU: 20 PID: 84300 Comm: kworker/20:0 Not tainted 6.5.0-0.deb12.4-amd64 #1 Debian 6.5.10-1~bpo12+1
[Fr Feb  9 13:25:08 2024] Hardware name: ASUS System Product Name/ROG STRIX X670E-A GAMING WIFI, BIOS 1904 01/29/2024
[Fr Feb  9 13:25:08 2024] Workqueue: events igc_watchdog_task [igc]
[Fr Feb  9 13:25:08 2024] RIP: 0010:igc_rd32+0x8d/0xa0 [igc]
[Fr Feb 9 13:25:08 2024] Code: 48 c7 c6 10 36 3a c0 e8 81 aa dd e6 48 8b bb 28 ff ff ff e8 05 12 b4 e6 84 c0 74 bc 89 ee 48 c7 c7 38 36 3a c0 e8 c3 2e 53 e6 <0f> 0b eb aa b8 ff ff ff ff e9 15 0f 04 e7 0f 1f 44 00 00 90 90 90
[Fr Feb  9 13:25:08 2024] RSP: 0018:ffffb034cc61bdd8 EFLAGS: 00010282
[Fr Feb  9 13:25:08 2024] RAX: 0000000000000000 RBX: ffff97078f882cb8 RCX: 0000000000000027
[Fr Feb  9 13:25:08 2024] RDX: ffff97169e7213c8 RSI: 0000000000000001 RDI: ffff97169e7213c0
[Fr Feb  9 13:25:08 2024] RBP: 000000000000c030 R08: 0000000000000000 R09: ffffb034cc61bc68
[Fr Feb  9 13:25:08 2024] R10: 0000000000000003 R11: ffff9716dde3ac28 R12: ffff97078f882000
[Fr Feb  9 13:25:08 2024] R13: 0000000000000000 R14: ffff970784592d40 R15: 000000000000c030
[Fr Feb  9 13:25:08 2024] FS:  0000000000000000(0000) GS:ffff97169e700000(0000) knlGS:0000000000000000
[Fr Feb  9 13:25:08 2024] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Fr Feb 9 13:25:08 2024] CR2: 00007f5271155f80 CR3: 0000000434bc6000 CR4: 0000000000750ee0
[Fr Feb  9 13:25:08 2024] PKRU: 55555554
[Fr Feb  9 13:25:08 2024] Call Trace:
[Fr Feb  9 13:25:08 2024]  <TASK>
[Fr Feb  9 13:25:08 2024]  ? igc_rd32+0x8d/0xa0 [igc]
[Fr Feb  9 13:25:08 2024]  ? __warn+0x81/0x130
[Fr Feb  9 13:25:08 2024]  ? igc_rd32+0x8d/0xa0 [igc]
[Fr Feb  9 13:25:08 2024]  ? report_bug+0x171/0x1a0
[Fr Feb  9 13:25:08 2024]  ? srso_alias_return_thunk+0x5/0x7f
[Fr Feb  9 13:25:08 2024]  ? prb_read_valid+0x1b/0x30
[Fr Feb  9 13:25:08 2024]  ? handle_bug+0x41/0x70
[Fr Feb  9 13:25:08 2024]  ? exc_invalid_op+0x17/0x70
[Fr Feb  9 13:25:08 2024]  ? asm_exc_invalid_op+0x1a/0x20
[Fr Feb  9 13:25:08 2024]  ? igc_rd32+0x8d/0xa0 [igc]
[Fr Feb  9 13:25:08 2024]  ? igc_rd32+0x8d/0xa0 [igc]
[Fr Feb  9 13:25:08 2024]  igc_update_stats+0x8a/0x6d0 [igc]
[Fr Feb  9 13:25:08 2024]  igc_watchdog_task+0x9d/0x4a0 [igc]
[Fr Feb  9 13:25:08 2024]  process_one_work+0x1df/0x3e0
[Fr Feb  9 13:25:08 2024]  worker_thread+0x51/0x390
[Fr Feb  9 13:25:08 2024]  ? __pfx_worker_thread+0x10/0x10
[Fr Feb  9 13:25:08 2024]  kthread+0xe5/0x120
[Fr Feb  9 13:25:08 2024]  ? __pfx_kthread+0x10/0x10
[Fr Feb  9 13:25:08 2024]  ret_from_fork+0x31/0x50
[Fr Feb  9 13:25:08 2024]  ? __pfx_kthread+0x10/0x10
[Fr Feb  9 13:25:08 2024]  ret_from_fork_asm+0x1b/0x30
[Fr Feb  9 13:25:08 2024]  </TASK>
[Fr Feb  9 13:25:08 2024] ---[ end trace 0000000000000000 ]---

A subsequent rmmod igc && modprobe igc then produced:

[Fr Feb  9 13:27:09 2024] igc 0000:0b:00.0 eno1: PHC removed
[Fr Feb  9 13:27:17 2024] Intel(R) 2.5G Ethernet Linux Driver
[Fr Feb  9 13:27:17 2024] Copyright(c) 2018 Intel Corporation.
[Fr Feb  9 13:27:17 2024] igc 0000:0b:00.0: enabling device (0000 -> 0002)
[Fr Feb  9 13:27:17 2024] igc 0000:0b:00.0: PCIe PTM not supported by PCIe bus/controller
[Fr Feb  9 13:27:17 2024] igc 0000:0b:00.0 (unnamed net_device) (uninitialized): PCIe link lost, device now detached
[Fr Feb  9 13:27:17 2024] ------------[ cut here ]------------
[Fr Feb  9 13:27:17 2024] igc: Failed to read reg 0x10!
[Fr Feb  9 13:27:17 2024] WARNING: CPU: 3 PID: 84566 at drivers/net/ethernet/intel/igc/igc_main.c:6583 igc_rd32+0x8d/0xa0 [igc]
[Fr Feb  9 13:27:17 2024] Modules linked in: igc(+) exfat rfcomm cpufreq_userspace cpufreq_powersave cpufreq_ondemand cpufreq_conservative nfsv3 nfs_acl rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs qrtr overlay cmac algif_hash algif_skcipher af_alg bnep sunrpc binfmt_misc nls_ascii nls_cp437 vfat fat ext4 mbcache jbd2 intel_rapl_msr intel_rapl_common btusb btrtl btbcm btintel btmtk bluetooth snd_hda_codec_hdmi mt7921e mt7921_common edac_mce_amd mt76_connac_lib snd_hda_intel uvcvideo snd_intel_dspcfg mt76 sha3_generic snd_intel_sdw_acpi videobuf2_vmalloc snd_usb_audio kvm_amd snd_hda_codec jitterentropy_rng uvc snd_usbmidi_lib videobuf2_memops mac80211 snd_hda_core drbg videobuf2_v4l2 libarc4 snd_rawmidi eeepc_wmi asus_nb_wmi kvm videodev snd_hwdep snd_seq_device ansi_cprng asus_wmi cfg80211 snd_pcm videobuf2_common battery irqbypass ecdh_generic ledtrig_audio ecc sparse_keymap sp5100_tco mc crc16 ccp snd_timer platform_profile rapl wmi_bmof watchdog pcspkr k10temp snd rfkill soundcore joydev sg evdev
[Fr Feb  9 13:27:17 2024]  msr parport_pc ppdev lp parport fuse loop efi_pstore configfs efivarfs ip_tables x_tables autofs4 xfs libcrc32c crc32c_generic sd_mod dm_crypt dm_mod uas usb_storage hid_generic amdgpu amdxcp drm_buddy gpu_sched i2c_algo_bit drm_suballoc_helper usbhid drm_display_helper hid sr_mod cec cdrom rc_core drm_ttm_helper ttm crc32_pclmul crc32c_intel drm_kms_helper ghash_clmulni_intel ahci sha512_ssse3 libahci xhci_pci sha512_generic libata xhci_hcd nvme drm nvme_core aesni_intel scsi_mod t10_pi usbcore crypto_simd cryptd crc64_rocksoft_generic i2c_piix4 crc64_rocksoft crc_t10dif crct10dif_generic crct10dif_pclmul scsi_common crc64 crct10dif_common usb_common video wmi gpio_amdpt gpio_generic button [last unloaded: igc]
[Fr Feb  9 13:27:17 2024] CPU: 3 PID: 84566 Comm: modprobe Tainted: G W 6.5.0-0.deb12.4-amd64 #1 Debian 6.5.10-1~bpo12+1
[Fr Feb  9 13:27:17 2024] Hardware name: ASUS System Product Name/ROG STRIX X670E-A GAMING WIFI, BIOS 1904 01/29/2024
[Fr Feb  9 13:27:17 2024] RIP: 0010:igc_rd32+0x8d/0xa0 [igc]
[Fr Feb 9 13:27:17 2024] Code: 48 c7 c6 10 36 3a c0 e8 81 aa dd e6 48 8b bb 28 ff ff ff e8 05 12 b4 e6 84 c0 74 bc 89 ee 48 c7 c7 38 36 3a c0 e8 c3 2e 53 e6 <0f> 0b eb aa b8 ff ff ff ff e9 15 0f 04 e7 0f 1f 44 00 00 90 90 90
[Fr Feb  9 13:27:17 2024] RSP: 0018:ffffb034ccb2baa0 EFLAGS: 00010286
[Fr Feb  9 13:27:17 2024] RAX: 0000000000000000 RBX: ffff9707a086ecb8 RCX: 0000000000000027
[Fr Feb  9 13:27:17 2024] RDX: ffff97169e2e13c8 RSI: 0000000000000001 RDI: ffff97169e2e13c0
[Fr Feb  9 13:27:17 2024] RBP: 0000000000000010 R08: 0000000000000000 R09: ffffb034ccb2b930
[Fr Feb  9 13:27:17 2024] R10: 0000000000000003 R11: ffff9716dde3ac28 R12: ffff9707a086e000
[Fr Feb  9 13:27:17 2024] R13: ffff9707a086e9c0 R14: ffff9707a086e000 R15: ffff9707a086ecb8
[Fr Feb  9 13:27:17 2024] FS:  00007f709389c040(0000) GS:ffff97169e2c0000(0000) knlGS:0000000000000000
[Fr Feb  9 13:27:17 2024] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[Fr Feb 9 13:27:17 2024] CR2: 0000559154b5b188 CR3: 00000004e4972000 CR4: 0000000000750ee0
[Fr Feb  9 13:27:17 2024] PKRU: 55555554
[Fr Feb  9 13:27:17 2024] Call Trace:
[Fr Feb  9 13:27:17 2024]  <TASK>
[Fr Feb  9 13:27:17 2024]  ? igc_rd32+0x8d/0xa0 [igc]
[Fr Feb  9 13:27:17 2024]  ? __warn+0x81/0x130
[Fr Feb  9 13:27:17 2024]  ? igc_rd32+0x8d/0xa0 [igc]
[Fr Feb  9 13:27:17 2024]  ? report_bug+0x171/0x1a0
[Fr Feb  9 13:27:17 2024]  ? prb_read_valid+0x1b/0x30
[Fr Feb  9 13:27:17 2024]  ? srso_alias_return_thunk+0x5/0x7f
[Fr Feb  9 13:27:17 2024]  ? handle_bug+0x41/0x70
[Fr Feb  9 13:27:17 2024]  ? exc_invalid_op+0x17/0x70
[Fr Feb  9 13:27:17 2024]  ? asm_exc_invalid_op+0x1a/0x20
[Fr Feb  9 13:27:17 2024]  ? igc_rd32+0x8d/0xa0 [igc]
[Fr Feb  9 13:27:17 2024]  igc_get_invariants_base+0xb9/0x260 [igc]
[Fr Feb  9 13:27:17 2024]  igc_probe+0x2ed/0x970 [igc]
[Fr Feb  9 13:27:17 2024]  local_pci_probe+0x42/0xa0
[Fr Feb  9 13:27:17 2024]  pci_device_probe+0xc7/0x240
[Fr Feb  9 13:27:17 2024]  really_probe+0x19f/0x400
[Fr Feb  9 13:27:17 2024]  ? __pfx___driver_attach+0x10/0x10
[Fr Feb  9 13:27:17 2024]  __driver_probe_device+0x78/0x160
[Fr Feb  9 13:27:17 2024]  driver_probe_device+0x1f/0x90
[Fr Feb  9 13:27:17 2024]  __driver_attach+0xd2/0x1c0
[Fr Feb  9 13:27:17 2024]  bus_for_each_dev+0x85/0xd0
[Fr Feb  9 13:27:17 2024]  bus_add_driver+0x116/0x220
[Fr Feb  9 13:27:17 2024]  driver_register+0x59/0x100
[Fr Feb  9 13:27:17 2024]  ? __pfx_igc_init_module+0x10/0x10 [igc]
[Fr Feb  9 13:27:17 2024]  do_one_initcall+0x5a/0x320
[Fr Feb  9 13:27:17 2024]  do_init_module+0x60/0x240
[Fr Feb  9 13:27:17 2024]  init_module_from_file+0x86/0xc0
[Fr Feb  9 13:27:17 2024]  idempotent_init_module+0x120/0x2b0
[Fr Feb  9 13:27:17 2024]  __x64_sys_finit_module+0x5e/0xb0
[Fr Feb  9 13:27:17 2024]  do_syscall_64+0x5c/0xc0
[Fr Feb  9 13:27:17 2024]  ? srso_alias_return_thunk+0x5/0x7f
[Fr Feb  9 13:27:17 2024]  ? ksys_mmap_pgoff+0xec/0x1f0
[Fr Feb  9 13:27:17 2024]  ? srso_alias_return_thunk+0x5/0x7f
[Fr Feb  9 13:27:17 2024]  ? exit_to_user_mode_prepare+0x40/0x1e0
[Fr Feb  9 13:27:17 2024]  ? srso_alias_return_thunk+0x5/0x7f
[Fr Feb  9 13:27:17 2024]  ? syscall_exit_to_user_mode+0x2b/0x40
[Fr Feb  9 13:27:17 2024]  ? srso_alias_return_thunk+0x5/0x7f
[Fr Feb  9 13:27:17 2024]  ? do_syscall_64+0x6b/0xc0
[Fr Feb  9 13:27:17 2024]  ? do_syscall_64+0x6b/0xc0
[Fr Feb  9 13:27:17 2024]  ? srso_alias_return_thunk+0x5/0x7f
[Fr Feb  9 13:27:17 2024]  ? exit_to_user_mode_prepare+0x40/0x1e0
[Fr Feb  9 13:27:17 2024]  entry_SYSCALL_64_after_hwframe+0x6e/0xd8
[Fr Feb  9 13:27:17 2024] RIP: 0033:0x7f709399e719
[Fr Feb  9 13:27:17 2024] Code: 08 89 e8 5b 5d c3 66 2e 0f 1f 84 00 00 00 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d b7 06 0d 00 f7 d8 64 89 01 48
[Fr Feb  9 13:27:17 2024] RSP: 002b:00007ffd5ebd1f78 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
[Fr Feb  9 13:27:17 2024] RAX: ffffffffffffffda RBX: 0000563800fbbc30 RCX: 00007f709399e719
[Fr Feb  9 13:27:17 2024] RDX: 0000000000000000 RSI: 00005637fff544a0 RDI: 0000000000000003
[Fr Feb  9 13:27:17 2024] RBP: 00005637fff544a0 R08: 0000000000000000 R09: 0000563800fbe650
[Fr Feb  9 13:27:17 2024] R10: 0000000000000003 R11: 0000000000000246 R12: 0000000000040000
[Fr Feb  9 13:27:17 2024] R13: 0000000000000000 R14: 0000563800fbbdc0 R15: 0000000000000000
[Fr Feb  9 13:27:17 2024]  </TASK>
[Fr Feb  9 13:27:17 2024] ---[ end trace 0000000000000000 ]---
[Fr Feb  9 13:27:57 2024] igc: probe of 0000:0b:00.0 failed with error -13
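
For reference, the -13 in the last line is a negated Linux errno value. The following is only a minimal sketch for looking up its symbolic name, assuming it is run on a Linux system; the helper name describe_kernel_error is made up for illustration and is not part of any kernel tooling.

```python
import errno
import os


def describe_kernel_error(code: int) -> str:
    """Return 'NAME (message)' for a negative kernel error code such as -13."""
    number = abs(code)  # kernel drivers report errors as negated errno values
    name = errno.errorcode.get(number, "unknown")
    return f"{name} ({os.strerror(number)})"


# The value reported by "probe of 0000:0b:00.0 failed with error -13"
print(describe_kernel_error(-13))
```

On Linux this resolves to EACCES. It only names the code the probe returned and says nothing about why the PCIe link was lost in the first place.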


Can anybody suggest what further information I could provide to help track this down?

Thanks,

Arno

--
Arno Lehmann

IT-Service Lehmann
Sandstr. 6, 49080 Osnabrück

