[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#897893: marked as done (Strange kernel panics on linux-image-4.9.0-6-amd64 with mlx4_en driver)



Your message dated Sun, 23 May 2021 06:28:18 -0700 (PDT)
with message-id <60aa5872.1c69fb81.ab85d.13e3@mx.google.com>
and subject line Closing this bug (BTS maintenance for src:linux bugs)
has caused the Debian Bug report #897893,
regarding Strange kernel panics on linux-image-4.9.0-6-amd64 with mlx4_en driver
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@bugs.debian.org
immediately.)


-- 
897893: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=897893
Debian Bug Tracking System
Contact owner@bugs.debian.org with problems
--- Begin Message ---
Package: linux-image-4.9.0-6-amd64
Version: 4.9.82-1+deb9u3

Hi!

Here's a short problem description. 

We have some Supermicro servers with the same configuration for all machines (hardware, kernels, packages, etc). A month ago, or maybe a bit later, all of these machines began crashing into kernel panic. I can't find any pattern of failure at all. But it happens very often. Some machines may drop into kernel panic a couple times a day! But usually machines crash about every 3 to 6 days. All of these machines have intensive network and i/o operations.

I saved dmesg log from one of these machines after the crash (see the attachment). 

As far as I see, every machine probably has problems with mlx4_en or GRO. Also I see list_add double add => list_del corruption. Can I do anything to get more detailed logs? What additional information do you need for better problem diagnostics?



---

С уважением,

Буданов Евгений.

Системный администратор

Компания «Рестрим»

Attachment: dmesg.log
Description: Binary data

Attachment: lspci
Description: Binary data


--- End Message ---
--- Begin Message ---
Hi

This bug was filed for a very old kernel or the bug is old itself
without resolution.

If you can reproduce it with

- the current version in unstable/testing
- the latest kernel from backports

please reopen the bug, see https://www.debian.org/Bugs/server-control
for details.

Regards,
Salvatore

--- End Message ---

Reply to: