
Bug#618341: marked as done (bnx2: MSI-X hang on 5709 NICs -> nic dead)



Your message dated Wed, 16 Mar 2011 03:34:14 +0000
with message-id <1300246454.3962.47.camel@localhost>
and subject line Re: Bug#618341: bnx2: MSI-X hang on 5709 NICs -> nic dead
has caused the Debian Bug report #618341,
regarding bnx2: MSI-X hang on 5709 NICs -> nic dead
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@bugs.debian.org
immediately.)


-- 
618341: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=618341
Debian Bug Tracking System
Contact owner@bugs.debian.org with problems
--- Begin Message ---
Package: linux-2.6
Version: 2.6.32-bpo.5
Severity: critical
Tags: squeeze patch lenny
Justification: breaks the whole system





http://marc.info/?l=linux-netdev&m=127240304211909&w=2

Here is the description from that mail, repeated:

It has been reported that under certain heavy traffic conditions in MSI-X
mode, the driver can lose an MSI-X vector causing all packets in the
associated rx/tx ring pair to be dropped.  The problem is caused by
the chip dropping the write to unmask the MSI-X vector by the kernel
(when migrating the IRQ for example).

This can be prevented by increasing the GRC timeout value for these
register read and write operations.

Thanks to Dell for helping us debug this problem.

Signed-off-by: Michael Chan <mchan@broadcom.com>



We are using the 2.6.32 kernel and are bitten by this frequently.



 drivers/net/bnx2.c |    6 +++++-
 1 files changed, 5 insertions(+), 1 deletions(-)

diff --git a/drivers/net/bnx2.c b/drivers/net/bnx2.c
index a257bab..4c1e51e 100644
--- a/drivers/net/bnx2.c
+++ b/drivers/net/bnx2.c
@@ -4759,8 +4759,12 @@ bnx2_reset_chip(struct bnx2 *bp, u32 reset_code)
 		rc = bnx2_alloc_bad_rbuf(bp);
 	}
 
-	if (bp->flags & BNX2_FLAG_USING_MSIX)
+	if (bp->flags & BNX2_FLAG_USING_MSIX) {
 		bnx2_setup_msix_tbl(bp);
+		/* Prevent MSIX table reads and write from timing out */
+		REG_WR(bp, BNX2_MISC_ECO_HW_CTL,
+			BNX2_MISC_ECO_HW_CTL_LARGE_GRC_TMOUT_EN);
+	}
 
 	return rc;
 }
-- 
1.6.4.GIT


-- System Information:
Debian Release: 6.0
  APT prefers proposed-updates
  APT policy: (500, 'proposed-updates'), (500, 'stable'), (1, 'experimental')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.37-1-amd64 (SMP w/2 CPU cores)
Locale: LANG=de_DE.UTF-8, LC_CTYPE=de_DE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash



--- End Message ---
--- Begin Message ---
On Mon, 2011-03-14 at 14:57 +0100, Andreas Schuldei wrote:
> Package: linux-2.6
> Version: 2.6.32-bpo.5
> Severity: critical
> Tags: squeeze patch lenny
> Justification: breaks the whole system
> 
> 
> 
> 
> 
> http://marc.info/?l=linux-netdev&m=127240304211909&w=2
[...]

This was already included in stable version 2.6.32.13 and Debian's
version 2.6.32-13.  If there is still a problem with later versions, it
must be something different.  In that case, please make a bug report
against a binary package name rather than inventing a version number for
the source package.

Ben.

-- 
Ben Hutchings
Once a job is fouled up, anything done to improve it makes it worse.



--- End Message ---
