
Bug#451326: marked as done (linux-image-2.6.18-5-686: Lock-ups accessing drives with Promise SX8 S-ATA card)



Your message dated Wed, 2 Jul 2008 15:32:36 +0200
with message-id <20080702133236.GC25863@stro.at>
and subject line Re: linux-image-2.6.18-5-686: Lock-ups accessing drives with Promise SX8 S-ATA card
has caused the Debian Bug report #451326,
regarding linux-image-2.6.18-5-686: Lock-ups accessing drives with Promise SX8 S-ATA card
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@bugs.debian.org
immediately.)


-- 
451326: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=451326
Debian Bug Tracking System
Contact owner@bugs.debian.org with problems
--- Begin Message ---
Package: linux-image-2.6.18-5-686
Version: 2.6.18.dfsg.1-13etch4
Severity: important

We are experiencing completely reproducible problems accessing S-ATA
drives, connected to one or more Promise SX8 S-ATA controllers.  The
controller is recognized by the default Debian kernel, and the attached
drives appear as the devices under /dev/sx8/.  When we create a
reasonably high load concurrently for more than one drive (such as
running two simultaneous "mkfs" processes), the processes lock up
indefinitely, and the load average starts climbing to approximately 2xN,
where N is the number of disks being accessed simultaneously.  After such a
lock-up, the drives are completely inaccessible.
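
A minimal sketch of how a comparable concurrent load could be generated
and checked, under stated assumptions: it uses read-only `dd` runs instead
of "mkfs" so that nothing is written to the disks (which may or may not
trigger the same lock-up), the device paths passed in are examples such as
/dev/sx8/0, and the function names and the 10-minute timeout are
hypothetical.

```python
import os
import subprocess
import time


def build_read_commands(devices, megabytes=1024):
    """One read-only dd command per device; nothing is written to the disks."""
    return [
        ["dd", f"if={dev}", "of=/dev/null", "bs=1M", f"count={megabytes}"]
        for dev in devices
    ]


def run_concurrently(commands, timeout_s=600):
    """Start every command at once; return those still running at the deadline."""
    procs = [
        subprocess.Popen(cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL)
        for cmd in commands
    ]
    deadline = time.monotonic() + timeout_s
    stuck = []
    for cmd, proc in zip(commands, procs):
        try:
            proc.wait(timeout=max(0.0, deadline - time.monotonic()))
        except subprocess.TimeoutExpired:
            # Left running on purpose: a process hung in I/O may be unkillable.
            stuck.append(cmd)
    return stuck


def lockup_ratio(n_disks, load_average=None):
    """Load average relative to the 2 x N level described in this report."""
    if load_average is None:
        load_average = os.getloadavg()[0]  # 1-minute load average
    return load_average / (2 * n_disks)
```

Calling run_concurrently(build_read_commands(["/dev/sx8/0", "/dev/sx8/1"]))
and then lockup_ratio(2) would show whether any command hung and whether
the load average is near the 2xN level (a ratio close to 1.0).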

The problem can be reproduced with one or two cards on the same system.  
We have three SX8 controllers altogether, and two identical motherboards, 
and the problem can be reproduced with all combinations of these.  We don't
pass any boot-time kernel options to the SX8 driver, and we have tagged
commands disabled in the firmware configuration.  The SX8 cards are the
only add-on cards on the system.

We found a workaround for ourselves: we noticed that if we boot from a
Knoppix 5.0.1 CD-ROM, the above problem does not appear.  So we compiled
a custom kernel (2.6.23.1 from kernel.org), using the .config file from
the kernel that's on Knoppix 5.0.1 CD.  This worked: the problem cannot
be reproduced with that kernel on the same system.
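
To narrow down which settings differ between the two kernels, the two
.config files could be compared option by option.  The following is only a
sketch: the function names are hypothetical, and since the kernel versions
differ as well (2.6.18 versus 2.6.23.1), a configuration difference is
just one possible explanation.

```python
import re

_SET = re.compile(r"^(CONFIG_\w+)=(.*)$")
_UNSET = re.compile(r"^# (CONFIG_\w+) is not set$")


def parse_config(text):
    """Map each CONFIG_ option to its value; disabled options map to 'n'."""
    options = {}
    for line in text.splitlines():
        line = line.strip()
        match = _SET.match(line)
        if match:
            options[match.group(1)] = match.group(2)
            continue
        match = _UNSET.match(line)
        if match:
            options[match.group(1)] = "n"
    return options


def diff_configs(a, b):
    """Options whose values differ; None means the option is absent from that file."""
    return {
        key: (a.get(key), b.get(key))
        for key in sorted(set(a) | set(b))
        if a.get(key) != b.get(key)
    }
```

Each input would be the text of one configuration file, for example the
Debian one from /boot/config-2.6.18-5-686 and the one taken from the
Knoppix CD.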

We would very much prefer to use the stock Debian kernel, though.

The Promise SX8 cards we are using have been updated to the latest
firmware.  The motherboard is the SuperMicro PDSME, also flashed to the
latest BIOS.  Here are product information pages:
http://www.promise.com/product/product_detail_eng.asp?product_id=125
http://www.supermicro.com/products/motherboard/PD/E7230/PDSME.cfm

I'll paste the output of lspci and a snippet of `dmesg' at the end of
this report.

For your information, I found this open CentOS bug report: 
  http://bugs.centos.org/view.php?id=2355
This looks exactly like the problem we are experiencing.

p.s. I am not sure whether I can attach the working kernel configuration
file to this message: I'm using the reportbug command and so far it has
not offered to attach a file.  Should I just paste it in?

The output of lspci:
00:00.0 Host bridge: Intel Corporation E7230 Memory Controller Hub (rev 81)
00:01.0 PCI bridge: Intel Corporation E7230 PCI Express Root Port (rev 81)
00:1c.0 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 1 (rev 01)
00:1c.4 PCI bridge: Intel Corporation 82801GR/GH/GHM (ICH7 Family) PCI Express Port 5 (rev 01)
00:1c.5 PCI bridge: Intel Corporation 82801GR/GH/GHM (ICH7 Family) PCI Express Port 6 (rev 01)
00:1d.0 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #1 (rev 01)
00:1d.1 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #2 (rev 01)
00:1d.2 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #3 (rev 01)
00:1d.3 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #4 (rev 01)
00:1d.7 USB Controller: Intel Corporation 82801G (ICH7 Family) USB2 EHCI Controller (rev 01)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev e1)
00:1f.0 ISA bridge: Intel Corporation 82801GB/GR (ICH7 Family) LPC Interface Bridge (rev 01)
00:1f.1 IDE interface: Intel Corporation 82801G (ICH7 Family) IDE Controller (rev 01)
00:1f.2 IDE interface: Intel Corporation 82801GB/GR/GH (ICH7 Family) Serial ATA Storage Controller IDE (rev 01)
00:1f.3 SMBus: Intel Corporation 82801G (ICH7 Family) SMBus Controller (rev 01)
01:00.0 PCI bridge: Intel Corporation 6700PXH PCI Express-to-PCI Bridge A (rev 09)
01:00.1 PIC: Intel Corporation 6700/6702PXH I/OxAPIC Interrupt Controller A (rev 09)
01:00.2 PCI bridge: Intel Corporation 6700PXH PCI Express-to-PCI Bridge B (rev 09)
01:00.3 PIC: Intel Corporation 6700PXH I/OxAPIC Interrupt Controller B (rev 09)
03:01.0 Mass storage controller: Promise Technology, Inc. SATAII150 SX8 (rev 02)
03:03.0 Mass storage controller: Promise Technology, Inc. SATAII150 SX8 (rev 02)
05:00.0 Ethernet controller: Intel Corporation 82573E Gigabit Ethernet Controller (Copper) (rev 03)
06:00.0 Ethernet controller: Intel Corporation 82573L Gigabit Ethernet Controller
0a:04.0 VGA compatible controller: ATI Technologies Inc Rage XL (rev 27)

Snippet of `dmesg':
sx8 version 1.0
ACPI: PCI Interrupt 0000:03:01.0[A] -> GSI 48 (level, low) -> IRQ 58
sx8(0000:03:01.0): found 5 interesting devices
sx8(0000:03:01.0): port 0 device 781422768 sectors
sx8(0000:03:01.0): port 0 device "ST3400633AS"
sx8(0000:03:01.0): port 1 device 781422768 sectors
sx8(0000:03:01.0): port 1 device "ST3400633AS"
sx8(0000:03:01.0): port 2 device 781422768 sectors
sx8(0000:03:01.0): port 2 device "ST3400633AS"
sx8(0000:03:01.0): port 3 device 781422768 sectors
sx8(0000:03:01.0): port 3 device "ST3400633AS"
sx8(0000:03:01.0): port 4 device 781422768 sectors
sx8(0000:03:01.0): port 4 device "ST3400633AS"
 sx8/0: p1
 sx8/1: p1
 sx8/2: p1
 sx8/3:<6>ACPI: PCI Interrupt 0000:00:1f.3[B] -> GSI 19 (level, low) -> IRQ 233
 p1
 sx8/4: p1
sx8(0000:03:01.0): 5 ports activated
sx80: pci 0000:03:01.0, ports 8, io ed300000, irq 58, major 160
ACPI: PCI Interrupt 0000:03:03.0[A] -> GSI 52 (level, low) -> IRQ 66
sx8(0000:03:03.0): found 0 interesting devices
sx8(0000:03:03.0): 0 ports activated
sx81: pci 0000:03:03.0, ports 8, io ed400000, irq 66, major 161
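
As a sanity check on the sector counts above, a small sketch that extracts
the per-port lines from such a dmesg snippet and converts them to bytes.
The 512-byte sector size is an assumption, and the function name is
hypothetical.  With that assumption, 781422768 sectors is 400088457216
bytes, i.e. about 400 GB per drive.

```python
import re

SECTOR_BYTES = 512  # assumption: traditional 512-byte sectors

_PORT_LINE = re.compile(
    r"sx8\(([0-9a-fA-F:.]+)\): port (\d+) device (\d+) sectors"
)


def port_capacities(dmesg_text):
    """Map (PCI address, port number) to the drive capacity in bytes."""
    return {
        (m.group(1), int(m.group(2))): int(m.group(3)) * SECTOR_BYTES
        for m in _PORT_LINE.finditer(dmesg_text)
    }
```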

Please let me know if you require any more information.

-- System Information:
Debian Release: 4.0
  APT prefers stable
  APT policy: (500, 'stable')
Architecture: i386 (i686)
Shell:  /bin/sh linked to /bin/bash
Kernel: Linux 2.6.18-5-686
Locale: LANG=C, LC_CTYPE=C (charmap=ANSI_X3.4-1968)

Versions of packages linux-image-2.6.18-5-686 depends on:
ii  coreutils                     5.97-5.3   The GNU core utilities
ii  debconf [debconf-2.0]         1.5.11     Debian configuration management sy
ii  initramfs-tools [linux-initra 0.85h      tools for generating an initramfs
ii  module-init-tools             3.3-pre4-2 tools for managing Linux kernel mo

Versions of packages linux-image-2.6.18-5-686 recommends:
pn  libc6-i686                    <none>     (no description available)

-- debconf information:
  linux-image-2.6.18-5-686/preinst/failed-to-move-modules-2.6.18-5-686:
  linux-image-2.6.18-5-686/postinst/create-kimage-link-2.6.18-5-686: true
  linux-image-2.6.18-5-686/preinst/initrd-2.6.18-5-686:
  shared/kernel-image/really-run-bootloader: true
  linux-image-2.6.18-5-686/preinst/lilo-has-ramdisk:
  linux-image-2.6.18-5-686/preinst/abort-install-2.6.18-5-686:
  linux-image-2.6.18-5-686/postinst/old-system-map-link-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/kimage-is-a-directory:
  linux-image-2.6.18-5-686/preinst/elilo-initrd-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/bootloader-test-error-2.6.18-5-686:
  linux-image-2.6.18-5-686/preinst/abort-overwrite-2.6.18-5-686:
  linux-image-2.6.18-5-686/prerm/would-invalidate-boot-loader-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/depmod-error-initrd-2.6.18-5-686: false
  linux-image-2.6.18-5-686/prerm/removing-running-kernel-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/bootloader-error-2.6.18-5-686:
  linux-image-2.6.18-5-686/postinst/depmod-error-2.6.18-5-686: false
  linux-image-2.6.18-5-686/preinst/already-running-this-2.6.18-5-686:
  linux-image-2.6.18-5-686/preinst/lilo-initrd-2.6.18-5-686: true
  linux-image-2.6.18-5-686/preinst/overwriting-modules-2.6.18-5-686: true
  linux-image-2.6.18-5-686/preinst/bootloader-initrd-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/old-dir-initrd-link-2.6.18-5-686: true
  linux-image-2.6.18-5-686/postinst/old-initrd-link-2.6.18-5-686: true



--- End Message ---
--- Begin Message ---
Version: 2.6.23-1

> So we compiled a custom kernel (2.6.23.1 from kernel.org), using the
> .config file from the kernel that's on Knoppix 5.0.1 CD.  This worked:
> the problem cannot be reproduced with that kernel on the same system.


SATA Promise support has improved a lot since 2.6.18, so this is expected.
For a newer kernel on etch, just use backports.org or Etch-and-a-half:
http://wiki.debian.org/EtchAndAHalf

best regards

-- 
maks


--- End Message ---
