Bug#328534: Adaptec 2005S Hangs with current experiemental 2.6.13-686-smp kernel
Hi,
I am adding this information to this bug at the request of the person
helping me on Debian IRC debug the kernel panic when booting on my system.
With the information of this bug and the link to this one:
http://bugzilla.kernel.org/show_bug.cgi?id=4940
I was able to verify similar output from my system.
We loaded in the current linux-image-2.6.13-686-smp from experimental
and the nature of the error changed completely although it still stopped
(hung) at the same point.
I have attached an installation-report inside the report-bug e-mail. I
stopped the report bug because it was going to open a new bug I think.
Please contact me if I can help further with this.
Thank you,
Ken
Subject: linux-image-2.6.13-686-smp: system hangs when I2O tried to access the controller
Package: linux-image-2.6.13-686-smp
Version: linux-image-686-smp
Severity: critical
Justification: breaks the whole system
*** Please type your report below this line ***
Package: installation-reports
Debian-installer-version: http://ftp.us.debian.org/debian/pool/main/l/linux-2.6/linux-image-2.6.13-1-686-smp_2.6.13-1_i386.deb
uname -a: Linux hostname 2.4.27-2-686-smp #1 SMP Tue Aug 16 15:57:25 JST 2005 i686 GNU/Linux
Date: 25 October 2005
Method:
This system was installed with the netinstall minimal ISO image booting off IDE attached CDROM.
I had to install this system with install24 becuase the regular 2.6 install would crash after the inital reboot coming up on the standard kernel instead of running on the d-i kernel.
I used the http://cdimage.debian.org/pub/cdimage-testing/daily/i386/current/debian-testing-i386-netinst.iso for Ocotber 15 2005 for the various install iterations.
Machine:
Phoenix - Award WorkstationBIOS v6.00PG
Copyright (C) 1984-2002, Phoenix Technologies, LTD
Supermicro P4DC6+/P4DCE+/+II BIOS Rev 1.3
Main Processor : Intel Xeon(tm) 2.20GHz(100x22.0) , 4 CPU(s)
Memory Testing : 1048576K OK
Direct Rambus/Host Frequency is 400/100 MH0m MHzm400/100
Primary Master : None
Primary Slave : DVD-ROM BDV316C VER .20R
Secondary Master : None
Secondary Slave : None
Adaptec I O BIOS v001.41 (2001/07/13)
@ Copyright Adaptec Inc. 1996-2001 All Rights Reserved
Device 3/05/0 1044:A511: Changing Latency from 20h to 40h
Controller:0xF1000000 IRQ11 2005S FW380E cyls hds secs =+
Drive : (0,0,0) ADAPTEC RAID-5 380E 3298 255 63 25.27GB
This is a generic 2U case with SCA 9Gb 10K drives with a Supermicro P4DC6+ motherboard with onboard SCSI on it.
Processor: Two Pentium IV 2.2 GHz XEON These register as dual core
Memory: 1Gb RIMM
Root Device:
/dev/sda1 is serverd by hardware raid controller via SCSI.
gathered with raidutil -L controller
# b0 b1 b2 Controller Cache FW NVRAM Serial Status
---------------------------------------------------------------------------
d0 -- -- ADAP2005S 32MB 380E ADPT 1.0 BB0E20403B7Optimal
raidutil -L logical
Address Type Manufacturer/Model Capacity Status
---------------------------------------------------------------------------
d0b0t0d0 RAID 5 (Redundant ADAPTEC RAID-5 25875MB Optimal
raidutil -L physical
Address Type Manufacturer/Model Capacity Status
---------------------------------------------------------------------------
d0b0t0d0 Disk Drive (DASD) HP 9.10GB B 80-1205 8678MB Optimal
d0b0t1d0 Disk Drive (DASD) SEAGATE ST39102L CLAR09 8625MB Optimal
d0b0t2d0 Disk Drive (DASD) SEAGATE ST39103LCSUN9.0G 8637MB Optimal
d0b0t3d0 Disk Drive (DASD) HP 9.10GB B 80-1205 8678MB Optimal
Root Size/partition table:
Disk /dev/sda: 27.1 GB, 27131904000 bytes
255 heads, 63 sectors/track, 3298 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes
Device Boot Start End Blocks Id System
/dev/sda1 * 1 3160 25382668+ 83 Linux
/dev/sda2 3161 3298 1108485 5 Extended
/dev/sda5 3161 3298 1108453+ 82 Linux swap / Solaris
Output of lspci and lspci -n:
LSPCI:
0000:00:00.0 Host bridge: Intel Corporation 82860 860 (Wombat) Chipset Host Bridge (MCH) (rev 04)
0000:00:01.0 PCI bridge: Intel Corporation 82850 850 (Tehama) Chipset AGP Bridge (rev 04)
0000:00:02.0 PCI bridge: Intel Corporation 82860 860 (Wombat) Chipset AGP Bridge (rev 04)
0000:00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev 04)
0000:00:1f.0 ISA bridge: Intel Corporation 82801BA ISA Bridge (LPC) (rev 04)
0000:00:1f.1 IDE interface: Intel Corporation 82801BA IDE U100 (rev 04)
0000:00:1f.2 USB Controller: Intel Corporation 82801BA/BAM USB (Hub #1) (rev 04)
0000:00:1f.3 SMBus: Intel Corporation 82801BA/BAM SMBus (rev 04)
0000:00:1f.4 USB Controller: Intel Corporation 82801BA/BAM USB (Hub #2) (rev 04)
0000:00:1f.5 Multimedia audio controller: Intel Corporation 82801BA/BAM AC'97 Audio (rev 04)
0000:02:1f.0 PCI bridge: Intel Corporation 82806AA PCI64 Hub PCI Bridge (rev 03)
0000:03:00.0 PIC: Intel Corporation 82806AA PCI64 Hub Advanced Programmable Interrupt Controller (rev 01)
0000:03:05.0 RAID bus controller: Adaptec (formerly DPT) SmartRAID V Controller (rev 01)
0000:04:03.0 VGA compatible controller: Number 9 Computer Company Imagine 128-II
0000:04:04.0 Ethernet controller: Intel Corporation 82557/8/9 [Ethernet Pro 100] (rev 08)
LSPCI -n:
0000:00:00.0 0600: 8086:2531 (rev 04)
0000:00:01.0 0604: 8086:2532 (rev 04)
0000:00:02.0 0604: 8086:2533 (rev 04)
0000:00:1e.0 0604: 8086:244e (rev 04)
0000:00:1f.0 0601: 8086:2440 (rev 04)
0000:00:1f.1 0101: 8086:244b (rev 04)
0000:00:1f.2 0c03: 8086:2442 (rev 04)
0000:00:1f.3 0c05: 8086:2443 (rev 04)
0000:00:1f.4 0c03: 8086:2444 (rev 04)
0000:00:1f.5 0401: 8086:2445 (rev 04)
0000:02:1f.0 0604: 8086:1360 (rev 03)
0000:03:00.0 0800: 8086:1161 (rev 01)
0000:03:05.0 0104: 1044:a511 (rev 01)
0000:04:03.0 0300: 105d:2339
0000:04:04.0 0200: 8086:1229 (rev 08)
Base System Installation Checklist:
[O] = OK, [E] = Error (please elaborate below), [ ] = didn't try it
Initial boot worked: [O]
Configure network HW: [O]
Config network: [O]
Detect CD: [O]
Load installer modules: [O]
Detect hard drives: [O]
Partition hard drives: [O]
Create file systems: [O]
Mount partitions: [O]
Install base system: [O]
Install boot loader: [O]
Reboot: [E]
Comments/Problems:
I tried installing this system many times trying different things. However, the only way the initial reboot wouldn't fail was when I used the 2.4 kernel. I then attempted to install the 2.6 kernel with the same results.
I joined debian-boot on IRC and trave11er assisted with identifying the cause, bug# 328534. I installed the above mentioned kernel: linux-image-2.6.13 from experimental to try the fixed version of the kernel. The error message changed but failed at the same place, which seems to be the second driver loading like the bug report discusses.
Here is the output of the console I was able to obtain:
shpchp: loaded successfully
shpchp: already loaded
hw_random hardware driver 1.0.0 loaded
hw_randow: loaded successfully
shpchp: already loaded
generic: loaded successfully
piix: already loaded
uhci-hcd: already loaded
i2c-i801: loaded successfully
uhci-hcd: already loaded
i810_audio: already loaded
snd-intel8x0: loaded successfully
shpchp: already loaded
I2O subsystem v1.288
i2o: max drivers = 8
i2o: Checking for PCI I2O controllers . . .
ACPI: PCI Interrupt 0000:03:05.0[A] -> GSI 18 (level, low) -> IRQ 169
iop0: controller found (0000:03:05.0)
PCI: Unable to reserve mem region #!:100000@f1000000 for device 0000:03:05.0
iop0: device already claimed
iop0: DMA / IO allocation for I2O controller failed
ACPI: PCI interrupt for device 0000:03:05.0 disabled
i20_core: loaded successfully
dpti0: Trying to Abort cmd=1660
This is where it hung till I rebooted the system.
Summary:
I even attempted installing fedora core 4 on this same hardware, but I couldn't see the disks, guess the right module to load, and not overly interested in fedora on my computer. I tried it more to make sure I wasn't doing something wrong. So when Fedora had problems (albeit for other reasons) I was content to continue to poke at it with debian. The system is functionial with the 2.4 kernel, but I do not want to let this go unresolved long term. And I do not want to start using it until I find a resolution for the long term.
So, from that stand-point, I was impressed with the daily build of d-i, it handled the hardware raid very well compariong this same issue to fedora. Everything about the install seemed to work fine save that initial reboot. So, the d-i 2.6 kernel can handle what the stock 2.6.12-10 kernel could not. (that was the most recent version of 2.6 I had tried on my system prior to the 2.6.13 kernel from experiemental)
I am not an expert in this regard, but it almost appears as if the system or kernel disabled the controller with ACPI because of the intereaction of the second module.
I am available to trouble shoot this as necessary. I apologize for not being able to get a complete console dump, but I have been unable to get serial to work with the console=/dev/ttyS0,9600n8 option. I believe this is due to the BIOS console redirection stuff this motherboard has.
-- System Information:
Debian Release: testing/unstable
APT prefers testing
APT policy: (500, 'testing')
Architecture: i386 (i686)
Shell: /bin/sh linked to /bin/bash
Kernel: Linux 2.4.27-2-686-smp
Locale: LANG=en_US, LC_CTYPE=en_US (charmap=ISO-8859-1)
Reply to: