[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#328534: Adaptec 2005S Hangs with current experiemental 2.6.13-686-smp kernel



Hi,

I am adding this information to this bug at the request of the person helping me on Debian IRC debug the kernel panic when booting on my system.

With the information of this bug and the link to this one:

http://bugzilla.kernel.org/show_bug.cgi?id=4940

I was able to verify similar output from my system.

We loaded in the current linux-image-2.6.13-686-smp from experimental and the nature of the error changed completely although it still stopped (hung) at the same point.

I have attached an installation-report inside the report-bug e-mail. I stopped the report bug because it was going to open a new bug I think.

Please contact me if I can help further with this.

Thank you,
Ken
Subject: linux-image-2.6.13-686-smp: system hangs when I2O tried to access the controller
Package: linux-image-2.6.13-686-smp
Version: linux-image-686-smp
Severity: critical
Justification: breaks the whole system

*** Please type your report below this line ***

Package: installation-reports

Debian-installer-version: http://ftp.us.debian.org/debian/pool/main/l/linux-2.6/linux-image-2.6.13-1-686-smp_2.6.13-1_i386.deb
uname -a: Linux hostname 2.4.27-2-686-smp #1 SMP Tue Aug 16 15:57:25 JST 2005 i686 GNU/Linux

Date: 25 October 2005
Method: 

This system was installed with the netinstall minimal ISO image booting off IDE attached CDROM.

I had to install this system with install24 becuase the regular 2.6 install would crash after the inital reboot coming up on the standard kernel instead of running on the d-i kernel.

I used the http://cdimage.debian.org/pub/cdimage-testing/daily/i386/current/debian-testing-i386-netinst.iso for Ocotber 15 2005 for the various install iterations.

Machine: 

   Phoenix - Award WorkstationBIOS v6.00PG
   Copyright (C) 1984-2002, Phoenix Technologies, LTD

Supermicro  P4DC6+/P4DCE+/+II BIOS Rev 1.3

Main Processor : Intel Xeon(tm) 2.20GHz(100x22.0)    , 4 CPU(s)
Memory Testing : 1048576K OK

  Direct Rambus/Host Frequency is 400/100 MH0m MHzm400/100
  Primary Master : None
   Primary Slave : DVD-ROM BDV316C VER .20R
Secondary Master : None
 Secondary Slave : None

Adaptec I O BIOS v001.41 (2001/07/13)
  @ Copyright Adaptec Inc. 1996-2001 All Rights Reserved
 Device 3/05/0 1044:A511: Changing Latency from 20h to 40h

Controller:0xF1000000 IRQ11     2005S          FW380E  cyls   hds   secs =+
   Drive  :     (0,0,0) ADAPTEC RAID-5           380E  3298   255   63  25.27GB


This is a generic 2U case with SCA 9Gb 10K drives with a Supermicro P4DC6+ motherboard with onboard SCSI on it.

Processor: Two Pentium IV 2.2 GHz XEON These register as dual core
Memory: 1Gb RIMM
Root Device: 
/dev/sda1 is serverd by hardware raid controller via SCSI.

gathered with raidutil -L controller
#  b0 b1 b2  Controller     Cache  FW    NVRAM     Serial     Status
---------------------------------------------------------------------------
d0 -- --     ADAP2005S      32MB   380E  ADPT 1.0  BB0E20403B7Optimal

raidutil -L logical
Address    Type              Manufacturer/Model         Capacity  Status
---------------------------------------------------------------------------
d0b0t0d0   RAID 5 (Redundant ADAPTEC  RAID-5            25875MB   Optimal

raidutil -L physical
Address    Type              Manufacturer/Model         Capacity  Status
---------------------------------------------------------------------------
d0b0t0d0   Disk Drive (DASD) HP       9.10GB B 80-1205  8678MB    Optimal
d0b0t1d0   Disk Drive (DASD) SEAGATE  ST39102L CLAR09   8625MB    Optimal
d0b0t2d0   Disk Drive (DASD) SEAGATE  ST39103LCSUN9.0G  8637MB    Optimal
d0b0t3d0   Disk Drive (DASD) HP       9.10GB B 80-1205  8678MB    Optimal

Root Size/partition table: 

Disk /dev/sda: 27.1 GB, 27131904000 bytes
255 heads, 63 sectors/track, 3298 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes

   Device Boot      Start         End      Blocks   Id  System
/dev/sda1   *           1        3160    25382668+  83  Linux
/dev/sda2            3161        3298     1108485    5  Extended
/dev/sda5            3161        3298     1108453+  82  Linux swap / Solaris

Output of lspci and lspci -n:

LSPCI:
0000:00:00.0 Host bridge: Intel Corporation 82860 860 (Wombat) Chipset Host Bridge (MCH) (rev 04)
0000:00:01.0 PCI bridge: Intel Corporation 82850 850 (Tehama) Chipset AGP Bridge (rev 04)
0000:00:02.0 PCI bridge: Intel Corporation 82860 860 (Wombat) Chipset AGP Bridge (rev 04)
0000:00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev 04)
0000:00:1f.0 ISA bridge: Intel Corporation 82801BA ISA Bridge (LPC) (rev 04)
0000:00:1f.1 IDE interface: Intel Corporation 82801BA IDE U100 (rev 04)
0000:00:1f.2 USB Controller: Intel Corporation 82801BA/BAM USB (Hub #1) (rev 04)
0000:00:1f.3 SMBus: Intel Corporation 82801BA/BAM SMBus (rev 04)
0000:00:1f.4 USB Controller: Intel Corporation 82801BA/BAM USB (Hub #2) (rev 04)
0000:00:1f.5 Multimedia audio controller: Intel Corporation 82801BA/BAM AC'97 Audio (rev 04)
0000:02:1f.0 PCI bridge: Intel Corporation 82806AA PCI64 Hub PCI Bridge (rev 03)
0000:03:00.0 PIC: Intel Corporation 82806AA PCI64 Hub Advanced Programmable Interrupt Controller (rev 01)
0000:03:05.0 RAID bus controller: Adaptec (formerly DPT) SmartRAID V Controller (rev 01)
0000:04:03.0 VGA compatible controller: Number 9 Computer Company Imagine 128-II
0000:04:04.0 Ethernet controller: Intel Corporation 82557/8/9 [Ethernet Pro 100] (rev 08)


LSPCI -n:
0000:00:00.0 0600: 8086:2531 (rev 04)
0000:00:01.0 0604: 8086:2532 (rev 04)
0000:00:02.0 0604: 8086:2533 (rev 04)
0000:00:1e.0 0604: 8086:244e (rev 04)
0000:00:1f.0 0601: 8086:2440 (rev 04)
0000:00:1f.1 0101: 8086:244b (rev 04)
0000:00:1f.2 0c03: 8086:2442 (rev 04)
0000:00:1f.3 0c05: 8086:2443 (rev 04)
0000:00:1f.4 0c03: 8086:2444 (rev 04)
0000:00:1f.5 0401: 8086:2445 (rev 04)
0000:02:1f.0 0604: 8086:1360 (rev 03)
0000:03:00.0 0800: 8086:1161 (rev 01)
0000:03:05.0 0104: 1044:a511 (rev 01)
0000:04:03.0 0300: 105d:2339
0000:04:04.0 0200: 8086:1229 (rev 08)


Base System Installation Checklist:
[O] = OK, [E] = Error (please elaborate below), [ ] = didn't try it

Initial boot worked:    [O]
Configure network HW:   [O]
Config network:         [O]
Detect CD:              [O]
Load installer modules: [O]
Detect hard drives:     [O]
Partition hard drives:  [O]
Create file systems:    [O]
Mount partitions:       [O]
Install base system:    [O]
Install boot loader:    [O]
Reboot:                 [E]

Comments/Problems:
I tried installing this system many times trying different things.  However, the only way the initial reboot wouldn't fail was when I used the 2.4 kernel.  I then attempted to install the 2.6 kernel with the same results.

I joined debian-boot on IRC and trave11er assisted with identifying the cause, bug# 328534.  I installed the above mentioned kernel: linux-image-2.6.13 from experimental to try the fixed version of the kernel.  The error message changed but failed at the same place, which seems to be the second driver loading like the bug report discusses.

Here is the output of the console I was able to obtain:


	shpchp: loaded successfully
	shpchp: already loaded
hw_random hardware driver 1.0.0 loaded
	hw_randow: loaded successfully
	shpchp: already loaded
	generic: loaded successfully
	piix: already loaded
	uhci-hcd: already loaded
	i2c-i801: loaded successfully
	uhci-hcd: already loaded
	i810_audio: already loaded
	snd-intel8x0: loaded successfully
	shpchp: already loaded
I2O subsystem v1.288
i2o:	max drivers = 8
i2o: Checking for PCI I2O controllers . . .
ACPI: PCI Interrupt 0000:03:05.0[A] -> GSI 18 (level, low) -> IRQ 169
iop0: controller found (0000:03:05.0)
PCI: Unable to reserve mem region #!:100000@f1000000 for device 0000:03:05.0
iop0: device already claimed
iop0: DMA / IO allocation for I2O controller failed
ACPI: PCI interrupt for device 0000:03:05.0 disabled
	i20_core: loaded successfully
dpti0: Trying to Abort cmd=1660

This is where it hung till I rebooted the system.

Summary:

I even attempted installing fedora core 4 on this same hardware, but I couldn't see the disks, guess the right module to load, and not overly interested in fedora on my computer.  I tried it more to make sure I wasn't doing something wrong.  So when Fedora had problems (albeit for other reasons) I was content to continue to poke at it with debian.  The system is functionial with the 2.4 kernel, but I do not want to let this go unresolved long term.  And I do not want to start using it until I find a resolution for the long term.
 
So, from that stand-point, I was impressed with the daily build of d-i, it handled the hardware raid very well compariong this same issue to fedora.  Everything about the install seemed to work fine save that initial reboot.  So, the d-i 2.6 kernel can handle what the stock 2.6.12-10 kernel could not.  (that was the most recent version of 2.6 I had tried on my system prior to the 2.6.13 kernel from experiemental)

I am not an expert in this regard, but it almost appears as if the system or kernel disabled the controller with ACPI because of the intereaction of the second module.

I am available to trouble shoot this as necessary.  I apologize for not being able to get a complete console dump, but I have been unable to get serial to work with the console=/dev/ttyS0,9600n8 option.  I believe this is due to the BIOS console redirection stuff this motherboard has. 


-- System Information:
Debian Release: testing/unstable
  APT prefers testing
  APT policy: (500, 'testing')
Architecture: i386 (i686)
Shell:  /bin/sh linked to /bin/bash
Kernel: Linux 2.4.27-2-686-smp
Locale: LANG=en_US, LC_CTYPE=en_US (charmap=ISO-8859-1)

Reply to: