[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#572322: qla2xxx: Loading Module results in DMA error on Calgary PHB



Package: linux-2.6
Severity: important

Currently all Debian Kernels > 2.6.18 (including 2.6.32 from unstable)
are unusable on larger IBM xSeries machines with QLogic cards as
loading the kernel modules results in a crash and reboot of the whole
machine:

Feb 24 10:34:12 dbsrv01 kernel: [   95.516679] qla2xxx 0000:02:01.0:
Cable is unplugged...
Feb 24 10:34:12 dbsrv01 kernel: [   95.516679] scsi(1): fw_state=4 curr
time=ffff208e.
Feb 24 10:34:12 dbsrv01 kernel: [   95.516679] scsi(1): Firmware ready
**** FAILED ****.
Feb 24 10:34:12 dbsrv01 kernel: [   95.516679] qla2x00_restart_isp():
Configure loop done, status = 0x0
Feb 24 10:34:13 dbsrv01 kernel: [   95.516679] qla2xxx 0000:02:01.0: ISP
System Error - mbx1=65h mbx2=2h mbx3=8080h.
Feb 24 10:34:13 dbsrv01 kernel: [   95.516679] qla2xxx 0000:02:01.0:
Firmware dump saved to temp buffer (1/ffffc20007f84000).
Feb 24 10:34:13 dbsrv01 kernel: [   95.516679] qla2x00_abort_isp(1):
exiting.
Feb 24 10:34:13 dbsrv01 kernel: [   95.516679]
qla2x00_mailbox_command(1): finished abort_isp
Feb 24 10:34:13 dbsrv01 kernel: [   95.516679]
qla2x00_mailbox_command(1): finished abort_isp
Feb 24 10:34:13 dbsrv01 kernel: [   95.545239]
qla2x00_mailbox_command(1): **** FAILED. mbx0=69, mbx1=8023, mbx2=ffff,
cmd=69 ****
Feb 24 10:34:13 dbsrv01 kernel: [   95.613508]
qla2x00_get_firmware_state(1): failed=100.
Feb 24 10:34:13 dbsrv01 kernel: [   95.620441] scsi(1): fw_state=8023
curr time=ffff2118.
Feb 24 10:34:13 dbsrv01 kernel: [   95.625500] scsi(1): Firmware ready
**** FAILED ****.
Feb 24 10:34:13 dbsrv01 kernel: [   95.687879] scsi(1):
qla2x00_loop_resync - end
Feb 24 10:34:13 dbsrv01 kernel: [   96.232463] scsi(1): dpc: sched
qla2x00_abort_isp ha = ffff81086b4e85f8
Feb 24 10:34:13 dbsrv01 kernel: [   96.232463] qla2xxx 0000:02:01.0:
Performing ISP error recovery - ha= ffff81086b4e85f8.
Feb 24 10:34:13 dbsrv01 kernel: [   96.236463] Calgary: DMA error on
Calgary PHB 0x2, 0x02010000@CSR 0x00008000@PLSSR


The only kernel which works fine - as far as I know - is the RHEL Kernel
as they ship a patch which disables msi-x [1].
The issue is also known on several other bugtrackers, including
bugzilla.kernel.org ([2], [3]). Unfortunately there doesn't seem to be a
proper fix from upstream, so it would be nice to have the RHEL
workaround in Debian, too, as using Lenny is a PITA thanks to this bug -
currently we're using the Etch Kernel as this is the last one which
works well. According to the bugzilla.kernel.org bug the issue was
introduced with 2.6.20. Machine and the Qlogic cards are both running
the latest BIOS and Firmware versions, so there is nothing to upgrade
there unfortunately.


[1] http://launchpadlibrarian.net/17517188/linux-2.6-scsi-qla2xxx-disable-msi-x-by-default.patch
[2] http://bugzilla.kernel.org/show_bug.cgi?id=11646
[3] https://bugs.launchpad.net/ubuntu/+source/linux/+bug/268242


Best regards,

Bernd

--
 Bernd Zeimetz                            Debian GNU/Linux Developer
 http://bzed.de                                http://www.debian.org
 GPG Fingerprints: 06C8 C9A2 EAAD E37E 5B2C BE93 067A AD04 C93B FF79
                   ECA1 E3F2 8E11 2432 D485 DD95 EB36 171A 6FF9 435F



Reply to: