SCSI problem
What could the problem be? It looks as though the disk subsystem drops out.
The kernel still accepts connections on open ports and answers pings, but
there is no other response. At first I blamed poor support for this hardware
in kernel 2.2.19 (with it the machine did not even answer pings). I installed
2.4.16: it answers pings and runs noticeably faster, but after about 15 hours
the disks dropped out anyway.
Before dying, 2.2.19 dumped the following into the logs (this is only a fragment):
Dec 5 04:21:07 host kernel: SCSI disk error : host 0 channel 0 id 1 lun 0 return code = 28000002
Dec 5 04:21:07 host kernel: [valid=0] Info fld=0x0, Current sd08:05: sense key Not Ready
Dec 5 04:21:07 host kernel: Additional sense indicates Logical unit is in process of becoming ready
Dec 5 04:21:07 host kernel: scsidisk I/O error: dev 08:05, sector 147480
Dec 5 04:21:07 host kernel: SCSI disk error : host 0 channel 0 id 1 lun 0 return code = 28000002
Dec 5 04:21:07 host kernel: [valid=0] Info fld=0x0, Current sd08:05: sense key Not Ready
Dec 5 04:21:07 host kernel: Additional sense indicates Logical unit is in process of becoming ready
Dec 5 04:21:07 host kernel: scsidisk I/O error: dev 08:05, sector 147482
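
For reference, the "return code = 28000002" value in the lines above can be split into its byte fields. This is only a rough sketch: the field layout and the constant names are my recollection of the driver_byte/host_byte/msg_byte macros in the 2.2/2.4 kernel's scsi.h, so treat them as assumptions and check them against the kernel source. The function name decode_result is made up for illustration.

```python
# Split a Linux 2.2/2.4-era SCSI result word (e.g. 0x28000002) into fields.
# ASSUMPTION: layout is driver byte | host byte | msg byte | status byte,
# with constant names as recalled from the kernel's scsi.h.

DRIVER_CODES = {
    0x00: "DRIVER_OK", 0x01: "DRIVER_BUSY", 0x02: "DRIVER_SOFT",
    0x03: "DRIVER_MEDIA", 0x04: "DRIVER_ERROR", 0x05: "DRIVER_INVALID",
    0x06: "DRIVER_TIMEOUT", 0x07: "DRIVER_HARD", 0x08: "DRIVER_SENSE",
}
SUGGEST_CODES = {
    0x00: None, 0x10: "SUGGEST_RETRY", 0x20: "SUGGEST_ABORT",
    0x30: "SUGGEST_REMAP", 0x40: "SUGGEST_DIE", 0x80: "SUGGEST_SENSE",
}
# Raw SCSI status byte values (only a few common ones listed).
STATUS_CODES = {0x00: "GOOD", 0x02: "CHECK CONDITION", 0x08: "BUSY"}


def decode_result(result):
    """Return the four byte fields of a SCSI result word as a dict."""
    driver = (result >> 24) & 0xFF   # top byte: driver status + suggestion
    host = (result >> 16) & 0xFF     # host adapter status (0 means DID_OK)
    msg = (result >> 8) & 0xFF       # SCSI message byte
    status = result & 0xFF           # SCSI status byte from the target
    return {
        "driver": DRIVER_CODES.get(driver & 0x0F, hex(driver & 0x0F)),
        "suggest": SUGGEST_CODES.get(driver & 0xF0, hex(driver & 0xF0)),
        "host": host,
        "msg": msg,
        "status": STATUS_CODES.get(status, hex(status)),
    }


if __name__ == "__main__":
    print(decode_result(0x28000002))
```

If that reading is right, the host adapter itself reported no error and the drive returned CHECK CONDITION with sense data, which agrees with the "sense key Not Ready" lines in the log.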
I don't know yet what 2.4.16 said, because the reset button is 2000
kilometers away from here :) Where would you suggest I start digging?
# dmesg
Linux version 2.4.16 (root@host) (gcc version 2.95.4 20011006 (Debian prerelease)) #1 Thu Dec 5 19:11:23 GMT 2001
BIOS-provided physical RAM map:
BIOS-e820: 0000000000000000 - 000000000009f000 (usable)
BIOS-e820: 000000000009f000 - 00000000000a0000 (reserved)
BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
BIOS-e820: 0000000000100000 - 000000001fffc000 (usable)
BIOS-e820: 000000001fffc000 - 000000001ffff000 (ACPI data)
BIOS-e820: 000000001ffff000 - 0000000020000000 (ACPI NVS)
BIOS-e820: 00000000ffff0000 - 0000000100000000 (reserved)
On node 0 totalpages: 131068
zone(0): 4096 pages.
zone(1): 126972 pages.
zone(2): 0 pages.
Local APIC disabled by BIOS -- reenabling.
Found and enabled local APIC!
Kernel command line: auto BOOT_IMAGE=Linux ro root=805
Initializing CPU#0
Detected 1007.361 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 2011.95 BogoMIPS
Memory: 513520k/524272k available (1234k kernel code, 10364k reserved, 465k data, 224k init, 0k highmem)
Dentry-cache hash table entries: 65536 (order: 7, 524288 bytes)
Inode-cache hash table entries: 32768 (order: 6, 262144 bytes)
Mount-cache hash table entries: 8192 (order: 4, 65536 bytes)
Buffer-cache hash table entries: 32768 (order: 5, 131072 bytes)
Page-cache hash table entries: 131072 (order: 7, 524288 bytes)
CPU: Before vendor init, caps: 0383fbff 00000000 00000000, vendor = 0
CPU: L1 I cache: 16K, L1 D cache: 16K
CPU: L2 cache: 256K
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU: After vendor init, caps: 0383fbff 00000000 00000000 00000000
CPU: After generic, caps: 0383fbff 00000000 00000000 00000000
CPU: Common caps: 0383fbff 00000000 00000000 00000000
CPU: Intel Pentium III (Coppermine) stepping 06
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000040
ESR value after enabling vector: 00000000
Using local APIC timer interrupts.
calibrating APIC timer ...
..... CPU clock speed is 1007.3765 MHz.
..... host bus clock speed is 134.3168 MHz.
cpu: 0, clocks: 1343168, slice: 671584
CPU0<T0:1343168,T1:671584,D:0,S:671584,C:1343168>
mtrr: v1.40 (20010327) Richard Gooch (rgooch@atnf.csiro.au)
mtrr: detected mtrr type: Intel
PCI: PCI BIOS revision 2.10 entry at 0xf10e0, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: Using IRQ router ALI [10b9/1533] at 00:07.0
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
Starting kswapd
VFS: Diskquotas version dquot_6.4.0 initialized
Journalled Block Device driver loaded
devfs: v0.120 (20011103) Richard Gooch (rgooch@atnf.csiro.au)
devfs: boot_options: 0x0
Detected PS/2 Mouse Port.
keyboard: Timeout - AT keyboard not present?(ed)
keyboard: Timeout - AT keyboard not present?(f4)
pty: 256 Unix98 ptys configured
Serial driver version 5.05c (2001-07-08) with MANY_PORTS SHARE_IRQ SERIAL_PCI enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
block: 128 slots per queue, batch=32
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
ALI15X3: IDE controller on PCI bus 00 dev 20
PCI: No IRQ known for interrupt pin A of device 00:04.0. Please try using pci=biosirq.
ALI15X3: chipset revision 196
ALI15X3: not 100% native mode: will probe irqs later
ide0: BM-DMA at 0xb400-0xb407, BIOS settings: hda:pio, hdb:DMA
ide1: BM-DMA at 0xb408-0xb40f, BIOS settings: hdc:pio, hdd:pio
hdb: 54X CD-ROM, ATAPI CD/DVD-ROM drive
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
hdc: no response (status = 0xa1), resetting drive
hdc: no response (status = 0xa1)
hdd: no response (status = 0xa1), resetting drive
hdd: no response (status = 0xa1)
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
Floppy drive(s): fd0 is 1.44M
FDC 0 is a post-1991 82077
SCSI subsystem driver Revision: 1.00
PCI: Found IRQ 11 for device 00:0c.0
PCI: Sharing IRQ 11 with 01:00.0
scsi0 : Initio INI-A100U2W SCSI device driver; Revision: 1.02c
Vendor: QUANTUM Model: ATLAS_V_18_WLS Rev: 0230
Type: Direct-Access ANSI SCSI revision: 03
Vendor: QUANTUM Model: ATLAS_V_18_WLS Rev: 0230
Type: Direct-Access ANSI SCSI revision: 03
Attached scsi disk sda at scsi0, channel 0, id 1, lun 0
Attached scsi disk sdb at scsi0, channel 0, id 2, lun 0
SCSI device sda: 35861388 512-byte hdwr sectors (18361 MB)
Partition check:
/dev/scsi/host0/bus0/target1/lun0: p1 p2 < p5 p6 p7 p8 >
SCSI device sdb: 35861388 512-byte hdwr sectors (18361 MB)
/dev/scsi/host0/bus0/target2/lun0: p1
usb.c: registered new driver usbdevfs
usb.c: registered new driver hub
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP, IGMP
IP: routing cache hash table of 4096 buckets, 32Kbytes
TCP: Hash tables configured (established 32768 bind 32768)
Linux IP multicast router 0.06 plus PIM-SM
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
kjournald starting. Commit interval 5 seconds
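
One cross-check between the two dumps: the error log names "dev 08:05" and the kernel command line has root=805. A small sketch for turning such numbers into a device name, assuming the conventional numbering where SCSI disks use major 8 with 16 minors per disk (the helper names parse_hex_dev and sd_name are made up for illustration):

```python
def parse_hex_dev(text):
    """Parse a hex device number such as "08:05" or "805" into (major, minor)."""
    value = int(text.replace(":", ""), 16)
    return value >> 8, value & 0xFF


def sd_name(major, minor):
    """Map (major, minor) to a classic sdXN name.

    ASSUMPTION: major 8 is the first SCSI disk major and each disk owns
    16 consecutive minors (minor 0 of each group is the whole disk).
    """
    if major != 8:
        raise ValueError("not the first SCSI disk major")
    disk, part = divmod(minor, 16)
    name = "sd" + chr(ord("a") + disk)
    return name + (str(part) if part else "")


if __name__ == "__main__":
    for text in ("08:05", "805"):
        print(text, "->", sd_name(*parse_hex_dev(text)))
```

Under that assumption both values point at the same partition on sda, which dmesg shows attached at id 1, the same id as in the error messages. So the failing disk appears to be the one holding the root filesystem.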