Bug#488945: kmauz@htwg-konstanz.de: SCSI "Recovered Error" - stop ext3 journal - remount read-only
Package: linux-image-2.6.18-4-686
Version: 2.6.18.dfsg.1-12etch2
Dear debian-kernel-team,
During storage maintenance a SCSI error occurred and stopped the ext3
journal.
The setup:
Debian etch/i386:
Kernel: Linux homesrv 2.6.18-4-686 #1 SMP Wed May 9 23:03:12 UTC 2007 i686 GNU/Linux
lsmod:
nfs 202828 0
nfsd 197936 25
exportfs 5600 1 nfsd
lockd 54344 3 nfs,nfsd
nfs_acl 3584 2 nfs,nfsd
sunrpc 138812 13 nfs,nfsd,lockd,nfs_acl
vmmemctl 10332 0
ipv6 226016 36
quota_v2 8864 6
dm_snapshot 15552 0
dm_mirror 19152 0
dm_mod 50232 2 dm_snapshot,dm_mirror
loop 15048 0
tsdev 7520 0
i2c_piix4 8140 0
parport_pc 32132 0
parport 33256 1 parport_pc
intel_agp 21148 1
floppy 53156 0
shpchp 33024 0
pci_hotplug 28704 1 shpchp
i2c_core 19680 1 i2c_piix4
vmxnet 11712 0
agpgart 29896 1 intel_agp
rtc 12372 0
psmouse 35016 0
pcspkr 3072 0
serio_raw 6660 0
evdev 9088 0
ext3 119240 8
jbd 52456 1 ext3
mbcache 8356 1 ext3
sd_mod 19040 17
ide_cd 36064 0
cdrom 32544 1 ide_cd
mptspi 16136 9
mptscsih 21696 1 mptspi
mptbase 46176 2 mptspi,mptscsih
scsi_transport_spi 22336 1 mptspi
scsi_mod 124168 4 sd_mod,mptspi,mptscsih,scsi_transport_spi
piix 9444 0 [permanent]
generic 5476 0 [permanent]
ide_core 110504 3 ide_cd,piix,generic
thermal 13608 0
processor 28840 1 thermal
fan 4804 0
cat /proc/scsi/scsi
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: VMware Model: Virtual disk Rev: 1.0
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 01 Lun: 00
Vendor: VMware Model: Virtual disk Rev: 1.0
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 02 Lun: 00
Vendor: SUN Model: CSM200_R Rev: 0619
Type: Direct-Access ANSI SCSI revision: 05
Host: scsi0 Channel: 00 Id: 03 Lun: 00
Vendor: SUN Model: CSM200_R Rev: 0619
Type: Direct-Access ANSI SCSI revision: 05
Host: scsi0 Channel: 00 Id: 04 Lun: 00
Vendor: SUN Model: CSM200_R Rev: 0619
Type: Direct-Access ANSI SCSI revision: 05
Host: scsi0 Channel: 00 Id: 05 Lun: 00
Vendor: VMware Model: Virtual disk Rev: 1.0
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 06 Lun: 00
Vendor: VMware Model: Virtual disk Rev: 1.0
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 08 Lun: 00
Vendor: VMware Model: Virtual disk Rev: 1.0
Type: Direct-Access ANSI SCSI revision: 02
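To make it easier to see which SCSI Ids belong to the storage array and which are VMware virtual disks, the listing above can be grouped by vendor. The following is only a minimal sketch: the function name parse_scsi_listing and the shortened SAMPLE text are illustrative and assume the same layout as the output quoted above.

```python
import re

# Shortened excerpt in the same layout as the /proc/scsi/scsi output above.
SAMPLE = """\
Host: scsi0 Channel: 00 Id: 00 Lun: 00
Vendor: VMware Model: Virtual disk Rev: 1.0
Type: Direct-Access ANSI SCSI revision: 02
Host: scsi0 Channel: 00 Id: 02 Lun: 00
Vendor: SUN Model: CSM200_R Rev: 0619
Type: Direct-Access ANSI SCSI revision: 05
"""

HOST_RE = re.compile(
    r"Host:\s*(\S+)\s+Channel:\s*(\d+)\s+Id:\s*(\d+)\s+Lun:\s*(\d+)"
)
VENDOR_RE = re.compile(r"Vendor:\s*(.*?)\s+Model:\s*(.*?)\s+Rev:\s*(\S+)")


def parse_scsi_listing(text):
    """Return one dict per device found in a /proc/scsi/scsi style listing."""
    devices = []
    for raw in text.splitlines():
        line = raw.strip()
        host = HOST_RE.match(line)
        if host:
            # A "Host:" line starts a new device record.
            devices.append({
                "host": host.group(1),
                "channel": int(host.group(2)),
                "id": int(host.group(3)),
                "lun": int(host.group(4)),
            })
            continue
        vendor = VENDOR_RE.match(line)
        if vendor and devices:
            # The "Vendor:" line belongs to the most recent "Host:" line.
            devices[-1].update(
                vendor=vendor.group(1),
                model=vendor.group(2),
                rev=vendor.group(3),
            )
    return devices


if __name__ == "__main__":
    for dev in parse_scsi_listing(SAMPLE):
        print(dev["id"], dev.get("vendor"), dev.get("model"))
```

Applied to the full listing, the entries with vendor SUN are the array LUNs; the others are VMware virtual disks.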
It is running in a VMware ESX 3.0.1 environment with a Sun STK 6140
storage array.
During a disk change the following error is logged by the kernel:
sdc: Current: sense key: Recovered Error
<<vendor>> ASC=0xe0 ASCQ=0x6ASC=0xe0 ASCQ=0x6
EXT3-fs error (device sdc1): ext3_free_blocks_sb: bit already
cleared for block 75569857
Remounting filesystem read-only
I have to reboot and fsck the partition.
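For reference, the sense data in the log above can be decoded roughly as follows. This is a sketch, not kernel code: the table lists the standard SCSI sense key names, and the helper names (sense_key_code, parse_asc, is_vendor_specific_asc) are made up for illustration. ASC values from 0x80 to 0xFF are reserved for vendor-specific use, which is why the kernel prints "<<vendor>>"; what ASC=0xe0 ASCQ=0x6 actually means is defined by the array vendor and is not decoded here.

```python
import re

# Standard SCSI sense key codes and their names.
SENSE_KEYS = {
    0x0: "No Sense",
    0x1: "Recovered Error",
    0x2: "Not Ready",
    0x3: "Medium Error",
    0x4: "Hardware Error",
    0x5: "Illegal Request",
    0x6: "Unit Attention",
    0x7: "Data Protect",
    0x8: "Blank Check",
    0x9: "Vendor Specific",
    0xA: "Copy Aborted",
    0xB: "Aborted Command",
    0xD: "Volume Overflow",
    0xE: "Miscompare",
}

# Only lowercase hex digits are accepted, so that the fused text
# "ASCQ=0x6ASC=0xe0" from the log is not misread as ASCQ=0x6A.
ASC_RE = re.compile(r"ASC=0x([0-9a-f]+)\s+ASCQ=0x([0-9a-f]+)")


def sense_key_code(name):
    """Map a sense key name as printed in the log back to its code."""
    for code, key_name in SENSE_KEYS.items():
        if key_name.lower() == name.strip().lower():
            return code
    return None


def parse_asc(line):
    """Return (asc, ascq) from the first ASC/ASCQ pair in a log line."""
    match = ASC_RE.search(line)
    if match is None:
        return None
    return int(match.group(1), 16), int(match.group(2), 16)


def is_vendor_specific_asc(asc):
    """ASC codes 0x80..0xFF are vendor specific."""
    return 0x80 <= asc <= 0xFF


if __name__ == "__main__":
    log_line = "<<vendor>> ASC=0xe0 ASCQ=0x6ASC=0xe0 ASCQ=0x6"
    asc, ascq = parse_asc(log_line)
    print(hex(sense_key_code("Recovered Error")), hex(asc), hex(ascq),
          is_vendor_specific_asc(asc))
```

Sense key 0x1 (Recovered Error) means the device reports that the command completed successfully after some recovery action on its side.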
"Recovered Error" sounds as if there was an error but the storage
recovered from it by itself, so why did the ext3 failure occur?
The same issue happened on an Ubuntu 6.06 LTS system (kernel 2.6.16).
All Debian sarge systems with kernel 2.4.27 survived the SCSI error.
Is there a setting to tell the kernel to retry the read or write
operation?
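One related setting, which is not a SCSI retry control, is the ext3 "errors=" mount option (continue, remount-ro or panic). It decides what ext3 does once it has detected an error, and "remount-ro" matches the "Remounting filesystem read-only" message. A minimal sketch for checking which policy is in effect, assuming /proc/mounts style input; the sample lines, including the mount point /export, are hypothetical:

```python
# Hypothetical sample in /proc/mounts layout; the mount points are assumptions.
SAMPLE_MOUNTS = """\
/dev/sda1 / ext3 rw,errors=remount-ro 0 0
/dev/sdc1 /export ext3 rw,errors=remount-ro 0 0
proc /proc proc rw 0 0
"""


def errors_policy(mount_options):
    """Return the value of errors= from a comma-separated option string."""
    for option in mount_options.split(","):
        key, _, value = option.partition("=")
        if key == "errors":
            return value
    # Option not listed: the default stored in the superblock applies.
    return None


def ext3_policies(mounts_text):
    """Map each ext3 device to its errors= policy (None if not listed)."""
    policies = {}
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) >= 4 and fields[2] == "ext3":
            policies[fields[0]] = errors_policy(fields[3])
    return policies


if __name__ == "__main__":
    for device, policy in ext3_policies(SAMPLE_MOUNTS).items():
        print(device, policy)
```

If the option is not shown for a filesystem, the superblock default reported as "Errors behavior" by tune2fs -l is used. Whether /proc/mounts lists errors= at all may depend on the kernel version.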
Regards,
Konrad
--
Konrad Mauz
Rechenzentrum
Hochschule Technik, Wirtschaft und Gestaltung
Braunegger-Strasse 55, D 78462 Konstanz
e-mail: kmauz@htwg-konstanz.de
Tel.: +49 7531 206-472
Fax.: +49 7531 206-153