Bug#295657: EXT3 on RAID problems in all 2.6-smp
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
I'm seeing similar problems. I've now seen it twice. The first one was
with kernel-image-2.6.8-1-686-smp.
Unfortunately, I can't find the exakt error messages from that time,
but if I recall correctly, they were very similar to the ones listed
earlier in this bug, as well as to the ones later in this message.
At that time, I did some googling, and found some indications that the
problem might have to do with apic, so I turned off apic (which I was
getting loads of warning messages about in dmesg), and hoped for the
best.
Unfortunately, yesterday, it happened again, now with
kernel-image-2.6.8-2-686-smp. Relevant error messages from dmesg:
EXT3-fs error (device dm-4): ext3_find_entry: bad entry in directory #7176199: inode out of bounds - offset=192512, inode=4294967167, rec_len=64, name_len=55
Aborting journal on device dm-4.
ext3_abort called.
EXT3-fs abort (device dm-4): ext3_journal_start: Detected aborted journal
Remounting filesystem read-only
The filesystem is a 100G ext3 on top of lvm2 on top of a RAID1 md with
two 160G ATA disks. Running fsck took about 16 hours, and gave us a
situation where about 8G of the 40G data on the disk went into
lost+found. Not so funny.
I'll now try a UP kernel, probably kernel-image-2.6.10-1-686.
I consider this bug rather important. Being able to build cheap large
file servers based on Linux is an important strength of the OS. Having
it crash like this gives bad credibility.
The machine is a Dual PIII/850, 1G ram. Kernel was booted with
'root=/dev/md0 ro noapic' as commandline.
There are lots and lots of files on this file system, hosting mail
files in Maildir format. Some info about the filesystem:
lenin:~# tune2fs -l /dev/main_vg/home
tune2fs 1.35 (28-Feb-2004)
Filesystem volume name: <none>
Last mounted on: <not available>
Filesystem UUID: a3933c7b-b677-4b4b-8e5e-7bb8e865c430
Filesystem magic number: 0xEF53
Filesystem revision #: 1 (dynamic)
Filesystem features: has_journal dir_index filetype sparse_super
Default mount options: (none)
Filesystem state: clean
Errors behavior: Continue
Filesystem OS type: Linux
Inode count: 26214400
Block count: 26214400
Reserved block count: 0
Free blocks: 14808813
Free inodes: 22126089
First block: 0
Block size: 4096
Fragment size: 4096
Blocks per group: 32768
Fragments per group: 32768
Inodes per group: 32768
Inode blocks per group: 1024
Filesystem created: Mon Jan 10 20:50:26 2005
Last mount time: Thu Mar 31 13:10:56 2005
Last write time: Thu Mar 31 13:30:48 2005
Mount count: 1
Maximum mount count: -1
Last checked: Thu Mar 31 12:56:50 2005
Check interval: 0 (<none>)
Reserved blocks uid: 0 (user root)
Reserved blocks gid: 0 (group root)
First inode: 11
Inode size: 128
Journal inode: 8
Default directory hash: tea
Directory Hash Seed: d8ffb37b-3ff6-45aa-9b8c-0901fad4c215
Journal backup: inode blocks
Other data that may be relevant:
# cat /proc/mdstat
Personalities : [raid1]
md0 : active raid1 hde1[0] hdg1[1]
5855552 blocks [2/2] [UU]
md1 : active raid1 hde2[0] hdg2[1]
154223936 blocks [2/2] [UU]
lvdisplay
--- Logical volume ---
LV Name /dev/main_vg/var
VG Name main_vg
LV UUID whtP3W-Rvtr-yIcP-li6v-RcnO-XW1h-nw0K1T
LV Write Access read/write
LV Status available
# open 1
LV Size 4.00 GB
Current LE 128
Segments 1
Allocation inherit
Read ahead sectors 0
Block device 253:0
--- Logical volume ---
LV Name /dev/main_vg/spool
VG Name main_vg
LV UUID kSF1It-3PDb-qioo-IVPe-zveS-TIsk-aiXl4K
LV Write Access read/write
LV Status available
# open 1
LV Size 15.00 GB
Current LE 480
Segments 1
Allocation inherit
Read ahead sectors 0
Block device 253:1
--- Logical volume ---
LV Name /dev/main_vg/mailman
VG Name main_vg
LV UUID s6JMQZ-2Vgt-J2Dh-BRJK-lFar-2zhh-04Syqh
LV Write Access read/write
LV Status available
# open 1
LV Size 15.00 GB
Current LE 480
Segments 1
Allocation inherit
Read ahead sectors 0
Block device 253:2
--- Logical volume ---
LV Name /dev/main_vg/swap
VG Name main_vg
LV UUID Ky7RAN-Hc6G-EggZ-GW1O-srWk-FYpq-knwycb
LV Write Access read/write
LV Status available
# open 1
LV Size 2.00 GB
Current LE 64
Segments 1
Allocation inherit
Read ahead sectors 0
Block device 253:3
--- Logical volume ---
LV Name /dev/main_vg/home
VG Name main_vg
LV UUID dTO3U8-ka25-ELNi-JoBd-fbAp-WlHI-fjSrvJ
LV Write Access read/write
LV Status available
# open 1
LV Size 100.00 GB
Current LE 3200
Segments 1
Allocation inherit
Read ahead sectors 0
Block device 253:4
# vgdisplay
--- Volume group ---
VG Name main_vg
System ID
Format lvm2
Metadata Areas 1
Metadata Sequence No 6
VG Access read/write
VG Status resizable
MAX LV 0
Cur LV 5
Open LV 5
Max PV 0
Cur PV 1
Act PV 1
VG Size 147.06 GB
PE Size 32.00 MB
Total PE 4706
Alloc PE / Size 4352 / 136.00 GB
Free PE / Size 354 / 11.06 GB
VG UUID opWrDX-ZB2X-TJQB-OSCs-0iIS-jSCd-ulIKsu
# pvdisplay
--- Physical volume ---
PV Name /dev/md1
VG Name main_vg
PV Size 147.06 GB / not usable 0
Allocatable yes
PE Size (KByte) 32768
Total PE 4706
Free PE 354
Allocated PE 4352
PV UUID TaKwvI-xjjj-4Oi9-iTWL-H5qs-dGeG-Cg9B7b
- --
Erik Forsberg http://www.lysator.liu.se/~forsberg/
GPG/PGP Key: 1024D/0BAC89D9
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)
Comment: Processed by Mailcrypt 3.5.8 <http://mailcrypt.sourceforge.net/>
iD8DBQFCTB4GrJurFAusidkRAp55AKCKmz1RWPeGlUnRBcXSNuqaXyb4hQCfZEmT
mxzbavspk526UM6KVdkV7hs=
=Wmuh
-----END PGP SIGNATURE-----
Reply to: