[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: filesystem corrupted like I've never seen



On Monday 13 December 2004 10:50, Alexandru Cabuz wrote:
> Hello,
>
> My /home and /var filesystems just got corrupted real bad and I have
> no idea why.
>
> I leave my (Latitude D600) laptop on overnight at the office normally.
> Except this monday morning I came into the office and tried to read my
> mail, and kmail promptly crashed saying the Mail folder doesn't have
> the right permissions or something.
>
> I closed everything down, rebooted, and at reboot I get tons of ide
> error messages
> dmesg gives this
>
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319017
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319017
> Buffer I/O error on device hda6, logical block 1306
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319021
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319021
> Buffer I/O error on device hda6, logical block 1307
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319025
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319025
> Buffer I/O error on device hda6, logical block 1308
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319029
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319029
> Buffer I/O error on device hda6, logical block 1309
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319033
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319033
> Buffer I/O error on device hda6, logical block 1310
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319037
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319037
> Buffer I/O error on device hda6, logical block 1311
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319041
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319041
> Buffer I/O error on device hda6, logical block 1312
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319045
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319045
> Buffer I/O error on device hda6, logical block 1313
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319049
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319049
> Buffer I/O error on device hda6, logical block 1314
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319053
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319053
> Buffer I/O error on device hda6, logical block 1315
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319057
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319057
> Buffer I/O error on device hda6, logical block 1316
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319061
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319061
> Buffer I/O error on device hda6, logical block 1317
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319065
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319065
> Buffer I/O error on device hda6, logical block 1318
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319069
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319069
> Buffer I/O error on device hda6, logical block 1319
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319073
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319073
> Buffer I/O error on device hda6, logical block 1320
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319077
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319077
> Buffer I/O error on device hda6, logical block 1321
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319081
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319081
> Buffer I/O error on device hda6, logical block 1322
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319085
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319085
> Buffer I/O error on device hda6, logical block 1323
> hda: dma_intr: status=0x51 { DriveReady SeekComplete Error }
> hda: dma_intr: error=0x40 { UncorrectableError }, LBAsect=10319090,
> high=0, low=10319090, sector=10319089
> ide: failed opcode was: unknown
> end_request: I/O error, dev hda, sector 10319089
> Buffer I/O error on device hda6, logical block 1324
> kjournald starting.  Commit interval 5 seconds
> EXT3 FS on hda9, internal journal
> EXT3-fs: recovery complete.
> EXT3-fs: mounted filesystem with ordered data mode.
> kjournald starting.  Commit interval 5 seconds
> EXT3 FS on hda8, internal journal
> EXT3-fs: mounted filesystem with ordered data mode.
> kjournald starting.  Commit interval 5 seconds
> EXT3 FS on hda5, internal journal
> EXT3-fs: mounted filesystem with ordered data mode.
> kjournald starting.  Commit interval 5 seconds
> EXT3 FS on hda6, internal journal
> ext3_orphan_cleanup: deleting unreferenced inode 658444
> ext3_orphan_cleanup: deleting unreferenced inode 89959
> ext3_orphan_cleanup: deleting unreferenced inode 89949
> EXT3-fs: hda6: 3 orphan inodes deleted
> EXT3-fs: recovery complete.
> EXT3-fs: mounted filesystem with ordered data mode.
> tg3.c:v3.10 (September 14, 2004)
> ACPI: PCI interrupt 0000:02:00.0[A] -> GSI 11 (level, low) -> IRQ 11
> tg3: tg3_request_firmware (eth%d): Couldn't get firmware
> "tg3/tso_5705-1.1.0". tg3: eth%d: Firmware "tg3/tso_5705-1.1.0" not loaded;
> continuing without TSO. eth0: Tigon3 [partno(BCM95705A50) rev 3001
> PHY(5705)]
> (PCI:33MHz:32-bit) 10/100/1000BaseT Ethernet 00:0f:1f:c8:9c:55
> eth0: RXcsums[1] LinkChgREG[1] MIirq[1] ASF[0] Split[0] WireSpeed[1]
> TSOcap[0] USB Universal Host Controller Interface driver v2.2
> ACPI: PCI interrupt 0000:00:1d.0[A] -> GSI 11 (level, low) -> IRQ 11
> uhci_hcd 0000:00:1d.0: Intel Corp. 82801DB/DBL/DBM
> (ICH4/ICH4-L/ICH4-M) USB UHCI Controller #1
> PCI: Setting latency timer of device 0000:00:1d.0 to 64
> uhci_hcd 0000:00:1d.0: irq 11, io base 0000bf80
> uhci_hcd 0000:00:1d.0: new USB bus registered, assigned bus number 2
> hub 2-0:1.0: USB hub found
> hub 2-0:1.0: 2 ports detected
> ACPI: PCI interrupt 0000:00:1d.1[B] -> GSI 11 (level, low) -> IRQ 11
> uhci_hcd 0000:00:1d.1: Intel Corp. 82801DB/DBL/DBM
> (ICH4/ICH4-L/ICH4-M) USB UHCI Controller #2
> PCI: Setting latency timer of device 0000:00:1d.1 to 64
> uhci_hcd 0000:00:1d.1: irq 11, io base 0000bf40
> uhci_hcd 0000:00:1d.1: new USB bus registered, assigned bus number 3
> hub 3-0:1.0: USB hub found
> hub 3-0:1.0: 2 ports detected
> ACPI: PCI interrupt 0000:00:1d.2[C] -> GSI 11 (level, low) -> IRQ 11
> uhci_hcd 0000:00:1d.2: Intel Corp. 82801DB/DBL/DBM
> (ICH4/ICH4-L/ICH4-M) USB UHCI Controller #3
> PCI: Setting latency timer of device 0000:00:1d.2 to 64
> uhci_hcd 0000:00:1d.2: irq 11, io base 0000bf20
> uhci_hcd 0000:00:1d.2: new USB bus registered, assigned bus number 4
> hub 4-0:1.0: USB hub found
> hub 4-0:1.0: 2 ports detected
> usb 3-1: new low speed USB device using address 2
> input: Logitech USB Mouse on usb-0000:00:1d.1-1
> usbcore: registered new driver hiddev
> usbcore: registered new driver usbhid
> drivers/usb/input/hid-core.c: v2.0:USB HID core driver
> mice: PS/2 mouse device common for all mice
> ts: Compaq touchscreen protocol output
> input: PS/2 Generic Mouse on isa0060/serio1
> input: PC Speaker
> Real Time Clock Driver v1.12
> parport: PnPBIOS parport detected.
> parport0: PC-style at 0x378 (0x778), irq 7, dma 1
> [PCSPP,TRISTATE,COMPAT,ECP,DMA]
> parport0: irq 7 in use, resorting to polled operation
> Linux agpgart interface v0.100 (c) Dave Jones
> agpgart: Detected an Intel 855PM Chipset.
> agpgart: Maximum main memory to use for agp memory: 941M
> agpgart: AGP aperture is 128M @ 0xe0000000
> cpci_hotplug: CompactPCI Hot Plug Core version: 0.2
> pci_hotplug: PCI Hot Plug PCI Core version: 0.5
> pciehp: acpi_pciehprm:\_SB_.PCI0 evaluate _BBN fail=0x5
> pciehp: acpi_pciehprm:get_device PCI ROOT HID fail=0x5
> shpchp: acpi_shpchprm:\_SB_.PCI0 evaluate _BBN fail=0x5
> shpchp: acpi_shpchprm:get_device PCI ROOT HID fail=0x5
> hw_random: cannot enable RNG, aborting
> pciehp: acpi_pciehprm:\_SB_.PCI0 evaluate _BBN fail=0x5
> pciehp: acpi_pciehprm:get_device PCI ROOT HID fail=0x5
> shpchp: acpi_shpchprm:\_SB_.PCI0 evaluate _BBN fail=0x5
> shpchp: acpi_shpchprm:get_device PCI ROOT HID fail=0x5
> ACPI: PCI interrupt 0000:00:1f.6[B] -> GSI 7 (level, low) -> IRQ 7
> PCI: Setting latency timer of device 0000:00:1f.6 to 64
> MC'97 1 converters and GPIO not ready (0xff00)
> Linux Kernel Card Services
>   options:  [pci] [cardbus] [pm]
> PCI: Enabling device 0000:02:01.0 (0000 -> 0002)
> ACPI: PCI interrupt 0000:02:01.0[A] -> GSI 11 (level, low) -> IRQ 11
> Yenta: CardBus bridge found at 0000:02:01.0 [1028:011d]
> Yenta: ISA IRQ mask 0x0438, PCI irq 11
> Socket status: 30000006
> PCI: Enabling device 0000:02:01.1 (0000 -> 0002)
> ACPI: PCI interrupt 0000:02:01.1[A] -> GSI 11 (level, low) -> IRQ 11
> Yenta: CardBus bridge found at 0000:02:01.1 [1028:011d]
> Yenta: ISA IRQ mask 0x0438, PCI irq 11
> Socket status: 30000410
> CSLIP: code copyright 1989 Regents of the University of California
> PPP generic driver version 2.4.2
> NET: Registered protocol family 10
> Disabled Privacy Extensions on device c034e040(lo)
> IPv6 over IPv4 tunneling driver
> NET: Registered protocol family 17
> ACPI: Battery Slot [BAT0] (battery present)
> ACPI: Battery Slot [BAT1] (battery absent)
> ACPI: AC Adapter [AC] (on-line)
> ip_tables: (C) 2000-2002 Netfilter core team
> tg3: eth0: Link is up at 10 Mbps, half duplex.
> tg3: eth0: Flow control is off for TX and off for RX.
> ip_conntrack version 2.1 (8189 buckets, 65512 max) - 332 bytes per
> conntrack PPP BSD Compression module registered
> PPP Deflate Compression module registered
> cs: IO port probe 0x0100-0x04ff: excluding 0x4d0-0x4d7
> cs: IO port probe 0x0800-0x08ff: clean.
> cs: IO port probe 0x0c00-0x0cff: clean.
> cs: IO port probe 0x0a00-0x0aff: clean.
> cs: memory probe 0xa0000000-0xa0ffffff: clean.
> mtrr: no more MTRRs available
> mtrr: no more MTRRs available
> mtrr: no more MTRRs available
> eth0: no IPv6 routers present
> codec_semaphore: semaphore is not ready [0x1][0x700300]
> codec_write 1: semaphore is not ready for register 0x54
> EXT3-fs error (device hda9): ext3_free_blocks: bit already cleared for
> block 282624
> Aborting journal on device hda9.
> EXT3-fs error (device hda9): ext3_free_blocks: bit already cleared for
> block 282625
> EXT3-fs error (device hda9): ext3_free_blocks: bit already cleared for
> block 282626
> EXT3-fs error (device hda9): ext3_free_blocks: bit already cleared for
> block 282627
> EXT3-fs error (device hda9): ext3_free_blocks: bit already cleared for
> block 282628
> ext3_reserve_inode_write: aborting transaction: Journal has aborted in
> __ext3_journal_get_write_access<2>EXT3-fs error (device hda9) in
> ext3_reserve_inode_write: Journal has aborted
> EXT3-fs error (device hda9) in ext3_truncate: Journal has aborted
> ext3_reserve_inode_write: aborting transaction: Journal has aborted in
> __ext3_journal_get_write_access<2>EXT3-fs error (device hda9) in
> ext3_reserve_inode_write: Journal has aborted
> EXT3-fs error (device hda9) in ext3_orphan_del: Journal has aborted
> ext3_reserve_inode_write: aborting transaction: Journal has aborted in
> __ext3_journal_get_write_access<2>EXT3-fs error (device hda9) in
> ext3_reserve_inode_write: Journal has aborted
> EXT3-fs error (device hda9) in ext3_delete_inode: Journal has aborted
> ext3_abort called.
> EXT3-fs error (device hda9): ext3_journal_start: Detected aborted journal
> Remounting filesystem read-only
> codec_semaphore: semaphore is not ready [0x1][0x700300]
> codec_write 1: semaphore is not ready for register 0x54
>
> df gives this
>
> Sys. de fich.        1M-blocs       Occupé Disponible Capacité Monté sur
> /dev/hda1                  250       183        55  78% /
> tmpfs                      506         0       506   0% /dev/shm
> /dev/hda9                28939      3515     23954  13% /home
> /dev/hda8                  361         9       334   3% /tmp
> /dev/hda5                 4695      3096      1361  70% /usr
> /dev/hda6                 1787       753       939  45% /var
>
>
>
>
>
>
> I have never seen this and I have no idea what could have happened,
> cos I just left my computer on, over the weekend, and didn't touch it
> or even log into it remotely or anything.
>
> The only thing I can think of that might have triggered something like
> this is software suspend 2, which I installed a couple of weeks ago,
> but it seemed to be working alright...
>
> Anyway, does anybody know what's going on and if there is any other
> way to save the day without formatting my /home and /var partitions
> (fortunately I have those filesystems each on their own logical
> partition, so I can safely format them, without affecting any of the
> rest of my system... except I gotta backup all my files, email folder,
> etc, a pain in the butt).
>
>
> Thanks for any pointers. Or perhaps there is another list I should
> post this to...?
> Alex.
Looks much more like a hardware failure to me than a filesystem 
failure.  Do you have SMART installed, if so look for its log entries
and see if it has been predicting a failure.

David



Reply to: