[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: disk failure [CLOSED]



On Wed, Nov 14, 2007 at 04:46:42PM +0000, michael wrote:
> On Wed, 2007-11-14 at 11:41 +0000, michael wrote:
> > On Wed, 2007-11-14 at 12:22 +0100, Jochen Schulz wrote:
> > > michael:
> > > > 'tiger' just told me various home directories are unavailable and upon
> > > > further investigation I see disk errors. Here's the first reports I can
> > > > find regarding said hard drive:
> > > > 
> > > > Nov 13 02:23:01 ratty /USR/SBIN/CRON[19292]: (michael) CMD (rsync -r -v
> > > > -P --links --stats /data_hdb1/michael/ /data_hdd1/michael/)
> > > > Nov 13 02:27:32 ratty kernel: hdd: dma_timer_expiry: dma status == 0x61
> > > > Nov 13 02:27:47 ratty kernel: hdd: DMA timeout error
> > > > Nov 13 02:27:47 ratty kernel: hdd: dma timeout error: status=0x58
> > > > { DriveReady SeekComplete DataRequest }
> > > 
> > > If I were you, I would assume it is dead. Try to copy everything you
> > > still can get off the disk, use your backup for the rest.
> > 
> > When I tried e2fsck it said there was no partition on the HD (sorry just
> > had to turn the machine off, see below)
> > 
> > > If you are curious, you may use smartctl from the package smartmontools
> > > to do further investigation. Your hard disk's manufacturer probably
> > > offers diagnostic tools as well.
> > 
> > SMART only gave very limited info - didn't seem to be able to read disk
> > at all.
> > 
> > > > Does anybody have ideas if this means the HD has actually died or not?
> > > 
> > > It's most probably dead.
> > 
> > 
> > Just discover the air con where the server is situated had conked out.
> > Estimated room temp was 30C (in UK where outside air temp is about 7C
> > today) so I suspect that may be a contributing factor. I'm crossing my
> > fingers that once the room etc has cooled down that the HD will work
> > again...
> > 
> 
> aha, close shave, now room cooler the HD in question works again. smartd
> now installed and running ;)

I'd still backup and replace that drive... The old trick of putting a
HD in the freezer to get another boot out of it only works for so long
and it sounds like that drive is headed down that path.

A

Attachment: signature.asc
Description: Digital signature


Reply to: