Re: Bug with soft raid?

To: debian-user@lists.debian.org
Subject: Re: Bug with soft raid?
From: David Christensen <dpchrist@holgerdanske.com>
Date: Tue, 12 Feb 2019 12:48:40 -0800
Message-id: <[🔎] ce6c44fb-13e7-0157-5aa1-e91d7a6c880b@holgerdanske.com>
In-reply-to: <[🔎] LYYCey2--3-1@tuta.io>
References: <[🔎] 20190212110800.5m4puaipkozbmgjc@maison.mrs> <[🔎] LYYCey2--3-1@tuta.io>

On 2/12/19 11:37 AM, Tom Bachreier wrote:



Feb 12, 2019, 12:08 PM by dlist@bluewin.ch:

The system blocks for about 3 minutes and then I get back a hand on it.


I have a similar - maybe the same - problem in buster - see the thread
"Software RAID blocks" on this list about a month ago. Unfortunately
still no solution. :-(

I have the advantage that my system harddisk is outside the RAID on a
separate disk. Therefore I'm still able to send "low level" commands
like smartctl or fdisk to the disks in the array during the block. If
I trigger the right disk the block aborts immediately.

In each of my machines, I use a single 16 GB USB 3.0 flash drive, or asmall SDD, for the system drive. I then use btrfs for all file systems.It is my expectation that if a disk goes bad, the machine will log anerror and/or halt.

Maybe this works for you, too?
You can try:

for i in /dev/sd{b..f}; do echo "DISK: ${i}"; smartctl -l scterc "${i}"; sleep 3; done

Some drives allow you to adjust the Error Recovery Control timeout intheir firmware. You can use this to force the drive to return an errorpromptly, rather than spending minutes trying to recover (e.g. block for3 minutes):


https://en.wikipedia.org/wiki/Error_recovery_control

I had a Linux md RAID0 (mirror) built from two older desktop/ SOHOserver drives that supported scterc. So, I put commands like thefollowing, one per drive, into a script that was run at system startup:


    # /usr/sbin/smartctl -l scterc,70,70 /dev/disk/by-id/ata-XXX_YYY


    SCT Error Recovery Control set to:
               Read:     70 (7.0 seconds)
              Write:     70 (7.0 seconds)


David

Reply to:

Follow-Ups:
- Re: Bug with soft raid?
  - From: David Christensen <dpchrist@holgerdanske.com>

References:
- Bug with soft raid?
  - From: steve <dlist@bluewin.ch>
- Re: Bug with soft raid?
  - From: Tom Bachreier <mr.tom-mldu-20181127@tuta.io>

Prev by Date: Re: (Stuck! Fresh 9.6 install) iwlwifi-8625-26.ucode <- Can not find/what is it? Spot of help please?
Next by Date: Re: Bug with soft raid?
Previous by thread: Re: Bug with soft raid?
Next by thread: Re: Bug with soft raid?
Index(es):
- Date
- Thread