Hardware/Software RAID (nearly a religious war...)
I apologize to the list as I didn't mean to hijack the thread.
--On August 30, 2007 7:56:52 AM +0200 martin f krafft <madduck@debian.org> wrote:
>> MDRAID is also very difficult to administer, offering only
>> (depending on your version) mdadm or raid* tools.  mdadm is rather
>> arcane.  simple operations are not well documented, like, how do
>> i replace a failed drive?  or start a rebuild?
> Have you actually bothered to look into /usr/share/doc/mdadm?
You do have me there; I haven't looked since 3.1, when it contained just a 
few sparse notes. I had incorrectly assumed it hadn't changed as much as it 
clearly has.  Typically I make sure to double-check something before I say 
anything about it, and this is one of those times where I didn't.
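For the record, the replacement sequence in question is short once you know 
it.  A sketch, assuming the array is /dev/md0 and the failed member is 
/dev/sdb1 (device names are illustrative):

```shell
# Mark the member as failed (skip if the kernel already did),
# then remove it from the array.
mdadm --manage /dev/md0 --fail /dev/sdb1
mdadm --manage /dev/md0 --remove /dev/sdb1

# After physically swapping the disk and partitioning it to match,
# add the new member; the rebuild starts automatically.
mdadm --manage /dev/md0 --add /dev/sdb1

# Watch the rebuild progress.
cat /proc/mdstat
```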
>> there's no 'rebuild drive' it's completely NON automated either.
>> meaning it always takes user intervention to recover from any
>> failure.
> Not if you're using spares. But even then, yes, to pull a disk out
> and insert a new one, you need to shut down the machine, unless you
> have hotplugging drives. Same story for hardware RAID.
OK, granted.  Spare replacement is fully automated, as one would expect.  I 
think hotplug (at least on the drive side) is more or less mandatory in 
SATA, unless the drives don't support the SATA power connector.
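For completeness, attaching a hot spare to an existing array is a one-liner. 
A sketch, again with illustrative device names:

```shell
# Adding a device beyond the array's active member count makes it a
# spare; md promotes it and starts rebuilding automatically when an
# active member fails.
mdadm --manage /dev/md0 --add /dev/sdc1

# Verify it shows up marked (S) in the status line.
cat /proc/mdstat

# Spares can also be declared at creation time:
# mdadm --create /dev/md0 --level=1 --raid-devices=2 \
#       --spare-devices=1 /dev/sda1 /dev/sdb1 /dev/sdc1
```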
>> a single I/O error causes MDRAID to mark the element as failed.
>> it does not even bother to retry.
> And that's a feature. I've seen disk corruption where a block would
> return wrong data only in 1/10 reads. On retry, it would work, and
> the RAID would hide the problem from me. I'd much rather have
> a failed drive.
Would a patch be accepted that allowed a configurable retry?  The place 
where this has caused the most pain is on IDE/PATA and SATA drives that 
will occasionally fault a sector, then read it correctly on the next try. 
Granted, the condition needs to be reported, and the MD driver shouldn't 
persist in 'banging its head'.  Some form of automatic recovery would be 
nice; in the mirror case, something like read (fault), read (from the 
partner), write (back to the faulted area).  I know it gets really 
complicated, especially trying to avoid deadlocks.
>> MDRAID is also incapable of performing background patrolling
>> reads, something i think even 3Ware does.
> Wrong. It does this only once a month by default (on Debian; the
> mdadm sunday), but you could make it do that every hour.
Background patrolling reads are executed at low priority during I/O quiet 
times, often starting a new pass as soon as the previous one finishes.  EMC 
implements it a bit differently, by default doing patrolling reads on 
partitions only after an error is detected, though the arrays can be 
configured to patrol more often.  ICP GDT controllers behave similarly; I 
*think* a firmware update adds it to some ICP models as well, since some of 
the older GDTs didn't support patrolling reads.  The operation is 
essentially the same as a consistency check, just run more often.  It takes 
the large hardware arrays onsite about a week to finish a pass, and it has 
saved us from undetected failures a few times.
I was, and am, very glad to see the periodic consistency checks, though. 
That addresses one of our bigger complaints, the lack of patrolling; we 
haven't had much experience with how it behaves quite yet.  I presume it 
uses the same or a similar throttling mechanism that rebuilds have used for 
a while now.  That works really well; most of our machines don't notice 
rebuild I/O traffic at all.  The few where I have seen it be an issue have 
controllers with DMA disabled by default due to issues with the controller 
hardware, which is not MD's problem at all.
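For anyone curious, both the check and its throttle are exposed through 
sysfs and sysctl.  A sketch, assuming /dev/md0 (on Debian the monthly run 
is just a cron job invoking /usr/share/mdadm/checkarray):

```shell
# Kick off a consistency check by hand (what the monthly cron job does).
echo check > /sys/block/md0/md/sync_action

# Progress appears in /proc/mdstat; mismatches found so far are
# counted here.
cat /sys/block/md0/md/mismatch_cnt

# The same throttle rebuilds use: guaranteed and maximum KB/s per
# device for resync/check/rebuild activity.
sysctl dev.raid.speed_limit_min
sysctl dev.raid.speed_limit_max
```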
>> MDRAID RAID5 sets are non-bootable.
> grub2 can boot them.
Is anyone shipping it yet, though?  Etch still has GRUB 0.97.
> In many years of software RAID management and in two years as mdadm
> maintainer, I have never heard of a single case where md failed to
> correctly identify a failed drive.
I should have been clearer: my problem is that MD is over-zealous about 
marking drives as failed.  Which, as you've noted (and I knew, but still 
don't necessarily agree with, and clearly did a poor job of acknowledging), 
is intentional.
<...>
I in no way meant to disrespect you or your work.  I only meant to raise 
awareness of some of the issues one can experience with software-based 
RAID.  MD has improved, and continues to improve.  You have your camp, I 
have mine, and we disagree on what's best.  In some situations software 
RAID is best, in others hardware RAID is.  It's up to each operator to 
determine 'best' for themselves; in the end, I feel that's what Free and 
Open Source Software and Open Source OSes are about.