Re: thoughts on moving to shared storage for VM hosting
Hi Andy,
> Local disks in a RAID-10 is probably one of the most performant
> configurations so I have no expectation of greater performance, but
> obviously it needs to not totally suck in that regard.
Your best solution for this would be iSCSI, preferably over a
separate network.
> The immediate question then is how to do that. Take for example
> this disk box:
> http://www.span.com/catalog/product_info.php?products_id=4770
> Two of those could be used, each in RAID-10, exported by iSCSI and
> then software RAID-1 on the servers would allow for operation even
> in the face of the complete death of either disk box.
Sounds like a good plan, but I'd spend a lot of time testing what
happens when one of the machines goes down. Does the software RAID
detect this, and what happens to your performance when tens or even
hundreds of exported disks start resyncing?
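To get a feel for the scale involved, here's a rough back-of-envelope in Python. The 110 MB/s of usable 1GbE bandwidth and the fully serialised rebuild are my assumptions; real resyncs will interleave and throttle, but the total bytes still have to cross the wire:

```python
# Back-of-envelope: how long does a full re-mirror take when the
# rebuild traffic for many exported disks shares one 1GbE link?
# Assumed numbers (110 MB/s usable, 500 GB volumes), not measured.

def resync_hours(volumes, vol_size_gb=500, link_mb_per_s=110):
    """Total hours to re-mirror `volumes` exported 500 GB disks
    when all rebuild traffic shares a single link."""
    total_mb = volumes * vol_size_gb * 1024  # MB that must be copied
    return total_mb / link_mb_per_s / 3600

for n in (1, 12, 100):
    print(f"{n:3d} x 500G volumes: {resync_hours(n):6.1f} h")
```

Even one 500G volume is over an hour of saturated link; a hundred of them is measured in days, all while your VMs are competing for the same bandwidth.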
> The downside is that 75% of the raw capacity is gone. Does anyone
> have any feel for how much of a performance penalty would be
> incurred by configuring each one as say a RAID-50 (two 5-spindle
> RAID-5s, striped) in each with 2 hot spares and then software RAID-1
> on the servers?
I'd suggest choosing a platform that supports RAID-6 instead of
RAID-5, and use that, optionally with a hot spare. You might even skip
the hot spare, since you can lose up to two active disks in a RAID-6
array.
If you choose RAID-5 with a 12-disk array, sooner or later Murphy will
catch up with you. The chance of a second drive in your RAID-5 array
failing while it's doing a rebuild to the hot spare is larger than
you might think.
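To put a rough number on that, a toy model in Python. The 3% annualised failure rate and the 24-hour rebuild window are assumptions, and this ignores unrecoverable read errors during the rebuild, which only make things worse:

```python
# Rough odds of losing a second disk while a degraded RAID-5 rebuilds.
# Assumed inputs: 3% annualised failure rate (AFR), 24 h rebuild window;
# real AFRs vary by model and age, and rebuilds on busy arrays take longer.

def second_failure_prob(remaining_disks, afr=0.03, rebuild_hours=24):
    """P(at least one surviving disk fails during the rebuild), treating
    failures as independent with a constant per-hour rate."""
    p_hour = afr / (365 * 24)  # per-disk, per-hour failure probability
    p_all_survive = (1 - p_hour) ** (remaining_disks * rebuild_hours)
    return 1 - p_all_survive

# 12-disk box, one disk dead, 11 survivors feeding the rebuild:
print(f"{second_failure_prob(11):.4%}")
```

With these numbers it's roughly a tenth of a percent per rebuild event; multiply by the number of arrays you run and the rebuilds over the life of the system, and it stops looking negligible.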
> Given 12x500G disks in each box, this would result in
> (((12-2)/2)-1)x2x500G = 4T usable for 12T raw. The
> previously-mentioned RAID-10, RAID-1 configuration would result in
> (12-2)/2x500G = 2.5T usable for 12T raw. A straight up 10-disk
> RAID-5 on each disk box would give (12-2-1)x500G = 4.5T usable for
> 12T raw, but 10 spindles seems too big for a RAID-5 to me, plus
> RAID-5 write performance sucks and I understand -50 goes some way to
> mitigate that.
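For what it's worth, your capacity figures check out; a quick sanity check in Python, assuming the 12x500G-per-box, two-box, RAID-1-across-boxes layout you describe:

```python
# Usable capacity per layout: 12 x 500 GB disks per box, two boxes,
# software RAID-1 across the boxes, so usable == one box's capacity
# and raw == 12 TB total.
DISK_GB = 500

# RAID-50: 2 hot spares, two 5-spindle RAID-5s striped (4 data disks each):
raid50 = ((12 - 2) // 2 - 1) * 2 * DISK_GB   # 4000 GB = 4 TB
# RAID-10 with 2 hot spares (5 mirrored pairs):
raid10 = (12 - 2) // 2 * DISK_GB             # 2500 GB = 2.5 TB
# Single 10-disk RAID-5 with 2 hot spares (9 data disks):
raid5 = (12 - 2 - 1) * DISK_GB               # 4500 GB = 4.5 TB

print(raid50, raid10, raid5)  # 4000 2500 4500
```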
The disk statistics you're gathering will help a lot in evaluating
your future storage platform. /proc/diskstats will give you a lot of
info on this. If you use Cacti for monitoring trends on your servers, I
can send you some scripts that will help you graph disk I/O, both in
megabytes per second and in I/O operations per second.
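In the meantime, here's a minimal sketch of what such scripts do, assuming the 2.6-kernel /proc/diskstats field layout (sector counts are always in 512-byte units, regardless of the device's real sector size):

```python
# Minimal sketch: per-device throughput and IOPS from /proc/diskstats.
import time

def parse_diskstats(lines):
    """Map device name -> (reads, sectors_read, writes, sectors_written)."""
    stats = {}
    for line in lines:
        f = line.split()
        if len(f) < 10:
            continue
        # Fields: major minor name reads reads_merged sectors_read ms_reading
        #         writes writes_merged sectors_written ...
        stats[f[2]] = (int(f[3]), int(f[5]), int(f[7]), int(f[9]))
    return stats

def rates(dev, interval=5):
    """(read MB/s, write MB/s, IOPS) for `dev`, sampled over `interval` s."""
    with open("/proc/diskstats") as fh:
        a = parse_diskstats(fh)[dev]
    time.sleep(interval)
    with open("/proc/diskstats") as fh:
        b = parse_diskstats(fh)[dev]
    dr, dsr, dw, dsw = (y - x for x, y in zip(a, b))
    return (dsr * 512 / 1e6 / interval,
            dsw * 512 / 1e6 / interval,
            (dr + dw) / interval)
```

Feed the three numbers to Cacti (or any RRD tool) and you get exactly the MB/s and IOPS trends you'll want when sizing the new storage.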
> A crazy idea would be to set both disk boxes up as JBOD and export
> all 24 disks out, handling all the redundancy on the servers using
> MD. That really does sound crazy and hard to manage though!
> As for the server end, is software RAID of iSCSI exports the right
> choice here? Would I be better off doing multipath?
As I said before, test this well. You might also look at handling the
replication and failover on the storage servers instead of the
clients; you'll need to build your own storage boxes for that, and
invest a lot of time in testing the failover scenarios, but it'll be
easier to manage.
You can run RAID-1 over NBD on your storage servers themselves, or use
DRBD to handle the synchronization.
> My next concern is iSCSI. I've not yet played with that in Debian.
> How usable is it in Debian Etch, assuming commodity hardware and a
> dedicated 1GbE network with jumbo frames? Would I be better off
> building my own Linux-based disk box and going with AoE or NBD? The
> downside is needing to buy something like two of:
> http://www.span.com/catalog/product_info.php?cPath=18_711_2401&products_id=15975
> plus two storage servers with SAS to export out AoE or NBD.
You don't have to use external storage; there are lots of
manufacturers that offer servers with 12 or 16 hot-swappable disks.
Supermicro comes to mind: http://www.supermicro.com/products/chassis/3U/836/SC836TQ-R800.cfm
There are a couple of other suppliers, it shouldn't be too hard to
find one in the UK.
Regards,
Maarten