
Re: Homebuilt NAS Advice



On 8/7/20 6:24 PM, Leslie Rhorer wrote:
On 8/7/2020 6:23 PM, David Christensen wrote:
??Filesystem?????????? Size?? Used Avail Use% Mounted on

Your editor seems to be replacing spaces with two question marks apiece (?). Please disable that feature if you can.


The NAS array

Now it is 8 x 8 + 8

The backup system array is 8 @ 8 TB data drives and 1 @ 8 TB hot spare

I assume you filled your 16 drive rack with 8 TB drives (?). Is there a reason why you did not use a smaller number of larger drives, partially fill the rack, and leave open bays for future expansion and/or additional servers?


I don't feel a need for LVM on the data arrays.  I use the entire, unpartitioned drive for /RAID.

I was leading in to LVM's ability to add capacity, but you seem to have solved this with mdadm (see below).


Are you concerned [about bit rot]?

    Yes.  I have routines that compare the data on the main array and the backup array via checksum.  When needed, the backups supply a third vote.  The odds of two bits flipping at the very same spot are astronomically low.  There has been some bit rot, but so far it has been manageable.

I had similar experiences and used similar methods in the past. BSD's mtree(8) is built for this purpose, but lacks a cache. The Debian version is behind FreeBSD (even when built from Sid source) and lacks key features. I resorted to writing a Perl script with caching. ZFS and replication made all of that unnecessary.
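
For anyone who wants the general mechanism without mtree or a custom script, a pair of checksum manifests plus diff gets most of the way there.  This is only a sketch: /RAID is the mount point from earlier in the thread, /Backup and the /tmp paths are made up, and it re-reads every byte on every run, which is exactly the cost the caching in my Perl script avoids:

    # Build one manifest per array; relative paths make the two comparable.
    ( cd /RAID   && find . -type f -print0 | sort -z | xargs -0 sha256sum ) > /tmp/main.sums
    ( cd /Backup && find . -type f -print0 | sort -z | xargs -0 sha256sum ) > /tmp/backup.sums

    # Differing checksums and missing files show up here; these are the paths
    # that would need a third vote from the archives.
    diff /tmp/main.sums /tmp/backup.sums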


To add a drive:

`mdadm /dev/md0 --add /dev/sdX`
`mdadm -v /dev/md0 --grow --raid-devices=Y`

    Note if an internal bitmap is set, it must be removed prior to growing the array.  It can be added back once the grow operation is complete.

    To increase the drive size, replace any smaller drives with larger drives one at a time:

`mdadm /dev/md0 --add /dev/sdX`
`mdadm /dev/md0 --fail /dev/sdY`

    Once all the drives are larger than the current device size used by the array:

`mdadm /dev/md0 --grow --size=max`

Nice.  :-)
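
If I am reading mdadm(8) correctly, the bitmap step you mention would look something like this, before and after the grow (same /dev/md0 as above):

`mdadm /dev/md0 --grow --bitmap=none`
`mdadm /dev/md0 --grow --bitmap=internal`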


Have you considered putting the additional files on another server that is not backed up, only archived?

    They should no longer be needed.  Once I confirm that (in a few minutes from now, actually), they will be deleted.  If any of the files in question turn out to be necessary, I will do that very thing.

If DAR maintains a catalog of archive media and the files they contain, this would facilitate a data retention policy of "some files only exist on archive media".
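
It looks like dar_manager(1) is the piece that does this.  A rough sketch, from my reading of the manual page rather than from experience, with all of the names made up:

    # Create the catalog once, then register each archive set as it is written.
    dar_manager -C /root/archive_catalog.dmd
    dar_manager -B /root/archive_catalog.dmd -A /mnt/archive/set_2020-08/full

    # Later: list the registered archives, or pull the newest copy of a file.
    dar_manager -B /root/archive_catalog.dmd -l
    dar_manager -B /root/archive_catalog.dmd -f path/to/some/file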


22E+12 bytes in 2.8 days is ~90 MB/s.  That is a fraction of 4 Gbps and an even smaller fraction of 10 Gbps.  Have you identified the bottleneck?

That should have been about 15 hours or so.  The transfer rate for a large file is close to 4 Gbps, which is about the best I would expect from this hardware.  It's good enough.

22E+12 bytes in 15 hours is ~408 MB/s.  That makes more sense.


Are you using hot-swap for the archive drives?

    Yes on the hot swap.  I just use a little eSATA docking station attached to an eSATA port on the motherboard.  'Definitely a poor man's solution.

My 2011 desktop motherboard with dual eSATA ports (150 MB/s?) gives very satisfactory performance.


If you have two HDD hot-swap bays, can DAR leap-frog destination media?

    I believe it can, yes.  A script to handle that should be pretty simple.  I have never done so.

The script I use right now pauses and waits for the user to replace the drive and press <Enter>.  It would be trivial to have the script continue with a different device ID instead of pausing.  Iterating through a list of IDs is hardly any more difficult.

     Hmm.  You have given me an idea.  Thanks!

YW. :-)  Let us know if you can reduce the time to create an archive set.
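
Something along these lines is what I had in mind.  It is only a sketch: the device names, the mount point, and the dar step are all placeholders, and the real script would carry your slice sizes, compression, and catalog options:

    #!/bin/sh
    # Step through a list of hot-swap bays instead of pausing for a drive swap.
    set -e
    ARCHIVE_DEVICES="/dev/disk/by-id/ata-ARCHIVE1-part1 /dev/disk/by-id/ata-ARCHIVE2-part1"

    for dev in $ARCHIVE_DEVICES; do
        mount "$dev" /mnt/archive
        # ... write this portion of the archive set to /mnt/archive with dar ...
        umount /mnt/archive
    done
    echo "Device list exhausted; swap drives and re-run, or fall back to the prompt."

If I remember dar's options correctly, its pause/execute-between-slices hooks (-p and -E) would be the natural place to plug the drive switching into a single dar run, rather than wrapping dar in a loop.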


If you have many HDD hot-swap bays, can DAR write in parallel?  With leap-frog?

    No, I don't think so, at least not in general.  I suppose one could create a front-end process which divides up the source and passes the individual chunks to multiple DAR processes.  A Python script should be able to handle it pretty well.

I have pondered writing a script to read a directory and create a set of hard link trees, each tree of size N bytes or less; filtered, sorted, and grouped by configurable parameters. If anyone knows of such a utility, please reply.
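
A rough sketch of the shape it might take.  Everything here is an example: the paths, the size limit, GNU find/stat, files grouped in path order only, and the staging directory has to live on the same filesystem as the source for hard links to work at all:

    #!/bin/sh
    # Split the files under $SRC into hard-link trees $DST/group_1, group_2, ...
    # of at most $MAX bytes each.  Assumes no newlines in file names.
    SRC=/RAID
    DST=/RAID/.staging
    MAX=7500000000000            # ~7.5 TB per group, to fit an 8 TB archive drive

    n=1; used=0
    find "$SRC" -path "$DST" -prune -o -type f -print | sort |
    while IFS= read -r path; do
        size=$(stat -c %s "$path")
        if [ "$used" -gt 0 ] && [ $((used + size)) -gt "$MAX" ]; then
            n=$((n + 1)); used=0
        fi
        rel=${path#"$SRC"/}
        mkdir -p "$DST/group_$n/$(dirname "$rel")"
        ln "$path" "$DST/group_$n/$rel"    # hard link: no extra space consumed
        used=$((used + size))
    done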


In my experience, HDDs that are stored for long periods have the bad habit of failing within hours of being put back into service.  Does this concern you?

    No, not really.  If a target drive fails during a backup, I can just isolate the existing portion and then start a new backup on the isolate.  A failed drive during a restore could be a bitch, but that's pretty unlikely.  Something like dd_rescue could be a great help.

As I understand ddrescue, it is designed for multiple copies of some content (e.g. a file or a raw device) that were originally identical, each copy was damaged in a different area, and none of the damaged areas overlap. ddrescue can then scan all the copies, identify the undamaged areas, and assemble a correct version.
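
In command form, my understanding is that the merge amounts to running ddrescue over each copy with the same output file and map file; the shared map is what lets later passes fill in only the still-missing areas.  Device names here are hypothetical:

    # First damaged copy.
    ddrescue /dev/sdX rescued.img rescued.map
    # Second damaged copy of the same content; same output and map file.
    ddrescue /dev/sdY rescued.img rescued.map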


As I understand DAR, it uses a specialized binary format with compression, hashing, encryption, etc.  If you burn one archive media set using DAR, retain a few previous archive media sets, later need to do a restore, and one drive from the most recent archive media set is bad, I am uncertain whether ddrescue will be of any help.


What is your data destruction policy?

    You mean for live data?  I don't have one.  Do you mean for the backups?  There is no formal one.

Likewise.  It's a conundrum.


For an enterprise system, ZFS is the top contender, in my book.  These are for my own use, and my business is small, however.  If I ever get to the point where I have more than 10 employees, I will no doubt switch to ZFS.

    Let me put it this way: if a business has the need for a separate IT manager, his filesystem of choice for the file server(s) is pretty much without question ZFS.  For a small business or for personal use the learning curve may be a bit more than the non-IT user might want to tackle.

    Or not.  I certainly would not discourage anyone who wants to take on the challenge.

Migrating my SOHO servers from Linux, md, LVM, ext4, and btrfs to FreeBSD and ZFS has been a non-trivial undertaking. I've learned a lot and I think my data is better protected, but I still have more work to do for disaster preparedness. You have an order of magnitude more data, backups, and archives than I do. If and when you decide to try ZFS, I suggest that you break off a piece and work with that.
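
For a concrete first bite, "break off a piece" can be as small as two spare drives in a mirrored test pool; the pool, dataset, and host names below are all made up:

    # Mirrored test pool from two spare drives (FreeBSD device names shown).
    zpool create testpool mirror /dev/ada4 /dev/ada5
    zfs create testpool/scratch

    # Snapshot it and try the send/receive replication workflow.
    zfs snapshot testpool/scratch@first
    zfs send testpool/scratch@first | ssh backuphost zfs receive -u backuppool/scratch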


David

