[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Backup ideas



Hello,

On Tue, Apr 28, 2020 at 11:43:22AM -0400, Default User wrote:
> Andy, you mentioned restic, which I am not familiar with.  Similar
> considerations would seem to apply to that also.  But I might also try
> that out later.

Note that you did not state any of these requirements for only using
tools you are familiar with, so we cannot really be blamed for
suggesting things just because we think they are good. You didn't
really state many requirements at all, only asked for pointers as to
what makes a good backup system, so only generalities were possible.
I did say there are many many different solutions.

> 4 - So I am still using rsync.  I back up a single system to a local
> external usb drive.  That drive is then backed up to another local
> external usb drive. So I do have some redundancy.  I could go even
> further, backing up to a third drive, and using the --checksum option.
> But that might be excessive, and I do have a life to live.

If you wish to continue using rsync then I would suggest looking at
rsnapshot. It uses rsync. It provides for incremental backups. It
stores things as the actual files which makes them easy to
investigate and restore.

When you come to features like deduplication and encryption either
the remote filesystem has to support it, or else the backup tool has
to support it. That is why so many of these solutions store data in
big blocks in their own format, so they can provide those features.

rsnapshot gives you some limited form of deduplication by hard
linking identical versions of each file together. It is far from
perfect but is better than no deduplication.

> Andy, you suggested having off-site backups.  You are correct that
> local-only backups do not protect against things like fire or theft.
> 
> But off-site backups require a place for them to be, the ongoing
> effort of getting them there (and back, in a timely manner, if
> needed), and some cost involved.  And the storage place is also
> subject to "disasters" , or just going out of business overnight.

The thing about cloud storage is that it is generic and it is easy.
Once you come up with a means to backup to an S3-compatible storage,
should your data disappear you can just point it at another
S3-compatible storage. Amazon's S3 was of course the original but
there are many competitors to Amazon now that use the same protocol.

Anyone using cloud ("someone else's computer") services needs to
think about what they will do if/when the cloud ("someone else's
computer") goes away. The usual answer is "switch everything over to
another cloud".

So the concern when using such a service usually isn't whether they
are going to go out of business, but more often about keeping
control of costs. At some point there may be a time where it's
cheaper to do yourself, or scale up the amount of responsibility you
take. For example, if you require many terabytes of storage then it
will be cheaper for you to rent a whole server from someone instead.
That server will almost certainly have a lower reliability than the
cloud storage vendor, but if it's only for backups then maybe that
is not as important to you as the cost saving.

I would not suggest having "the cloud" as one's ONLY backup of
important data. Multiple different clouds may be safe enough, though
the advantage of having a copy of the data local is large.

> On-line backups have the same problems, in addition to the fact that
> once your data is "out there", even if it is encrypted, it can be
> "cracked" by governments, corporations, or individuals, at their
> leisure.

Any organisation that wants to spend that amount of resources
accessing your data has already been in your home and cloned your
storage while you were out/asleep.

> I'm not sure my threat model really justifies off-site backups at this point.

Fair enough. Fire and theft seem pretty common to me, but it's your
call.

> 5 - You also mentioned encrypted backups.  Of course, you have to to
> trust the encryption methodology.  And it should not be such a hassle
> that you don't actually use it.

It is perfectly feasible to have it be entirely automated. I would
not recommend any kind of backup that's not automated!

> Finally, after well over seven hours into rsyncing (with no end in
> sight) from the first external usb drive to the second one (both are
> HDD), I am beginning to wonder if that is a good idea.  Those first
> full backups always take forever . . .

…and USB is a fairly awful way to use storage beyond personal
gadgets (camera, phone, etc)…

Cheers,
Andy

-- 
https://bitfolk.com/ -- No-nonsense VPS hosting


Reply to: