Re: Single root filesystem evilness decreasing in 2010? (on workstations) [LONG]

To: thib <thib@stammed.net>
Cc: debian-user@lists.debian.org
Subject: Re: Single root filesystem evilness decreasing in 2010? (on workstations) [LONG]
From: Robert Brockway <robert@timetraveller.org>
Date: Tue, 9 Mar 2010 13:17:00 -0500 (EST)
Message-id: <alpine.DEB.1.10.1003091250110.18309@castor.opentrend.net>
In-reply-to: <4B8FBA15.9070508@stammed.net>
References: <4B89D293.7050007@stammed.net> <alpine.DEB.1.10.1003031024010.7759@castor.opentrend.net> <4B8F1B51.5090009@stammed.net> <alpine.DEB.1.10.1003032146410.7759@castor.opentrend.net> <4B8FBA15.9070508@stammed.net>

On Thu, 4 Mar 2010, thib wrote:

OTOH - I haven't studied XFS - but from the little overviews I read about
it, I suppose its allocation groups are a way to scale with this problem
(along with other unrelated advantages like parallelism in multithreaded
environments).  What happens if a filesystem doesn't have anything like it?

Filesystems will hit scale problems at some point. As you note, AGs in XFS help it to scale a lot, but you do need to be careful in selecting the number. Too many and you can become CPU bound.

Maybe no-one cares because we currently don't have filesystems big enough to
actually see the problem?


Some people definitely do.

I agree with that, but I know it's because I, personally, *need* to know
what's going on, all the time.  Some people are OK with letting a program
(even such a critical one) do some magic;  and without having tested any
"complex" one, I suspect they try to KIS for the user.

The problem is that if a backup system breaks you get to keep both pieces :) Failing to understand your backup system and how you can do DR in the worst case is a serious risk.

The problem is, if there's a problem with the backup system itself, then
it's going to be a long night.  If there's no need for such software, I,
again, agree, there's no use to take risks, even if they're minimal.

Amanda is a good example. It keeps backup state information at the beginning of the tapes and allows the information to be dumped to a text file easily. I have done a 10TB SAN DR with Amanda and used printed-out pages of the tape state information to guide me. It was relatively painless considering the amount of data I was bringing back.

Considering your experience, I have to believe you;  we can always backup
very simply, even very large systems.  It's just weird to picture, all these
complex backup systems would be useless?  (I know, it's not a binary answer,
but you know what I mean.)

I'm not saying they are useless, but I think organisations do need to take more time considering DR. Large organisations will have fully operational DR sites, and they can afford to run a database for their backup system since they can expect at least one of their sites to be operational at any given time.

I have known people who run a copy of the backup DB on a laptop which is supposedly kept offsite. These laptops likely come on site occasionally, and they are a prime candidate for bitrot.


Anything that gets between me and data restoration makes me nervous :)

And for those people who think that off-site/off-line backups aren't needed anymore because you can just replicate data across the network, I'll give you 5 minutes to find the flaw in that plan :)

I guess I'm perfectly OK with that, but are we still talking about
workstations?  :-)

I'm talking about servers. There is no substitute for offsite/offline backups and there never will be. This is one of the few topics where I will use absolute statements like this.

You can never predict the nature of the failure. If you try to figure out how a failure will occur then you will sooner or later run into a failure of imagination.

The only way to guarantee against a single disaster of a certain size is to physically separate the data stores by a sufficient distance and keep the backups offline.

No technology can change this fundamental truth since our understanding of the possible failure modes will always be incomplete.

My understanding is that the "cached" column of the output of free(1) is the
sum of all pages, clean and dirty.  The "buffers" column would be the

Right. It might be nice if free did display them separately. It would confuse people less then :) /proc certainly presents the info. Check out the source of 'free' - it is a really simple application.
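
For anyone who wants to see the numbers split out, here is a minimal Python sketch (not taken from the source of 'free') that reads /proc/meminfo and reports the Buffers, Cached, Dirty and Writeback fields side by side. The function names are made up for illustration; the field names are the ones the Linux kernel exposes in /proc/meminfo, with values in kB. It deliberately does not try to compute a "clean" figure, since Dirty covers dirty pages regardless of which of the other columns they are counted in.

```python
def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of field name -> integer value."""
    fields = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        if not sep:
            continue  # skip anything that is not "Name: value [unit]"
        parts = rest.split()
        if parts and parts[0].isdigit():
            fields[name.strip()] = int(parts[0])
    return fields


def page_cache_breakdown(fields):
    """Pick out the fields that 'free' folds together or does not show at all."""
    return {
        "buffers_kb": fields.get("Buffers", 0),
        "cached_kb": fields.get("Cached", 0),
        "dirty_kb": fields.get("Dirty", 0),
        "writeback_kb": fields.get("Writeback", 0),
    }


if __name__ == "__main__":
    # Linux only: /proc/meminfo does not exist on other systems.
    with open("/proc/meminfo") as f:
        breakdown = page_cache_breakdown(parse_meminfo(f.read()))
    for key, value in breakdown.items():
        print(f"{key:>14}: {value}")
```

Running it a few times while copying a large file should show the dirty figure rise and then fall again as pages are written back.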

Since there's no "cached" column for the swapspace, I guess no clean page
gets pushed there, although it could be useful if that space is on a
significantly faster volume.  Anyway, the "used" column should be the total,
actual swapspace used, so your comment kind of confuses me.  Am I really
wrong here?

I'd recommend doing some reading. The cached system memory and the swap space displayed by free are really unrelated concepts (at least at the level we're talking about here).

If you want to chat on IRC about fun subjects like caching and swap space sometime you can find me as Solver on Freenode & OFTC.


Cheers,

Rob

--
Email: robert@timetraveller.org
IRC: Solver
Web: http://www.practicalsysadmin.com
I tried to change the world but they had a no-return policy
