Re: Single root filesystem evilness decreasing in 2010? (on workstations) [LONG]

To: debian-user@lists.debian.org
Subject: Re: Single root filesystem evilness decreasing in 2010? (on workstations) [LONG]
From: Robert Brockway <robert@timetraveller.org>
Date: Wed, 3 Mar 2010 22:21:48 -0500 (EST)
Message-id: <alpine.DEB.1.10.1003032146410.7759@castor.opentrend.net>
In-reply-to: <4B8F1B51.5090009@stammed.net>
References: <4B89D293.7050007@stammed.net> <alpine.DEB.1.10.1003031024010.7759@castor.opentrend.net> <4B8F1B51.5090009@stammed.net>

On Thu, 4 Mar 2010, thib wrote:

If restore speed is really that critical, it should still be possible to generate an image without including the free space - I know virtualization techs are doing it just fine for most filesystems.
Maybe we misunderstood each other - saw a different problem.

Possibly. I didn't mean to suggest that dd was a good way to back up. I think it is a terrible way to back up[1]. I was talking about dump utilities. I started using dump on Solaris in the mid 90s and really like the approach to backing up that dump utilities offer. On Linux I use xfs a lot and back up with xfsdump in many cases.

[1] A long time ago I used to use it to back up MS-Windows systems from Linux but disks grew so much it became infeasible.
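As a rough illustration of the dump-level approach mentioned above, the sketch below only builds xfsdump command lines; it does not run them. The mount point, destination paths and labels are hypothetical, and the helper name `xfsdump_command` is made up for this example. It assumes the usual xfsdump options: `-l` for the dump level, `-f` for the destination, and `-L`/`-M` for the session and media labels.

```python
import shlex


def xfsdump_command(mount_point, dest_file, level=0, label="backup"):
    """Build (but do not run) an xfsdump command line.

    Level 0 is a full dump; levels 1-9 are incrementals relative to the
    most recent dump taken at a lower level.
    """
    if not 0 <= level <= 9:
        raise ValueError("dump level must be between 0 and 9")
    return [
        "xfsdump",
        "-l", str(level),  # dump level
        "-L", label,       # session label (avoids an interactive prompt)
        "-M", label,       # media label (avoids an interactive prompt)
        "-f", dest_file,   # destination file or device
        mount_point,       # mounted XFS filesystem to dump
    ]


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    full = xfsdump_command("/home", "/backup/home.0.xfsdump", 0, "home-full")
    incr = xfsdump_command("/home", "/backup/home.1.xfsdump", 1, "home-incr")
    print(shlex.join(full))
    print(shlex.join(incr))
```

A weekly level 0 with daily level 1 dumps is one common pattern; the right schedule depends on how much data changes and how quickly a restore has to finish.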

I recommend backing up all system binaries. It's the only way you can guarantee you will get back to the same system you had before the rebuild. This is most important for servers where even small behavioural changes can impact the system in a big way.
So you don't trust Debian stable to be stable?  :-)

Actually I'd say Debian is best-of-breed when it comes to backporting security patches to retain consistent functionality. Having said that, system binaries represent an ever-decreasing proportion of total data on a computer system. When I first started with Linux the OS took up about 80% of the available disk space that I had. Today I'd be generous if I said it took up 2%. So even if there is an alternative, backing them up now is hardly onerous and improves the chances of a successful disaster recovery. I cover this more in the backup talk.

Thanks a lot; that's a talk full of useful checklists. I'll definitely eat your wiki pages when I have the time.


Great.  I'm gradually adding more and more info to the site.

While this may be a problem now I think it will be less of a problem in the future as some filesystems already allow you to add i-nodes dynamically and this will increasingly be the case.
I'm not sure I follow you, but that sounds cool.  Could you elaborate?

Sure. GPFS (a commercial filesystem available for Linux) allows for the addition of i-nodes dynamically. We can expect more and more dynamic changes to filesystems as the science advances.

I once nearly ran out of i-nodes on a 20TB GPFS filesystem on a SAN. Being able to dynamically add i-nodes was a huge relief. I didn't even need to unmount the filesystem.
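Running out of i-nodes is easy to miss because ordinary disk-space checks do not show it. The sketch below is one way to keep an eye on it, roughly what `df -i` reports. It uses `os.statvfs` from the Python standard library; the helper name `inode_usage` is made up for this example.

```python
import os


def inode_usage(path="/"):
    """Return (total, free, percent_used) i-nodes for the filesystem at path.

    percent_used is None when the filesystem reports no fixed i-node count,
    which some filesystems that allocate i-nodes dynamically do.
    """
    st = os.statvfs(path)
    total, free = st.f_files, st.f_ffree
    if total == 0:
        return total, free, None
    return total, free, 100.0 * (total - free) / total


if __name__ == "__main__":
    total, free, pct = inode_usage("/")
    if pct is None:
        print("filesystem does not report a fixed i-node count")
    else:
        print(f"i-nodes: {total} total, {free} free, {pct:.1f}% used")
```

Alerting well before 100% leaves time to react on filesystems where the i-node count cannot be grown after creation.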

Anyway, my preference isn't based on my own experience so I'm not actually using anything like that, but I'm willing to look at and try fsarchiver and see if it can really beat simple ad-hoc scripts for my needs. Or something heavier, just for fun (Bacula?).

I'm fairly particular about backup systems. I think most people who design backup systems have never done a DR in the real world.

I seem to end up having to do at least one large scale DR per year. I've done two in the last month. I've done several DRs in the multi-TB range.

Virtually every DR I've done has a hardware fault as the underlying cause. In several cases multiple (supposedly independent) systems failed simultaneously.

The core of any DR plan is the KISS principle. There's a good chance that the poor guy doing the DR is doing it at 3am so the instructions need to be simple to reduce the chance of errors.

If the backup solution requires me to have a working DB just to extract data or wants me to install an OS and the app before I can get rolling then I view it with extreme suspicion.

And for those people who think that off-site/off-line backups aren't needed anymore because you can just replicate data across the network, I'll give you 5 minutes to find the flaw in that plan :)

Ah but they are. Cache pages may be clean or dirty. Your disk cache may be full of clean cache pages, which is just fine.
Am I interpreting the output of free(1) the wrong way?


Sort of :)

Free is telling you the total memory in disk cache. Any given page in the cache may be 'dirty' or 'clean'. A dirty page has not yet been written to disk. New pages start out dirty. Within about 30 seconds (varies by filesystem and other factors) the page is written to disk. The page in the cache is now clean.

Unless your system is writing heavily most pages in the cache are likely to be clean.

The difference is that clean pages can be dumped instantly to reclaim the memory. Dirty pages must be flushed to disk before they can be reclaimed. Using clean pages allows fast read access from the cache without the risk of not having committed the data. I describe this as 'having your cake and eating it too'[2].
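To see how much of the cache is dirty at a given moment, one option on Linux is to read /proc/meminfo, which has `Cached`, `Dirty` and `Writeback` fields (all in kB). The sketch below assumes those field names are present; the function names are made up for this example. Note that `Dirty` is a system-wide figure rather than a strict subset of `Cached`, so treat the comparison as a rough indication only.

```python
import os


def parse_meminfo(text):
    """Parse /proc/meminfo-style text into a dict of {field: value}."""
    info = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            info[name.strip()] = int(parts[0])
    return info


def cache_summary(info):
    """Return (cached_kb, dirty_kb, writeback_kb) from a parsed meminfo dict."""
    return info.get("Cached", 0), info.get("Dirty", 0), info.get("Writeback", 0)


if __name__ == "__main__":
    path = "/proc/meminfo"  # Linux only
    if os.path.exists(path):
        with open(path) as f:
            cached, dirty, writeback = cache_summary(parse_meminfo(f.read()))
        print(f"page cache:      {cached} kB")
        print(f"dirty:           {dirty} kB (not yet written to disk)")
        print(f"under writeback: {writeback} kB (being written now)")
```

On a mostly idle system the dirty figure is typically tiny compared with the cache as a whole, which matches the point above that most cached pages are clean.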


More info can be found here:

http://en.wikipedia.org/wiki/Page_cache

[2] Paraphrase of English language saying.

 cay:~$ free -o
              total       used       free     shared    buffers     cached
 Mem:       3116748    3029124      87624          0     721500    1548628
 Swap:      3145720        800    3144920
To me, looks like only 800KiB are actually swapped (uptime 11d) - don't know how I can see what type of data it is. Is that irrelevant?

I consider it irrelevant as a sysadmin. I'm purely interested in whether the system has sufficient swap or is swapping too much.
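The arithmetic behind reading the free output quoted above can be sketched as follows. The numbers are copied from that output; the function name and the dictionary keys are made up for this example, and the parser assumes the old `free -o` column layout shown here (newer versions of free lay the columns out differently). The "reclaimable" figure is an approximation, since any dirty pages in it would need flushing first.

```python
FREE_OUTPUT = """\
             total       used       free     shared    buffers     cached
Mem:       3116748    3029124      87624          0     721500    1548628
Swap:      3145720        800    3144920
"""


def summarize_free(text):
    """Summarize old-style `free -o` output (values in KiB)."""
    rows = {}
    for line in text.splitlines():
        label, sep, rest = line.partition(":")
        if sep:  # the header line has no colon and is skipped
            rows[label.strip()] = [int(v) for v in rest.split()]
    total, used, free, _shared, buffers, cached = rows["Mem"]
    swap_total, swap_used, _swap_free = rows["Swap"]
    return {
        # Memory held by programs once buffers and cache are excluded.
        "used_by_programs": used - buffers - cached,
        # Memory that is free or could be reclaimed from buffers and cache.
        "free_or_reclaimable": free + buffers + cached,
        "swap_used_pct": 100.0 * swap_used / swap_total if swap_total else 0.0,
    }


if __name__ == "__main__":
    for key, value in summarize_free(FREE_OUTPUT).items():
        print(f"{key}: {value}")
```

With these figures only about a quarter of RAM is held by programs, and swap use is a small fraction of one percent, so this machine is neither short of swap nor swapping heavily.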


Cheers,

Rob

--
Email: robert@timetraveller.org
IRC: Solver
Web: http://www.practicalsysadmin.com
I tried to change the world but they had a no-return policy
