
Re: Single root filesystem evilness decreasing in 2010? (on workstations)



Robert Brockway wrote:
> [...]
> Some filesystems such as XFS & ZFS allow you to effectively set quotas on parts of the filesystem. I think we'll see this becoming more common. This takes away a big part of the need for multiple filesystems.

This is a neat feature indeed. And you're right; apparently, work is being done on ext4.

  http://lwn.net/Articles/373513/
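
For illustration, here's roughly what that looks like today - pool, project names and paths below are made-up examples, and the XFS filesystem has to be mounted with the prjquota option:

  # ZFS: cap a dataset at 10 GB
  zfs set quota=10G tank/home

  # XFS: project quotas can cap a directory tree; map project 42
  # to /srv/www, then limit it
  echo "42:/srv/www" >> /etc/projects
  echo "www:42" >> /etc/projid
  xfs_quota -x -c 'project -s www' /srv
  xfs_quota -x -c 'limit -p bhard=10g www' /srv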

>> * Specific mount options
>> [...]

> This is a good point. I actually hadn't considered this in my list. I'll respond by saying that, in general, the mount options I use for different filesystems on the same box do not vary much (or at all) in practice.

I've just discovered bindfs [1], a FUSE-based virtual filesystem, which might partially answer this problem. It looks quite nice, simple and flexible, but obviously won't be able to enable optimizations like noatime. I don't know about the possible overhead though - at that point, one might want to go with "true" access control systems instead.

[1] http://code.google.com/p/bindfs/
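
A minimal sketch of what I have in mind (user and paths are made up; options as I understand the bindfs docs):

  # expose /srv/data under alice's home, with every file
  # appearing to be owned by alice
  bindfs -u alice -g alice /srv/data /home/alice/data

  # or a read-only view of the same tree
  bindfs -r /srv/data /home/alice/ro-data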

> If I want a filesystem marked noatime then I probably want
> all the filesystems marked noatime.  There are exceptions to this of
> course.

Yep, like giving relatime to a filesystem containing mboxes or something like that (mail readers still compare atime and mtime to detect new mail). But it's true, yes, access times are becoming less and less useful, and I can't think of another real problem (that isn't answered by access control systems) besides that one.
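
For onlookers: the per-filesystem tuning we're talking about is just one field in fstab anyway (devices and mount points are examples):

  # /etc/fstab - noatime everywhere except the mail spool
  /dev/sda1  /          ext3  defaults,noatime   0  1
  /dev/sda2  /var/mail  ext3  defaults,relatime  0  2

  # or, on a live system:
  mount -o remount,relatime /var/mail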

>> * System software replacement
>>
>> Easier to reinstall the system if it's on separate volumes from conf and data? Come on..

> That's true but the time savings are not terribly great IMHO. The system can be backing up and restoring the data while the human is off doing other stuff. Saves computer time (cheap) but not human time (expensive).

Either way, there's software to automate and abstract it all. I think the real question is processing vs storage resources; the human time spent is the same either way.

The only reason I saw for doing inflexible volume imaging for backups is to avoid the filesystem formatting, file unpacking and copying operations when restoring, which are theoretically slower than copying a volume byte-by-byte. "Whatever".

If restore speed is really that critical, it should still be possible to generate an image without including the free space - I know virtualization technologies do it just fine for most filesystems.
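
partclone, for instance, should do exactly that - copy only the used blocks (device and file names are examples; I haven't benchmarked it):

  # back up only the allocated blocks of an ext3 filesystem
  partclone.ext3 -c -s /dev/sda2 -o sda2.img

  # restore it later
  partclone.ext3 -r -s sda2.img -o /dev/sda2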

Maybe we misunderstood each other and each saw a different problem.

> [...]
> I recommend backing up all system binaries. It's the only way you can guarantee you will get back to the same system you had before the rebuild. This is most important for servers where even small behavioural changes can impact the system in a big way.

So you don't trust Debian stable to be stable?  :-)

> See this link for my talk on backups, which goes into this issue further:
>
> http://www.timetraveller.org/talks/backup_talk.pdf
>
> All the info in this talk is being transferred to http://www.practicalsysadmin.com.

Thanks a lot; that's a talk full of useful checklists. I'll definitely devour your wiki pages when I have the time.

>> [...]
>> * Metadata (i-node) table sizes

> While this may be a problem now, I think it will be less of a problem in the future, as some filesystems already allow you to add i-nodes dynamically, and this will increasingly be the case.

I'm not sure I follow you, but that sounds cool.  Could you elaborate?
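
For context, my concern came from the ext family, where the i-node count is fixed when the filesystem is created (device and ratio below are only examples):

  # how many i-nodes are used/free per filesystem?
  df -i

  # ext2/3: the i-node count is set at mkfs time - e.g. one i-node
  # per 16 KiB of space - and cannot be grown afterwards
  mke2fs -i 16384 /dev/sda1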

>> * Block/Volume level operations (dm-crypt, backup, ...)
>> [...]
>> As said earlier, I don't need a fast backup solution. I already prefer smarter filesystem-based backup systems in general.

> As do I. What do you use? If you want to use dump with ext2/3/4 you will need to snapshot for data safety.

Actually I would think dump is a fast but "dumb" solution (much like partimage). And yep, I know, LVM2 is just great for that.

Anyway, my preference isn't based on my own experience, so I'm not actually using anything like that, but I'm willing to try fsarchiver and see if it can really beat simple ad-hoc scripts for my needs. Or something heavier, just for fun (Bacula?).
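
For the archives, the snapshot dance you describe would look something like this, if I read the man pages right (VG/LV names and sizes are examples):

  # create a 1 GB copy-on-write snapshot of the root LV
  lvcreate --size 1G --snapshot --name rootsnap /dev/vg0/root

  # dump level 0 from the frozen snapshot, not the live filesystem
  dump -0 -f /backup/root.dump /dev/vg0/rootsnap

  # drop the snapshot once the backup is done
  lvremove -f /dev/vg0/rootsnap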

> [...]
> In modern disks the sector layout is hidden. The fastest sectors may be at the beginning of the disk, the end, or striped throughout. This is specific to the design of the HDD and it is no longer possible to tell short of doing timing tests[1]. My recommendation is to ignore differences in sector speeds.
>
> [1] I'd love to hear if anyone has found a method but I can't see how they could get through the h/w abstraction.

Good to know; I've actually never seen anything fancy like that (striped throughout). I'll test my disks to see how I can make the best out of them anyway - but I agree with you in the case one wants to set up a portable, deployable system.
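
(I was just going to compare raw read throughput at both ends of the disk, roughly like this - device and offsets are made up, and iflag=direct bypasses the page cache:)

  # near the start of the disk
  dd if=/dev/sda of=/dev/null bs=1M count=256 skip=0 iflag=direct
  # near the end (skip counts bs-sized blocks)
  dd if=/dev/sda of=/dev/null bs=1M count=256 skip=450000 iflag=direct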

>> Theoretically, LVM won't guarantee the physical position of the logical volumes anyway. And I'll need it if I do any partitioning.

> So now it is abstracted (at least) twice :)

Hehe, yeah.  I'm glad I'm not into forensics.  What a beautiful mess.

>> * Swap special-case
>> [...]

> Under Linux 2.6 kernels a swap file is as efficient as a swap partition. The only real advantage of a swap partition is to allow suspend to disk (on a laptop).

Really? That's too bad. I can't think of any real obstacle; I hope this limitation will be lifted.
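
(For reference, setting one up is trivial enough anyway - path and size are examples:)

  # create and enable a 2 GB swap file
  dd if=/dev/zero of=/swapfile bs=1M count=2048
  chmod 600 /swapfile
  mkswap /swapfile
  swapon /swapfile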

>> There are however some neat dynamic swap allocation projects out there that would help me not lose these gigabytes I never seem to be using (at all). I

> I wouldn't touch these if they in any way impacted performance. Disk is cheap. Give yourself 2GB swap.

Yup, they currently do (AFAIK): allocating new swap space on the fly takes a little bit of time. Let's say it's cool to have in addition to a fixed swap space, as an extra safety measure.

>> figured, with all this RAM, I could think of the swapping space as a mere rescue space to prevent OOM rampages - and nothing else. In *my* case, even buffers and cached pages never get to be pushed to disk after weeks without

> Ah, but they are. Cache pages may be clean or dirty. Your disk cache may be full of clean cache pages, which is just fine.

Am I interpreting the output of free(1) the wrong way?

  cay:~$ free -o
               total       used       free     shared    buffers     cached
  Mem:       3116748    3029124      87624          0     721500    1548628
  Swap:      3145720        800    3144920

To me, it looks like only 800 KiB are actually swapped (uptime 11d) - I don't know how I can see what type of data it is. Is that irrelevant?

RAM is not even fully used, so at first glance that doesn't surprise me.
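
In case it helps, here's how I tried to dig further (if I'm not mistaken, only fairly recent kernels expose the per-process VmSwap field):

  # system-wide swap counters
  grep -i swap /proc/meminfo

  # rough per-process swap usage, where available
  grep VmSwap /proc/[0-9]*/status 2>/dev/null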

>> rebooting. I'm just OK with my three gigs. The 1:1 mem:swap rule has got to be wasting space here, hasn't it?

> Absolutely.  This page has my thoughts on this topic:
>
> http://practicalsysadmin.com/wiki/index.php/Swap_space

>> Thanks in advance for your help. I hope I could make you think twice about it too, or maybe provide people with other needs with a little checklist to better design their layout.

> Thanks for the great checklist.

Thanks for taking the time to look at this, and for the links to your pages (these are useful).

> Cheers,
>
> Rob

-thib

