Re: file systems

To: debian-user@lists.debian.org
Subject: Re: file systems
From: Stan Hoeppner <stan@hardwarefreak.com>
Date: Thu, 05 May 2011 06:15:11 -0500
Message-id: <[🔎] 4DC286BF.3060400@hardwarefreak.com>
In-reply-to: <[🔎] 201105041844.18604.bss@iguanasuicide.net>
References: <87wriqjd0t.fsf@towardsfreedom.com> <[🔎] 201105021602.50060.bss@iguanasuicide.net> <[🔎] 4DC1E009.30209@hardwarefreak.com> <[🔎] 201105041844.18604.bss@iguanasuicide.net>

On 5/4/2011 6:44 PM, Boyd Stephen Smith Jr. wrote:

In<[🔎] 4DC1E009.30209@hardwarefreak.com>, Stan Hoeppner wrote:

On 5/2/2011 4:02 PM, Boyd Stephen Smith Jr. wrote:

They are also essential for any journaled filesystem to have correct
behavior in the face of sudden pwoer loss.


This is true only if you don't have BBWC.


No.  It is true even with BBWC.

No, it's not. Sorry I didn't find any Debian documentation to prove mypoint. I'll use Red Hat docs:


http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html/Storage_Administration_Guide/writebarrieronoff.html

"For devices with non-volatile, battery-backed write caches and thosewith write-caching disabled, you can safely disable write barriers atmount time using the -o nobarrier option for mount. However, somedevices do not support write barriers; such devices will log an errormessage to /var/log/messages (refer to Table 17.1, “Write barrier errormessages per file system”)."

You will see such errors with very high end SAN arrays, as I previouslymentioned. They simply don't support write barriers. Why? Becauseconstantly flushing an entire 16-64 *GigaByte* battery or flash backedwrite cache, sitting in front of 2048 SAS drives, because 64 servers onthe SAN keep issuing barriers at the rate of 10,000/second, is a mindnumbingly dumb thing to do.


http://docs.redhat.com/docs/en-US/Red_Hat_Enterprise_Linux/6/html/Storage_Administration_Guide/writebarrierconsider.html

"Write barriers are also unnecessary whenever the system uses hardwareRAID controllers with battery-backed write cache. If the system isequipped with such controllers and if its component drives have writecaches disabled, the controller will advertise itself as a write-throughcache; this will inform the kernel that the write cache data willsurvive a power loss."

Even with a a battery-packed RAID cache, like I have in my desktop,
executing without barrier can result in extra data loss that executing
with a barrier prevents.


Then I'd say you have a problem with your BBWC RAID controller in your
desktop.  Which BBWC RAID card do you have?


Areca ARC-1160.

Can you kindly point me to your past posts where you discussed this'extra data loss' problem you experienced? After AC power loss, withyour Areca-1160 w/ ARC-6120BA-T112 battery unit? I'd like to betterunderstand the circumstances surrounding the data loss.

Of course, even with out barriers a properly journaled or log-structed
filesystem should be able to immediately and silently recover.


This contradicts what you stated above.


No, it doesn't.  The filesystem can recover by dropping or replaying journal /
log entries that were not yet flushed to disk.  That doesn't mean you haven't
lost any data, if parts of the journal that existed in cache before the power
failure.

The argument you made was that barriers are required to maintain correctjournal write ordering. If that order isn't maintained because barriersare turned off, then, using your argument, the replaying of the 'out oforder' log journal will likely corrupt the filesystem. You seem toarguing from both sides of the fence.

With barriers, you a guaranteed to be able to recover to the last barrier.
Without them, the hardware many have fully, partially, of not-at-all completed
virtually any I/O.

This is generally true, but depends on the 'hardware' you're referringto, as I've pointed out a few times now in this thread.

This is why (good) BBWC enabled RAID cards automatically disable the
caches on all the drives,


Mine provides the option.  I can't remember what setting I'm using right now.
IIRC, I continue to use the drives write cache because I have a UPS that
provides enough time for a clean shutdown, even when under load.

Given that you have both the ARC-6120BA-T112 RAID card battery and aUPS, I'm now really curious to know more about your data loss due to notusing barriers.

and thus why it is recommended to disable
barriers for filesystems on BBWC RAID cards.


By whom?  Reference please.


Links and excerpts provided above.

The nobarrier results are far more relevant than the barrier results,
especially the 16 and 128 thread results, for those SAs with high
performance persistent storage.


I disagree entirely.  You should be looking at the threaded results,
probably 128 threads (depending on what the server does), but you should
also be using barriers.


You just said you "disagree entirely" and then say 128 threads, same
thing I said.  But then you recommend barriers, which is the disagreement.


You said 128 threads unconditionally, I admitted that there are certain
workloads where 16 threads is a more correct model.

The multi-thread tests are simply used to show how each filesystemscales with parallel workloads. Some servers will never see 16 parallelIO streams, such as most SOHO servers. Some servers will see thousandsof simultaneous IO streams, such as the Linux kernel archives servers.There is no "correct model".


--
Stan

Reply to:

Follow-Ups:
- Re: file systems
  - From: "Boyd Stephen Smith Jr." <bss@iguanasuicide.net>

References:
- Re: file systems
  - From: "Boyd Stephen Smith Jr." <bss@iguanasuicide.net>
- Re: file systems
  - From: Stan Hoeppner <stan@hardwarefreak.com>
- Re: file systems
  - From: "Boyd Stephen Smith Jr." <bss@iguanasuicide.net>

Prev by Date: How to install Software package on a Linux System when it do not have Internet - debbundle
Next by Date: Re: Need /etc/apt/sources.list
Previous by thread: Re: file systems
Next by thread: Re: file systems
Index(es):
- Date
- Thread