Re: Having fun with the following C code (UB)

To: debian-devel@lists.debian.org
Subject: Re: Having fun with the following C code (UB)
From: Shachar Shemesh <shachar@shemesh.biz>
Date: Fri, 11 Apr 2014 17:33:05 +0300
Message-id: <[🔎] 5347FD21.2090302@shemesh.biz>
In-reply-to: <[🔎] 5347C8B4.7040409@debian.org>
References: <CA+7wUsxcFAYzKJ-nTJviE_v5y_-2c9G79R4gL0GYEat0Db2E+w@mail.gmail.com> <[🔎] 20140410100321.GB18703@grep.be> <[🔎] 20140410102950.GA7517@jwilk.net> <[🔎] 20140410104203.GD18703@grep.be> <[🔎] 21318.32941.351261.72783@chiark.greenend.org.uk> <[🔎] Pine.BSM.4.64L.1404101147250.23837@herc.mirbsd.org> <[🔎] 20140410153409.GA32321@xvii.vinc17.org> <[🔎] 21318.56319.795404.108549@chiark.greenend.org.uk> <[🔎] 5346F981.9010802@debian.org> <[🔎] 21319.50942.486880.36127@chiark.greenend.org.uk> <[🔎] 5347C8B4.7040409@debian.org>

On 11/04/14 13:49, Ansgar Burchardt wrote:

Hi,

On 04/11/2014 12:42, Ian Jackson wrote:

What people expect is that the compiler compiles programs the way C
was traditionally compiled.

Shouldn't -O0 come close to that expectation?

I think that Ansgar's answer is spot on, but against all good sense, I still want to expand it.

Neither the compiler nor its authors are doing anything out of spite. It is, indeed, painful when a compiler optimizes away a security check due to some standard defining a feature to be "undefined behavior". However, for any such case there are hundreds in which this optimization saves on an "if" that would strain the branch prediction cache, or allows coalescing operations that would otherwise need to be done one after the other, or any number of other cases in which the output machine language looks nothing like your written high level C or C++.

Not only is this good for performance, it is also good for security. For example, in C++ I can run the following code:

for( unsigned int i=0; i<size; ++i )
vector1.at(i) = vector2.at(i);

"at" is better than square brackets, because it does bounds checking. I think you'll agree with me that bounds checking is good for security. Running this code as written, however, results in too many bounds checking. Luckily, the same compiler optimization that angered you will now realize that the bounds only ever need to be tested once. The result is machine code that looks nothing like my C++ source, but which does things both quickly and securely.

The quickly part is important. If I had to actually run the bounds checking each and every time, this code would, likely, be too slow to be practical. I would, then, have no choice but to use the version that does not do bounds checking. I'd like to hear how that would make my code more secure.

Alternatively, I might re-write the loop. This loop is relatively easy to write with explicit bounds checking, but explicit bounds checking has two major disadvantages:
1. It is easy to forget to do it correctly (see the heartbleed problem)
2. It makes the code less readable and less maintainable.

Both of those problems, again, translate to less secure code.

I, for one, accept the extra liability that modern optimizers provide in exchange for the easier to maintain, more secure code they allow me to write.

Shachar

Reply to:

References:
- Re: Having fun with the following C code (UB)
  - From: Wouter Verhelst <wouter@debian.org>
- Re: Having fun with the following C code (UB)
  - From: Jakub Wilk <jwilk@debian.org>
- Re: Having fun with the following C code (UB)
  - From: Wouter Verhelst <wouter@debian.org>
- Re: Having fun with the following C code (UB)
  - From: Ian Jackson <ijackson@chiark.greenend.org.uk>
- Re: Having fun with the following C code (UB)
  - From: Thorsten Glaser <tg@mirbsd.de>
- Re: Having fun with the following C code (UB)
  - From: Vincent Lefevre <vincent@vinc17.net>
- Re: Having fun with the following C code (UB)
  - From: Ian Jackson <ijackson@chiark.greenend.org.uk>
- Re: Having fun with the following C code (UB)
  - From: Shachar Shemesh <shachar@debian.org>
- Re: Having fun with the following C code (UB)
  - From: Ian Jackson <ijackson@chiark.greenend.org.uk>
- Re: Having fun with the following C code (UB)
  - From: Ansgar Burchardt <ansgar@debian.org>

Prev by Date: ITP: pgespresso -- Optional extension for Barman, Backup and Recovery Manager for PostgreSQL
Next by Date: Bug#744233: ITP: libhac-java -- hierarchical agglomerative clustering
Previous by thread: Re: Having fun with the following C code (UB)
Next by thread: Re: Having fun with the following C code (UB)
Index(es):
- Date
- Thread