Bug#706207: gcc-4.6, gcc-4.7: invalid optimization when doing double -> int math and conversion (on big endian archs(?))

To: Ondřej Surý <ondrej@debian.org>, 706207@bugs.debian.org
Cc: Bastian Blank <waldi@debian.org>
Subject: Bug#706207: gcc-4.6, gcc-4.7: invalid optimization when doing double -> int math and conversion (on big endian archs(?))
From: Gabriel Paubert <paubert@iram.es>
Date: Sat, 27 Apr 2013 17:14:56 +0200
Message-id: <[🔎] 20130427151455.GA3307@visitor2.iram.es>
Reply-to: Gabriel Paubert <paubert@iram.es>, 706207@bugs.debian.org
In-reply-to: <[🔎] CALjhHG95je_i_MXUDXEn8N8-4F8gXQwBQBpTmLEPAYLbGmcO5w@mail.gmail.com>
References: <[🔎] 20130426102753.12895.7039.reportbug@localhost6.localdomain6> <[🔎] 20130426104329.GA651@waldi.eu.org> <[🔎] CALjhHG95je_i_MXUDXEn8N8-4F8gXQwBQBpTmLEPAYLbGmcO5w@mail.gmail.com>

On Fri, Apr 26, 2013 at 01:04:30PM +0200, Ondřej Surý wrote:
> On Fri, Apr 26, 2013 at 12:43 PM, Bastian Blank <waldi@debian.org> wrote:
> > On Fri, Apr 26, 2013 at 12:27:53PM +0200, Ondřej Surý wrote:
> >> This code from libgd2:src/gd.c:clip_1d:
> >>   *y1 -= m * (*x1 - mindim);
> >> where
> >>   m = (double) -0.050000
> >>   *x1 = -200
> >>   mindim = 0
> >>   *y1 = 15
> >> results in *y1 = 4, which is incorrect value, since it should be 5.
> >
> > Nope. The result of "m * (*x1 - mindim)" is not 10, it is a floating
> > point value near 10, as 10 can't be expressed in double. So this is:
> > 15 - 10.00000001 = 4.9999999. This converted to int is 4.
> >
> >> Most simple workaround, which allows gcc to produce correct value:
> >>   *y1 -= (int)(m * (*x1 - mindim));
> >
> > Here you force the later part to be 10.
> >
> >> Assigning to some other variable also works ok:
> >>   int t;
> >>   t = m * (*x1 - mindim);
> >>   *y1 -= t;
> >
> > The same.
> >
> >> gcc-4.7 is unfortunatelly also affected.
> >> I just hope we don't compile the nuclear reactor controls with gcc :)
> >
> > Just don't convert floating point to fixed point.
> 
> I don't object to this, but somehow I fail to grasp the idea that the
> result depends on architecture and optimization level.

It really does, it seems that in this case it depends on the compiler
generating fused multiply accumulate instructions, which happens to be
the case of powerpc, ia64, probably s390 (and coming to x86).
(Note that ia64 is little-endian, except under HP-UX if I remember correctly).

Decomposing your example:
int t1= *x1 - mindim;  /* only integers, exact */
double t2=t1; /* Converted to double, exact for 32 bit int */
double t3=*y1; /* Same */
/* Now if you don't have fused multiply accumulate, the compiler has no choice */
double t4 = m*t2;
double t5 = t3-t4;
*y1 = (int)t5;
/* but if FMA is available, the compiler can merge two operations and get rid of t4 */
double t5=t3-m*t2;
*y1 = (int) t5;

The difference is in the rounding after m*t2, in your case 0.05*200 
rounds to exactly 10 in double precision, but is a actually a bit above 10. 
This is enough to make the result of the FMA a bit below 5 so the conversion
(truncation) to integer will return 4.

> 
> I would expect consistent results, even consistent *bad* results would be ok.

Nope, FMA can change the rules of the game in subtle ways. An easy way
to check for problems is to recompile the code with -mno-fused-madd.

	Gabriel

Reply to:

Follow-Ups:
- Bug#706207: gcc-4.6, gcc-4.7: invalid optimization when doing double -> int math and conversion (on big endian archs(?))
  - From: Ondřej Surý <ondrej@debian.org>

References:
- Bug#706207: gcc-4.6, gcc-4.7: invalid optimization when doing double -> int math and conversion (on big endian archs(?))
  - From: Ondřej Surý <ondrej@debian.org>
- Bug#706207: gcc-4.6, gcc-4.7: invalid optimization when doing double -> int math and conversion (on big endian archs(?))
  - From: Bastian Blank <waldi@debian.org>
- Bug#706207: gcc-4.6, gcc-4.7: invalid optimization when doing double -> int math and conversion (on big endian archs(?))
  - From: Ondřej Surý <ondrej@debian.org>

Prev by Date: Re: gnat-4.8_4.8.0-1~exp1_amd64.changes is NEW
Next by Date: Bug#706207: gcc-4.6, gcc-4.7: invalid optimization when doing double -> int math and conversion (on big endian archs(?))
Previous by thread: Bug#706207: gcc-4.6, gcc-4.7: invalid optimization when doing double -> int math and conversion (on big endian archs(?))
Next by thread: Bug#706207: gcc-4.6, gcc-4.7: invalid optimization when doing double -> int math and conversion (on big endian archs(?))
Index(es):
- Date
- Thread