Re: cortex / arm-hardfloat-linux-gnueabi (was Re: armelfp: new architecture name for an armel variant)

To: Matt Sealey <matt@genesi-usa.com>
Cc: debian-arm@lists.debian.org, Lennart Sorensen <lsorense@csclub.uwaterloo.ca>, Martin Guy <martinwguy@gmail.com>, Loïc Minier <lool@dooz.org>
Subject: Re: cortex / arm-hardfloat-linux-gnueabi (was Re: armelfp: new architecture name for an armel variant)
From: Paul Brook <paul@codesourcery.com>
Date: Thu, 15 Jul 2010 20:00:07 +0100
Message-id: <[🔎] 201007152000.08447.paul@codesourcery.com>
In-reply-to: <[🔎] AANLkTils_UZDBgweSkI5KGnDZc3-cRrqcwBHBziiRmY8@mail.gmail.com>
References: <[🔎] AANLkTikDNXA7bu1KQLU6-EIDMkNTpXhr0nBGCi9nImnI@mail.gmail.com> <[🔎] 201007151719.14372.paul@codesourcery.com> <[🔎] AANLkTils_UZDBgweSkI5KGnDZc3-cRrqcwBHBziiRmY8@mail.gmail.com>

> > Switching to the hard-float ABI certainly does give some benefit. While
> > 20% isn't a trivial difference, it's important to keep this in context.
> >  This is on top of what I'd guess is a 10x (i.e. 1000%) speedup achieved
> > without breaking the ABI and requiring a whole new port.
> 
> How do you figure a 10x speedup?

A fairly conservative guess at the cost of software floating point. Even a 
dog-slow FPU like on the Cortex-A8 should be at an order of magnitude faster 
than software.

> > about performance then a NEON optimized version of your critical code
> > should get you annother 4x or so on a Cortex-A8.
> 
> Yes it's about 4x mathematically but 2x in practice because of the ABI
> fudging.

Theoretical peak gain is way more than 4x. VFP on the A8 has a peak single 
precision performance of about 0.1 FLOP/cycle, maybe 0.2 if you enable runfast 
mode. NEON peak performance is 4 FLOP/cycle.
I've seen 2-3x speedup on plain scalar code without even attempting 
vectorization, so 4x seems fairly realistic given a bit of effort.

> >> What would not be so great is that even if it was fixed, the option to
> >> use a faster floating point ABI drags in a clone of
> >> every package on your system (at the very least, libc, libm, and all
> >> the system library dependencies) increasing the
> >> size of the installed system.
> > 
> > What you're describing here is multiarch.
> 
> Yes, which is needed anyway to support NEON where it's available. 

A new port (or arch) is only required if you break the ABI. Enabling NEON has 
no effect on the ABI.

Paul

Reply to:

Follow-Ups:
- Re: cortex / arm-hardfloat-linux-gnueabi (was Re: armelfp: new architecture name for an armel variant)
  - From: Matt Sealey <matt@genesi-usa.com>

References:
- cortex / arm-hardfloat-linux-gnueabi (was Re: armelfp: new architecture name for an armel variant)
  - From: Hector Oron <hector.oron@gmail.com>
- Re: cortex / arm-hardfloat-linux-gnueabi (was Re: armelfp: new architecture name for an armel variant)
  - From: Paul Brook <paul@codesourcery.com>
- Re: cortex / arm-hardfloat-linux-gnueabi (was Re: armelfp: new architecture name for an armel variant)
  - From: Matt Sealey <matt@genesi-usa.com>

Prev by Date: Re: cortex / arm-hardfloat-linux-gnueabi (was Re: armelfp: new architecture name for an armel variant)
Next by Date: Re: discussion reset! thumb2/thumbee code as on armv7-a
Previous by thread: Re: cortex / arm-hardfloat-linux-gnueabi (was Re: armelfp: new architecture name for an armel variant)
Next by thread: Re: cortex / arm-hardfloat-linux-gnueabi (was Re: armelfp: new architecture name for an armel variant)
Index(es):
- Date
- Thread