[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Announcing cdrskin-0.7.2



Hi,

D. Hugh Redelmeier:
> If you double the gfpow table's size you could elminate the % 255.

That was a substantial improvement.
It now needs only 10 % more time than the old
implementation. I.e. you cut off 80% of the
initial performance deficit.

One has to unroll the table gfpow[] up to 511
elements because the highest sum of two gflog[]
elements is 510.


My own optimization idea about Q index
computation appears to have been anticipated by
gcc -O2 already.
But i was able to squeeze out the remaining 10 %
by computing LSB and MSB of the 16 bit words in
one loop rather than in two.
Now it depends mainly on external influences
which of both algorithms wins the race.
Maybe the old one is by an epsilon faster.


Have a nice day :)

Thomas


Reply to: