Re: Locales/sort bug
On Thu, Nov 04, 2010 at 10:55:53PM +0000, Camaleón wrote:
> On Thu, 04 Nov 2010 21:23:27 +0100, Rob Gom wrote:
>
> > [cut]
> >>
> >> I'm also getting that behaviour (locale set to "es_ES.UTF-8") so I
> >> understand that my locale setting dictates "underscore" ("_") comes
> >> first than "comma" (",") symbol.
> >>
> >> As per "man sort" page:
> >>
> >> *** WARNING *** The locale specified by the environment affects sort
> >> order. Set LC_ALL=C to get the traditional sort order that uses native
> >> byte values.
> >>
> >> Do you think that is a bug? :-?
> >
> > If so, why do I get order comma, underscore, comma? Even better,
> > comma+quote+A, underscore+d,comma+quote+M. I don't get it...
>
> Mmm... you're right, I missed the first line :-?
>
> Heck, it's even weirder with this sequence:
>
> aph3,"z
> aph3_devel,"a
> aph3,"b
>
> I gets sorted as:
>
> aph3,"b
> aph3_devel,"a
> aph3,"z
>
> I'm trying to "reverse-engineering" the logic behind the sort but I can't
> see it. Maybe it is done randomly? Very curious, indeed.
It just seems to ignore certain characters. Try filtering the output
through, for example, 's/[_|"|,]//g' and the you get it in the right
order.
David
Reply to: