
Re: Locales/sort bug



On Thu, Nov 04, 2010 at 10:55:53PM +0000, Camaleón wrote:
> On Thu, 04 Nov 2010 21:23:27 +0100, Rob Gom wrote:
> 
> > [cut]
> >>
> >> I'm also getting that behaviour (locale set to "es_ES.UTF-8") so I
> >> understand that my locale setting dictates that the underscore ("_")
> >> sorts before the comma (",").
> >>
> >> As per "man sort" page:
> >>
> >> *** WARNING *** The locale specified by the environment affects sort
> >> order. Set LC_ALL=C to get the traditional sort order that uses native
> >> byte values.
> >>
> >> Do you think that is a bug? :-?
> > 
> > If so, why do I get order comma, underscore, comma? Even better,
> > comma+quote+A, underscore+d,comma+quote+M. I don't get it...
> 
> Mmm... you're right, I missed the first line :-?
> 
> Heck, it's even weirder with this sequence:
> 
> aph3,"z
> aph3_devel,"a
> aph3,"b
> 
> It gets sorted as:
> 
> aph3,"b
> aph3_devel,"a
> aph3,"z
> 
> I'm trying to reverse-engineer the logic behind the sort but I can't
> see it. Maybe it is done randomly? Very curious, indeed.

It just seems to ignore certain characters.  Try filtering the output
through, for example, sed 's/[_",]//g' and then you can see that it is
in the right order.
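
Here is a minimal sketch of that idea in Python, assuming the locale's
collation simply skips underscore, comma and double quote when comparing
(a simplification; the real locale rules are more involved).  The helper
name strip_punct is made up for illustration.

```python
import re

# The three lines from the example quoted above.
lines = ['aph3,"z', 'aph3_devel,"a', 'aph3,"b']

def strip_punct(s):
    # Hypothetical helper: remove the same characters as sed 's/[_",]//g'.
    return re.sub(r'[_",]', '', s)

# Assumption: the locale-aware sort behaves as if these characters were absent.
ignoring_punct = sorted(lines, key=strip_punct)

# Plain code-point comparison, which for ASCII matches the byte order
# that "man sort" describes for LC_ALL=C.
byte_order = sorted(lines)

print(ignoring_punct)  # ['aph3,"b', 'aph3_devel,"a', 'aph3,"z']
print(byte_order)      # ['aph3,"b', 'aph3,"z', 'aph3_devel,"a']
```

The first result matches the order Camaleón reported; the second is the
"traditional" order, where the comma (0x2C) sorts before the underscore
(0x5F).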

David

