[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Questions regarding utf-8



Andreas Metzler <ametzler@downhill.at.eu.org> writes:

> *prompt* echo ö§ | recode latin1..ascii
> "oSS
> *prompt* echo ö§ | iconv -f latin1 -t
> ascii//TRANSLIT ; echo $?
> oe?
> --------------
> »oe« is much better than »"o« and »SS« is no usable replacement for
> »§«  (I do not think there is one), it would be nice if iconv's
> exit-status reflected whether questionmarks were used, but changing
> this would probably break existing software.

     I agree oe is much better than "o, but I can't get that:

bob:vc-p2:bob>echo ö§ | iconv -f latin1 -t ascii//TRANSLIT
??

     I have iconv 2.3.1.  Is it possible that my locale affects the
glyphs iconv returns?  My locale is C.  What locale are you using?

     Following are the values recode gives for the most common
non-ASCII characters: 

ä   à   ç  ñ   ê   ô   ö   ß   £ 
"a  `a  c  ~n  ^e  ^o  "o  ss  lb

     For all of these, iconv returns '?' on my system.

     I have not been able to find recode's transliteration of é. In
all cases I have tried, recode truncates the balance of the word
containing é, and the beginning of the next word, as in the following
examples:

Pierre Bézier
Pierre B

Conseil Européen pour la Normalisation.
Conseil Europour la Normalisation.

Comité Européen de Normalisation
Comitrope Normalisation

André Fischer
Andrcher

Réseaux Associés pour la Recherche Européenne
Rx Associur la Recherche Europ

Répondez s'il vous plait
Rez s'il vous plait

     I am not happy with the values recode gives, but, in most cases,
they are better than the ?.  I would like to learn how to make iconv
give results similar to those you get.

Regards,

Bob
-- 
   _
  |_)  _  |_    Robert D. Hilliard        <hilliard@debian.org>
  |_) (_) |_)   1294 S.W. Seagull Way     <bob@bobhilliard.net>
                Palm City, FL 34990 USA   GPG Key ID: 390D6559 
                                            



Reply to: