[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Questions regarding utf-8



On Thu, May 08, 2003 at 07:50:50PM -0400, Bob Hilliard wrote:

>      1.  How can I determine what character encoding is used in a
>          document without manually scanning the entire file?

"file" gives good estimates for plain text files.

>      2.  What is the best available filter to convert from encoding X
>          to 7 bit ASCII?

iconv is the generic libc-based converter:

  iconv --from=SRC-ENCODING --to=DST-ENCODING SRC-FILE >DST-FILE

>      3.  What is the difference between utf-8 and en_US.utf8?

UTF-8 is an encoding, en_US.UTF-8 a language in UTF-8 encoding.

>      Pointers to the appropriate documentation would be very welcome,
> since I feel a need to become more knowledgeable about this subject.

man iconv
man utf-8

 - Sebastian



Reply to: