Re: Questions regarding utf-8
On Thu, May 08, 2003 at 07:50:50PM -0400, Bob Hilliard wrote:
> 1. How can I determine what character encoding is used in a
> document without manually scanning the entire file?
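"file" gives a good estimate for plain text files, and "file -i" prints
the guess as a MIME type with a charset. It is a heuristic based on the
file's contents, so treat the answer as a guess, not a guarantee.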
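If you want to see the idea behind such a guess, here is a minimal sketch
in Python. It is not how "file" is implemented, and the function name
guess_encoding is made up for illustration. It only checks whether the
bytes decode strictly as ASCII or UTF-8, and gives up otherwise:

```python
def guess_encoding(data: bytes) -> str:
    """Classify bytes as ASCII, UTF-8, or an unidentified 8-bit encoding."""
    for name in ("ascii", "utf-8"):
        try:
            # Strict decoding raises on the first byte sequence that is
            # not valid in this encoding.
            data.decode(name)
            return name
        except UnicodeDecodeError:
            continue
    # Legacy 8-bit encodings (ISO-8859-x and friends) cannot be told
    # apart by validity alone, so this sketch does not try.
    return "unknown-8bit"


if __name__ == "__main__":
    print(guess_encoding(b"plain text"))
    print(guess_encoding("caf\u00e9".encode("utf-8")))
    print(guess_encoding("caf\u00e9".encode("latin-1")))
```

Note that pure ASCII text is also valid UTF-8, which is why ASCII is
tried first.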
> 2. What is the best available filter to convert from encoding X
> to 7 bit ASCII?
iconv is the generic libc-based converter:
iconv -f SRC-ENCODING -t DST-ENCODING SRC-FILE >DST-FILE
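Keep in mind that converting to 7 bit ASCII is lossy: anything outside
ASCII has to be approximated or dropped. With GNU iconv, appending
//TRANSLIT to the target encoding asks for approximations instead of an
error. The same idea can be sketched in Python as below. The function
name to_ascii is made up for illustration, and stripping accents via
Unicode decomposition is just one possible approximation policy:

```python
import unicodedata


def to_ascii(data: bytes, src_encoding: str) -> bytes:
    """Convert bytes in src_encoding to 7-bit ASCII, approximating where possible."""
    text = data.decode(src_encoding)
    # NFKD splits accented letters into a base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop the combining marks so that only the base letters remain.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Whatever is still outside ASCII becomes "?".
    return stripped.encode("ascii", errors="replace")


if __name__ == "__main__":
    print(to_ascii("caf\u00e9".encode("latin-1"), "latin-1"))
```

Characters without an ASCII-like decomposition, such as the euro sign,
come out as "?" in this sketch.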
> 3. What is the difference between utf-8 and en_US.utf8?
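UTF-8 is a character encoding. en_US.UTF-8 is a locale: the language
(en, English) and territory (US) conventions, with UTF-8 as the
character encoding. glibc accepts en_US.utf8 as another spelling of
the same locale name.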
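A locale name follows the pattern language[_TERRITORY][.codeset][@modifier],
so the encoding is only one of its parts. A small sketch in Python that
takes such a name apart (split_locale is a made-up helper, not a library
function, and it does no validation):

```python
def split_locale(name: str) -> dict:
    """Split a locale name of the form language[_TERRITORY][.codeset][@modifier]."""
    # Peel the optional parts off from the right-hand side first.
    name, _, modifier = name.partition("@")
    name, _, codeset = name.partition(".")
    language, _, territory = name.partition("_")
    return {
        "language": language,
        "territory": territory,
        "codeset": codeset,
        "modifier": modifier,
    }


if __name__ == "__main__":
    print(split_locale("en_US.UTF-8"))
```

For en_US.UTF-8 this gives language "en", territory "US" and codeset
"UTF-8", with an empty modifier.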
> Pointers to the appropriate documentation would be very welcome,
> since I feel a need to become more knowledgeable about this subject.