[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: multi-language debian



Hi,

From: Pied Axioplase <pied@pied.mine.nu>
Subject: Re: multi-language debian
Date: Sat, 31 May 2003 00:30:55 +0000

> how comes people can input japanese in iso-8859-1 encoded pages and have their 
> text readable by anyone,

It is impossible.  ISO-8859-1 and EUC-JP (or Shift_JIS) are different
character encodings.  In its definition, ISO-8859-1 text cannot include
Japanese characters.

If you want to mix characters from ISO-8859-1 character set and
characters from JIS X 0208 character set (well, most of Japanese
characters included in EUC-JP encoding), you will have to use UTF-8
or ISO-2022.

Note that EUC-JP or Shift_JIS include ASCII characters.  Thus, if
you said "ASCII" instead of "ISO-8859-1", it is possible.

I hope you understand the difference between ASCII and ISO-8859-1.
If not, please tell me.


> though when i do so, i have to manually select 
> shift_JIS in my browser in order to read the text i input before?

If your file was well shown by choosing Shift_JIS, I imagine you
didn't use ISO-8859-1 (above 0xa0) characters.

There is a way to specify encoding of the web page.  

<HEAD>
  <META http-equiv="Content-Type" content="text/html; charset=foobar">
</HEAD>

Please put valid encoding name for "foobar".  Please download
http://www.debian.org/index.ja.html and research the source.


> and why when i watch the sources, thei text show up as #1234567 whereas mine 
> seems to be lots of Ì–¼‘O‚̓AƒhƒŠƒAƒ“‚Å‚·I“ú–{Œê‚ð• or really nice chars if 
> you seem what i mean...

What you mean?  I am afraid your mail is broken....

Please see http://www.debian.or.jp/~kubota/mojibake/ .

---
Tomohiro KUBOTA <kubota@debian.org>
http://www.debian.or.jp/~kubota/

Reply to: