[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#99324: Default charset should be UTF-8

On Wed, Jun 06, 2001 at 08:42:28PM +0900, Junichi Uekawa wrote:
> Radovan Garabik <garabik@melkor.dnp.fmph.uniba.sk> immo vero scripsit
> > > utf8 in the current state does not cover everything we had in other encodings.
> > 
> > utf8 is just a _multibyte_ encoding, not _character_ encoding,
> > it can represent whatever character encoding is used in UCS-4
> UCS4 is not a satisfactory encoding for our needs, unfortunately.
> JIS is not comlpete either, but UCS4 is less.

but: JIS is japanese only, UCS-4 is global
UCS-4 can (and will) be easily expanded, there are no technical 
problems in adding characters to this encoding

can JIS be easily extended to support missing characters?
I do not think so...
UCS-4 can, given some effort.

> I would not go against making programs utf-8-aware,
> but I don't think that changing all the documentation to utf-8
> is going too far.

not yet - it will be just recommendation so far

| Radovan Garabik http://melkor.dnp.fmph.uniba.sk/~garabik/ |
| __..--^^^--..__    garabik @ melkor.dnp.fmph.uniba.sk     |
Antivirus alert: file .signature infected by signature virus.
Hi! I'm a signature virus! Copy me into your signature file to help me spread!

Reply to: