Bug#99933: Bug#99324: Default charset should be UTF-8

To: Florian Weimer <Florian.Weimer@RUS.Uni-Stuttgart.DE>
Cc: 99933@bugs.debian.org, Radovan Garabik <garabik@melkor.dnp.fmph.uniba.sk>
Subject: Bug#99933: Bug#99324: Default charset should be UTF-8
From: Raul Miller <moth@debian.org>
Date: Mon, 11 Jun 2001 14:08:49 -0400
Message-id: <[🔎] 992282701.4004b924@debian.org>
Reply-to: Raul Miller <moth@debian.org>, 99933@bugs.debian.org
In-reply-to: <[🔎] tg66e22976.fsf@mercury.rus.uni-stuttgart.de>; from Florian.Weimer@RUS.Uni-Stuttgart.DE on Mon, Jun 11, 2001 at 07:54:53PM +0200
References: <[🔎] 20010611104113.A15114@melkor.dnp.fmph.uniba.sk> <[🔎] 20010611090721.A12776@usatoday.com> <[🔎] 20010611164718.A25953@melkor.dnp.fmph.uniba.sk> <[🔎] 992273294.21bafc15@debian.org> <[🔎] tg66e22976.fsf@mercury.rus.uni-stuttgart.de>

On Mon, Jun 11, 2001 at 07:54:53PM +0200, Florian Weimer wrote:
> IMHO, a better mechanism are Unicode 3.1 language tags, see:
> 
>         http://www.unicode.org/unicode/reports/tr27/#tag

Which says: 

   The characters in this block provide a mechanism for language tagging
   in Unicode plain text. However, the use of these characters is strongly
   discouraged. The characters in this block are reserved for use with
   special protocols. They are not to be used in the absence of such
   protocols, or with any protocols that provide alternate means for
   language tagging, such as HTML or XML.

Which implies that this mechanism isn't useful for representing different
languages in the same document.  That, instead, it's logically equivalent
to a MIME declaration of the document's language.

Maybe, in the future, the Unicode Consortium wants to change the standard
so that this mechanism can be used to represent multiple languages within
the same document.  But that's not the current standard.

-- 
Raul

Reply to:

References:
- Bug#99324: Default charset should be UTF-8
  - From: Radovan Garabik <garabik@melkor.dnp.fmph.uniba.sk>
- Bug#99324: Default charset should be UTF-8
  - From: Raul Miller <moth@debian.org>
- Bug#99933: Bug#99324: Default charset should be UTF-8
  - From: Radovan Garabik <garabik@melkor.dnp.fmph.uniba.sk>
- Bug#99933: Bug#99324: Default charset should be UTF-8
  - From: Raul Miller <moth@debian.org>
- Bug#99933: Bug#99324: Default charset should be UTF-8
  - From: Florian Weimer <Florian.Weimer@RUS.Uni-Stuttgart.DE>

Prev by Date: Bug#99933: Bug#99324: Default charset should be UTF-8
Next by Date: Bug#99933: Bug#99324: Default charset should be UTF-8
Previous by thread: Bug#99933: Bug#99324: Default charset should be UTF-8
Next by thread: Bug#99933: Bug#99324: Default charset should be UTF-8
Index(es):
- Date
- Thread