[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#99933: Bug#99324: Default charset should be UTF-8

On Mon, Jun 11, 2001 at 07:54:53PM +0200, Florian Weimer wrote:
> IMHO, a better mechanism are Unicode 3.1 language tags, see:
>         http://www.unicode.org/unicode/reports/tr27/#tag

Which says: 

   The characters in this block provide a mechanism for language tagging
   in Unicode plain text. However, the use of these characters is strongly
   discouraged. The characters in this block are reserved for use with
   special protocols. They are not to be used in the absence of such
   protocols, or with any protocols that provide alternate means for
   language tagging, such as HTML or XML.

Which implies that this mechanism isn't useful for representing different
languages in the same document.  That, instead, it's logically equivalent
to a MIME declaration of the document's language.

Maybe, in the future, the Unicode Consortium wants to change the standard
so that this mechanism can be used to represent multiple languages within
the same document.  But that's not the current standard.


Reply to: