Bug#99933: Bug#99324: Default charset should be UTF-8
On Mon, Jun 11, 2001 at 07:54:53PM +0200, Florian Weimer wrote:
> IMHO, a better mechanism are Unicode 3.1 language tags, see:
The characters in this block provide a mechanism for language tagging
in Unicode plain text. However, the use of these characters is strongly
discouraged. The characters in this block are reserved for use with
special protocols. They are not to be used in the absence of such
protocols, or with any protocols that provide alternate means for
language tagging, such as HTML or XML.
Which implies that this mechanism isn't useful for representing different
languages in the same document. That, instead, it's logically equivalent
to a MIME declaration of the document's language.
Maybe, in the future, the Unicode Consortium wants to change the standard
so that this mechanism can be used to represent multiple languages within
the same document. But that's not the current standard.