[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#105364: installer allows user to insert underscores in the hostname



On Tue, Jul 24, 2001 at 01:57:51PM +0200, Kjetil Torgrim Homme wrote:
> Matt Kraai <kraai@debian.org> writes:
> > Will the input be encoded in UTF-8?
> 
> No, that will break too many protocols.  That's the reason for ASCII
> Compatible Encoding, using only characters "a-z0-9/-".
> 
> Look at some of the examples (/^Exampl) in
>   http://www.i-d-n.net/draft/draft-ietf-idn-amc-ace-w-00.txt
> Notice that UTF8 is inefficient for Hangeul and other scripts, even if
> it uses the full 8 bits instead of 5.
> 
> (Personally, I hope something like
>  http://www.i-d-n.net/draft/draft-ietf-idn-udns-02.txt
>  passes.  I'm not too optimistic, this reminds me of all the warts of
>  MIME we probably never will be rid of.)

In order to implement this, I really need to know the answers to
the following questions:

	* What encoding is used for the host name input by the user?

	* What host name validation should be performed by dbootstrap?

	* What encoding should be used for writing the host name?

The kernel treats the host name as any other character string, and
so doesn't appear to care about the encoding.  I haven't been able
to find the libc source which deals with internationalized host
names, so I don't know what format it expects things in.

Matt



Reply to: