Re: Newbie : unrepresentable changes to source

To: "debian-devel@lists.debian.org" <debian-devel@lists.debian.org>
Subject: Re: Newbie : unrepresentable changes to source
From: Steve Langasek <vorlon@netexpress.net>
Date: Mon, 25 Nov 2002 14:07:55 -0600
Message-id: <20021125200752.GA12911@netexpress.net>
Mail-followup-to: "debian-devel@lists.debian.org" <debian-devel@lists.debian.org>
In-reply-to: <3DE27544.717FBDF4@infofin.com>
References: <3DE1D86F.1E6B06C5@infofin.com> <20021125174445.GB5446@quetzlcoatl.dodds.net> <3DE27544.717FBDF4@infofin.com>

On Tue, Nov 26, 2002 at 12:38:52AM +0530, Krishna Dagli wrote:
> > Why is a Unix program using UTF16BE (or UCS2BE) for its internal
> > representation of localization data?
> 
> As per the upstream author :
> UTF16LE or  UTF16BE tells that it's unicode (Gammu support
> both). I use Unicode in localisation data to avoid such problem: in the
> OS of somebody, who will make localisation data for X language,
> there is set different codepage than in my PC. But my codepage contains
> the same chars too. Using Unicode allows to avoid problems - on my PC
> all chars are displayed correctly too. I can open it in Unicode editor
> and see correct accent, etc. chars

Except that UTF-16 is the absolute dumbest Unicode encoding in existence,
inheriting compatibility problems from both widechar and multibyte
encoding styles: it is byte-order dependent and full of NUL bytes like a
widechar encoding, yet still variable-width like a multibyte one.  The
Unix convention is to use UTF-8 as the encoding for such things, to
maintain compatibility with C strings -- and with tools like diff.
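
To make the C-string point concrete, here is a minimal Python sketch.  The
sample string and the c_strlen() helper are made up for illustration; the
helper only mimics how a NUL-terminated string routine would measure a
buffer.

```python
# Hypothetical localisation string, used only for illustration.
text = "Save"

utf16 = text.encode("utf-16-be")  # two bytes per character here
utf8 = text.encode("utf-8")       # one byte per ASCII character


def c_strlen(buf: bytes) -> int:
    """Count the bytes before the first NUL, as a C string routine would."""
    idx = buf.find(b"\x00")
    return len(buf) if idx == -1 else idx


# Every ASCII character in UTF-16BE starts with a zero byte, so a C string
# routine stops before it has read anything at all.
print(utf16.hex(), c_strlen(utf16))
print(utf8.hex(), c_strlen(utf8))
```

The same embedded NUL bytes are a likely reason for diff to treat such a
file as binary rather than text.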

If you're stuck with changes to such files in your package, you must
encode those changes in a format diff can understand: either use
something like uuencode (from sharutils) to store the binary data in a
text format, or something like 'iconv' to convert the text to a sensible
encoding such as UTF-8.
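
As a rough sketch of the conversion route, the following Python does what
an iconv call between UTF-16BE and UTF-8 would do.  The function names and
the sample text are assumptions for illustration, not anything taken from
Gammu or from the package in question; the idea is that the UTF-8 form is
what you would keep in the diff, converting back only if the program
really insists on UTF-16BE.

```python
def utf16be_to_utf8(data: bytes) -> bytes:
    """Re-encode UTF-16BE bytes as UTF-8 (the diff-friendly form)."""
    return data.decode("utf-16-be").encode("utf-8")


def utf8_to_utf16be(data: bytes) -> bytes:
    """Re-encode UTF-8 bytes back to UTF-16BE."""
    return data.decode("utf-8").encode("utf-16-be")


# Hypothetical localisation text with accented characters.
original = "Za\u017c\u00f3\u0142\u0107 g\u0119\u015bl\u0105\n".encode("utf-16-be")

as_utf8 = utf16be_to_utf8(original)
restored = utf8_to_utf16be(as_utf8)

# The UTF-8 form has no NUL bytes, and the round trip is byte-exact.
print(b"\x00" in as_utf8, restored == original)
```

Both functions raise UnicodeDecodeError on malformed input, which is
probably what you want: a file that does not convert cleanly should not be
silently mangled on its way into the diff.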

-- 
Steve Langasek
postmodern programmer

