Re: UTF-8 editor support in Debian?

To: debian-devel@lists.debian.org
Subject: Re: UTF-8 editor support in Debian?
From: Gaute B Strokkenes <gs234@cam.ac.uk>
Date: Thu, 05 Jul 2001 18:30:04 +0200
Message-id: <[🔎] 87hewrtk37.fsf@cam.ac.uk>
In-reply-to: <[🔎] 20010705235500.A16409@strider> (Drew Parsons's message of "Thu, 5 Jul 2001 23:55:00 +1000")
References: <[🔎] 20010705231420.A16056@strider> <[🔎] 20010705152724.B18601@cistron.nl> <[🔎] 20010705235500.A16409@strider>

On Thu, 5 Jul 2001, dparsons@emerall.com wrote:
> 
> What are the main issues that need to be thought about?  Is it as
> simple as using w_char instead of char in the code, or is there more
> to it than that?  Can a general "plan of attack" be summarised in a
> couple of paragraphs?

That depends.  wchar_t is in general not portable, since it is defined
by the relevant standards as locale dependent, and could be as little
as 8 bits.  Moreover, the standard C mbs* and wcs* interfaces are
extremely limited and hard to use for any real work.  Even worse,
there are a lot of extremely buggy versions around.  You can in
general only counton wchar_t being Unicode based if you are a glibc
based system.  In particular, this is not so for any of the free BSDs
or Solaris etc.

I recommend you to have a look at Markus Kuhn's Unicode page.  I don't
have the URL handy, but do a google search for "markus kuhn unicode".
You will also want to have a look Bruno Haible's libiconv, libcharset
and libutf8 packages which provide partial (but extremely useful)
solutions to the portability problems mentioned above.

As for editors, emacs20 with mule-ucs is fine but has a few caveats
due to "philosophic differences" between Unicode and ISO-2022 (which
is what the mule internal coding is based on).  The next version of
emacs, emacs 21, is said to include native and much smoother support
for non-CJK unicode, though I haven't seen it myself.  Other than
that, you might want to try yudit.

-- 
Gaute Strokkenes                        http://www.srcf.ucam.org/~gs234/
Let's climb to the TOP of that MOUNTAIN and think about STRIP MINING!!

Reply to:

References:
- UTF-8 editor support in Debian?
  - From: Drew Parsons <dparsons@emerall.com>
- Re: UTF-8 editor support in Debian?
  - From: Wichert Akkerman <wichert@wiggy.net>
- Re: UTF-8 editor support in Debian?
  - From: Drew Parsons <dparsons@emerall.com>

Prev by Date: Re: Debugging g++-3.0 programs
Next by Date: Re: Debugging g++-3.0 programs
Previous by thread: Re: UTF-8 editor support in Debian?
Next by thread: Re: UTF-8 editor support in Debian?
Index(es):
- Date
- Thread