[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#234355: xlibs: .../locale/en_US.UTF-8/Compose: <dead_diaeresis> <space> produces a diaeresis



> > Package: xlibs
> > Version: 4.3.0-2
> > Severity: normal
> > 
> > This is very annoying! I can no longer get a quotedbl (") by pressing
> > <dead_diaeresis> <space>, as it was always. Why this change? A
> > diaeresis can still be produced with <dead_diaeresis>
> > <dead_diaeresis>, should anybody need it (!?).
> > 
> > I suggest you to apply the following patch. I hope there's no plans to
> > change <dead_acute> <space> to produce an useles <acute> instead of an
> > apostrophe... Why do people change things they don't use??
> 
> I am sorry, but I am reluctant to break with upstream practice on this
> issue without understanding the underlying issue a little better.

Well, what can I tell you...

There are people who have a keyboard with US layout, but need to type
non-ascii latin characters. For example, Germans living in the US will
likely have an US keyboard, but they still need to write umlauts when
they write in German. Or people living in countries where keyboards
are imported... sometimes keyboard availability depends on
internacional prices, and there are times when you can only get an
US keyboard, other times only an es-ES keyboard, and others only the
ill-designed es-LA one.

Anyway, these people use an US keyboard and need to input diacritics.
The standard, intuitive and historical way is to use the keys labeled
with symbols resembling the diacritics to input them. Thus, pressing
the apostrophe key produces a <dead_acute>, the quote a
<dead_diaeresis>, the circumflex a <dead_circumflex>, etc. The next
pressed key will result in an appropiately diacriticalised(?) letter.

So far, so good. But how to input the apostrophe, quotedbl, and
asciicircum? The historical way was to press the key followed by a
space. This is was they broke with the new en_US.UTF-8/Compose.

Why <dead_diaeresis> <space> should produce <quotedbl>:

1. Ability to produce quotedbl. When these people decided to "improve"
   the rules file, they took out the iso9995-3 symbols from the layout
   "pc*", variant "us_intl". So, people using UTF-8 locale, and using
   that layout/variant combination can no longer type a quotedbl
   (us_intl maps the quote key to dead_diaeresis).

2. Historical/backwards compatibility. Compatiblity with *me*, the
   dead_diaeresis user. I'm used to type a quote that way. I'm so used
   to it that I always type '" ' when I use a pure US keyboard layout,
   and need to backspace. It's frustrating to have one's keyboard
   change its behaviour according to the academic theories of the last
   XKB commiter.

3. Compatibility with other "pseudo-diacritical" keys. I'm afraid of
   saying that, should they inutilise their composing sequences too...
   <dead_acute> <space>: apostrophe, <dead_tilde> <sp>: asciitilde,
   <dead_grave> <sp>: grave.

4. It doesn't make sense. Maybe these people thought: "We were doing
   it wrong all the time! <dead_diaeresis> + space should produce a
   diacriticalised(?) space! That's the logical thing to do!" Well,
   there is no such thing as SPACE WITH DIAERESIS, and that key
   combination only produces a DIAERESIS. Ok, their reasoning is not
   too far-fetched, but there's already a way to produce a lone
   diacritic: pressing the key two times; and see next item.

5. WHY WOULD ANYBODY WANT TO PRODUCE A DIAERESIS? I can only think of
   one context: when you are talking about diaeresises. Like: "The
   diaeresis (¨) ...", or "<dead_diaeresis> <space> produces '¨'!".
   The ratio of usage of quotedbl vs. diaeresis, I think, must be
   infinity to one. Even if you have a keyboard with both a quotedbl
   and a diaeresis key, if you press <dead_diaeresis> <space>, the
   most likely thing is that you want a quotedbl. I think Windows
   applies the same line of reasoning that the XKB people, because
   many times I have seen mails in which the sender quoted ¨like this¨
   or ´like this´. They are obviously searching for the quote or
   apostrophe keys, and settle for the nearest similar-looking one. Of
   course they don't want the diacriticals but the quotes, because ...
   (see beginning of paragraph).

As an addition, I think the compose sequences should be consistent
across locales, but maybe I'm mistaken. And, in any case, I think the
"consistency trend" should be in the opposite direction (there are
already 9 locales with the wrong composing sequence (en_US.UTF-8 is
modified by me)).

  $ grep "^<dead_diaeresis> <space>" */Compose | expand -t1
  el_GR.UTF-8/Compose:<dead_diaeresis> <space> : "¨"  U00a8
  el_GR.UTF-8/Compose:<dead_diaeresis> <space>  : "¨" diaeresis
  en_US.UTF-8/Compose:<dead_diaeresis> <space>  : "\"" quotedbl
  iso8859-15/Compose:<dead_diaeresis> <space>  : "\"" quotedbl
  iso8859-1/Compose:<dead_diaeresis> <space>  : "\"" quotedbl
  iso8859-2/Compose:<dead_diaeresis> <space>  : "\250" diaeresis
  iso8859-3/Compose:<dead_diaeresis> <space>  : "\250" diaeresis
  iso8859-7/Compose:<dead_diaeresis> <space>  : "\250" diaeresis
  iso8859-7/Compose:<dead_diaeresis> <space>  : "\250" diaeresis
  iso8859-9/Compose:<dead_diaeresis> <space>  : "\250" diaeresis
  iso8859-9e/Compose:<dead_diaeresis> <space>  : "\250" diaeresis
  pt_BR.UTF-8/Compose:<dead_diaeresis> <space>  : "\"" quotedbl

But I won't protest for a locale I don't use. en_US.UTF-8 is the
important for me.

I hope these notes were helpful to you.

Bye.




Reply to: