[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#1032356: look: can't open /usr/lib/plan9/lib/words



Package: 9base
Version: 1:6-13
Severity: normal
X-Debbugs-Cc: witold.baryluk@gmail.com

As installed, look is not very usable:

$ /usr/lib/plan9/bin/look
look: can't open /usr/lib/plan9/lib/words
$

This is most likely a porting bug, as man page, says by default it uses

/usr/share/dict/words

Using explicitly this file, makes it work:

$ /usr/lib/plan9/bin/look foo /usr/share/dict/words
foo
foobar
food
...


Note: /usr/share/dict/words is usually provided by wamerican. Users are
free to divert the symlink to other word lists.

Additionally:

I am pretty sure look uses UTF-8 (obviously), and in Debian now, all word
dictionaries are now using UTF-8 too. However, not all dictionaries use
Unicode or UTF-8 sorting.

I tested it with Polish word list, and look does not work on it. Even
without using polish dictrics in a pattern or a word. I am guessing this
is because from quick tests, it looks like plan9 look looks to be doing a
binary search in file, but many locale specific dictionaries are sorted
using locale specific collation rules, not Unicode rules. For example,
for Polish wordlist:

...
Aaseny
ab
Ab
aba
ABA
abace
...


because this is what official Polish language rules say to do (as used
for centuries in encyclopedias and paper dictionaries). In dictionaries,
usually character-after-character rules are used. In encylopedias
word-after-word (so for example Jan Sobieski, is before Janina). (Polish
dictionary does not have any multi-word words, so that is moot anyway).

Whetever it is useful for spell checking purposes, is another matter.

(not util-linux's look also does not work)


-- System Information:
Debian Release: 12.0
  APT prefers unstable-debug
  APT policy: (500, 'unstable-debug'), (500, 'unstable'), (1, 'experimental')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 6.2.0-rc5 (SMP w/32 CPU threads; PREEMPT)
Kernel taint flags: TAINT_UNSIGNED_MODULE, TAINT_SOFTLOCKUP
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE not set
Shell: /bin/sh linked to /usr/bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages 9base depends on:
ii  debianutils  5.7-0.4
ii  libc6        2.36-8

9base recommends no packages.

9base suggests no packages.

-- no debconf information


Reply to: