[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#905126: marked as done (www.debian.org: Website search box unhelpful for common names (e.g. Buster) in certain character sets)



Your message dated Thu, 2 Aug 2018 05:03:13 +0100
with message-id <20180802040313.r67chi7v5wgtympk@survex.com>
and subject line Re: www.debian.org: Website search box unhelpful for common names (e.g. Buster) in certain character sets
has caused the Debian Bug report #905126,
regarding www.debian.org: Website search box unhelpful for common names (e.g. Buster) in certain character sets
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@bugs.debian.org
immediately.)


-- 
905126: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=905126
Debian Bug Tracking System
Contact owner@bugs.debian.org with problems
--- Begin Message ---
Package: www.debian.org
Severity: normal

A number of search languages end up with no results for contextually
common search terms, for example "debian" or "buster".

To reproduce:
 - use the search box for the term "buster" in English. There are a
   number of results including release information, news items and
   errata.
 - set the language to Vietnamese, Chinese or similar and search again
 - there are no results.

I assume that this is an issue with translations into non-Latin
character sets without hint words nearby the translated word.

-- System Information:
Debian Release: 9.5
  APT prefers stable
  APT policy: (990, 'stable')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 4.16.0-0.bpo.2-amd64 (SMP w/4 CPU cores)
Locale: LANG=en_GB.utf8, LC_CTYPE=en_GB.utf8 (charmap=UTF-8), LANGUAGE=en_GB:en (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

--- End Message ---
--- Begin Message ---
On Thu, Aug 02, 2018 at 01:04:25AM +0100, Olly Betts wrote:
> I'm not sure how the stemmer mapping file is generated, but I'll look
> into it today if I can.  I think we should be able to just specify a
> default of "none" but I suspect this file is generated so I need to
> fix the script not just the current output.

It's generated by /srv/search.debian.org/bin/gen-stemmer.sh - I've
fixed that to generate $set{stemmer,none} before anything else, run
it by hand, and now I can search for "buster" in "chinese-china" and
"vietnamese", and stemming is still enabled for "english".

This should fix all cases, but please reopen if not.

Cheers,
    Olly

--- End Message ---

Reply to: