[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: enable searching East Asian words at search.debian.org



Hi,

From: Josip Rodin <joy@srce.hr>
Subject: Re: enable searching East Asian words at search.debian.org
Date: Sun, 11 May 2003 19:44:17 +0200

> On Sun, May 11, 2003 at 11:09:44PM +0900, Tomohiro KUBOTA wrote:
> > > c=`grep CHARSET ../.wmlrc | cut -d= -f2`; \
> > >           iconv -f $c -t UTF-8 search.ja.html | perl -pe 's,^(\s*<meta http-equiv="Content-Type" content="text/html; charset=)\S+(">)$,$1UTF-8$2,' > search.ja.html
> > > iconv: cannot open input file `euc-jp': No such file or directory
> > 
> > Sorry I don't understand what you are doing.  However, my "improvement"
> > is not related to search.ja.html (or translation of search page) at all.
> 
> Well, it's related if you want people to be able to actually input stuff
> properly into the search engine. :)

OK, I remembered.  The search web page must be UTF-8.

The current (English) version of the search page is already UTF-8 and
have no problem for international search, I think.

However, if you would like to supply translated search pages (though
I think it is not an urgent problem), I just read the
webwml/english/searchtmpl/Makefile and found that
`grep CHARSET ../.wmlrc` might have a problem.  webwml/japanese/.wmlrc
have two lines which matches 'grep CHARSET', which are
'-D CHARSET=iso-2022-jp' and '-D CHARSET_WML=euc-jp'.

---
Tomohiro KUBOTA <kubota@debian.org>
http://www.debian.or.jp/~kubota/




Reply to: