[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Output of dpkg-scanpackages as XML



William Ballard wrote:
[...]
But back to Linux.
$echo hi | iconv -f utf8 -t unicode | grep hi
(no output)

Not surprised; grep understands ASCII, AFAIK, so what you've just sent to it is:

$ echo hi | iconv -f utf8 -t unicode | od -t x1
0000000 ff fe 68 00 69 00 0a 00

It can't find an 'h' and an 'i' next to each other. That's why I mentioned UTF8; UTF8 has the nice property that anything that can be represented in plain ASCII *is*, and all other characters are high-bit, which grep and friends will pass straight through.

It's still an ad-hoc solution, though; does anyone know of versions of the standard textutils that know about Unicode?

--
[insert interesting .sig here]



Reply to: