Re: coreutils wc count multi bytes question
Neo Anderson, le Fri 06 Feb 2009 15:18:51 -0800, a écrit :
> this is a 文件 vi 打的
> The manual words count are 8 characters.
How do you count that?
> But the output of wc -w is 6. It seems like it is separated as token by white space. So the characters of Chinese which concatenates together would be treated as one character; resulting in the total words count is 6.
Well, yes. Do you mean that wc should know chinese enough to determine
whether a few kanjis form a word or not?