[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: coreutils wc count multi bytes question



Hello,

Neo Anderson, le Fri 06 Feb 2009 15:18:51 -0800, a écrit :
> this is a 文件 vi 打的
> 
> The manual words count are 8 characters.

How do you count that?

> But the output of wc -w is 6. It seems like it is separated as token by white space. So the characters of Chinese which concatenates together would be treated as one character; resulting in the total words count is 6. 

Well, yes.  Do you mean that wc should know chinese enough to determine
whether a few kanjis form a word or not?

Samuel


Reply to: