[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: َUnicode problem

On Thu, Dec 18, 2003 at 12:11:15PM +0330, Arash Bijanzadeh wrote:
> I faced a problem with unicode and gotto go inside of this technoligy. I 
> noticed that for example when I am inserting a Arabic_khah with the unicode 
> 0f 062E two bytes with values D8 AE would be inserted to the file. Can 
> anybody tell me what is the relationship between these two value, and how 
> does it work.

D8 AE is the UTF-8 encoding of the Unicode character U+D8AE ("ARABIC
LETTER KHAH"). UTF-8 is the de facto standard encoding of Unicode on
Unix systems, partly due to its compatibility with US-ASCII. See RFC
2279 for the definition of UTF-8.


Colin Watson                                  [cjwatson@flatline.org.uk]

Reply to: