[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

RFS: unidesc - Tools for finding out what is in a Unicode file.



Hi,
I'd be glad if someone can sponsor me.
Here's my work: http://home.foolab.org/debs/unidesc/2.15.1/1/
ITP #337498

Here's the description I'm using:
 They are useful when working with Unicode files when one doesn't know
 the writing system, doesn't have the necessary font, needs to inspect
 invisible characters, needs to find out whether characters have been
 combined or in what order they occur, or needs statistics on which
 characters occur.
 .
 * uniname defaults to printing the character offset of each character,
   its byte offset, its hex code value, its encoding, the glyph itself,
   and its name. It may also be used to validate UTF-8 input.
 * unidesc reports the character ranges to which different portions of the
   text belong. It can also be used to identify Unicode encodings
   (e.g. UTF-16be) flagged by magic numbers.
 * unihist generates a histogram of the characters in its input.
 * ExplicateUTF8 is intended for debugging or for learning about Unicode.
   It determines and explains the validity of a sequence of bytes as a UTF8
   encoding.

All the best,

-- 
----------------
-- Katoob Main Developer, Arabbix Maintainer.
GNU/Linux registered user #224950
Proud Egyptian GNU/Linux User Group <www.eglug.org> Admin.
Life powered by Debian, Homepage: www.foolab.org
--
Don't send me any attachment in Micro$oft (.DOC, .PPT) format please
Read http://www.gnu.org/philosophy/no-word-attachments.html
Preferable attachments: .PDF, .HTML, .TXT
Thanx for adding this text to Your signature

Attachment: signature.asc
Description: Digital signature


Reply to: