[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

OT: strip hebrew vowels and accents from utf-8 text



Can anyone suggest a simple way to strip vowels out of utf-8 encoded
hebrew text, leaving just the consenants?

i.e., given something like בָָּ֟֟רָא, pipe it through something so that the
output is ברא. The unicode characters <U+0591> to <U+05C7> ideally
should be stripped. This includes accents, etc.

cheers,

dc

-- 
David Purton
dcpurton@marshwiggle.net
 
For the eyes of the LORD range throughout the earth to
strengthen those whose hearts are fully committed to him.
                                 2 Chronicles 16:9a

Attachment: signature.asc
Description: Digital signature


Reply to: