Can anyone suggest a simple way to strip vowels out of utf-8 encoded
hebrew text, leaving just the consenants?
i.e., given something like בָָּ֟֟רָא, pipe it through something so that the
output is ברא. The unicode characters <U+0591> to <U+05C7> ideally
should be stripped. This includes accents, etc.
cheers,
dc
--
David Purton
dcpurton@marshwiggle.net
For the eyes of the LORD range throughout the earth to
strengthen those whose hearts are fully committed to him.
2 Chronicles 16:9a
Attachment:
signature.asc
Description: Digital signature