[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#1017080: ITP: translitcodec -- Python Unicode to 8-bit charset transliteration codec



Package: wnpp
Severity: wishlist
Owner: Edward Betts <edward@4angle.com>
X-Debbugs-Cc: debian-devel@lists.debian.org, debian-python@lists.debian.org

* Package name    : translitcodec
  Version         : 0.7.0
  Upstream Author : Jason Kirtland <jek@discorporate.us>
* URL             : https://github.com/claudep/translitcodec
* License         : MIT
  Programming Lang: Python
  Description     : Unicode to 8-bit charset transliteration codec

  This package contains codecs for transliterating ISO 10646 texts into
  best-effort representations using smaller coded character sets (ASCII,
  ISO 8859, etc.).  The translation tables used by the codecs are from
  the 'transtab' collection by Markus Kuhn.
  .
  Three types of transliterating codecs are provided:
  .
  "long", using as many characters as needed to make a natural replacement.
  For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS 'ä' will be replaced
  with 'ae'.
  .
  "short", using the minimum number of characters to make a replacement.
  For example, \u00e4 LATIN SMALL LETTER A WITH DIAERESIS 'ä' will be replaced
  with 'a'.
  .
  "one", only performing single character replacements.  Characters that can
  not be transliterated with a single character are passed through unchanged.
  For example, \u2639 WHITE FROWNING FACE '☹' will be passed through unchanged.
 
I plan to maintain this package as part of the Python team.


Reply to: