
Bug#623016: ITP: python-tidylib -- Python wrapper for HTML Tidy (tidylib)



Package: wnpp
Severity: wishlist
Owner: Janos Guljas <janos@resenje.org>

* Package name    : python-tidylib
  Version         : 0.2.1
  Upstream Author : Jason Stitt <js@jasonstitt.com>
* URL             : http://countergram.com/open-source/pytidylib/
* License         : MIT/X11 (BSD-like)
  Programming Lang: Python
  Description     : Python wrapper for HTML Tidy (tidylib)

 PyTidyLib is a Python package that wraps the HTML Tidy library. It
 lets Python code “fix” invalid (X)HTML markup. The library’s many
 capabilities include:

  * Clean up unclosed tags and unescaped characters such as ampersands
  * Output HTML 4 or XHTML, strict or transitional, and add missing
    doctypes
  * Convert named entities to numeric entities, which can then be used
    in XML documents without an HTML doctype
  * Clean up HTML from programs such as Word (to an extent)
  * Indent the output, including proper (i.e. no) indenting for pre
    elements, which some (X)HTML indenting code overlooks
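
 As a rough illustration of the capabilities listed above, the following
 is a minimal sketch of how the wrapper might be called. It assumes the
 pytidylib package and the HTML Tidy shared library are both installed.
 The helper name clean_markup and the TIDY_OPTIONS dictionary are
 hypothetical and chosen only for this example; the option names are
 standard HTML Tidy configuration options.

```python
# Sketch only: needs the pytidylib package and the HTML Tidy shared library.
try:
    from tidylib import tidy_document
except (ImportError, OSError):
    # Package or shared library missing; the sketch then returns None.
    tidy_document = None

# Hypothetical option set for this example. The keys are HTML Tidy
# configuration options that pytidylib passes through to the library.
TIDY_OPTIONS = {
    "output-xhtml": 1,      # emit XHTML rather than HTML 4
    "doctype": "strict",    # add a strict doctype if one is missing
    "numeric-entities": 1,  # convert named entities to numeric ones
    "indent": 1,            # indent the output
}


def clean_markup(markup):
    """Return (cleaned_document, error_report), or None if Tidy is unavailable."""
    if tidy_document is None:
        return None
    try:
        return tidy_document(markup, options=TIDY_OPTIONS)
    except OSError:
        # Some versions only load the shared library on first use.
        return None


# Input with an unclosed <p> tag and an unescaped ampersand.
result = clean_markup("<p>fish & chips")
if result is not None:
    document, errors = result
    print(document)  # the repaired markup
    print(errors)    # Tidy's warnings and errors, as text
```

 The error report is returned as text alongside the cleaned document, so
 a caller can choose to log it or ignore it.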


