
Bug#299830: ITP: python-beautifulsoup -- error-tolerant HTML parser for Python



Package: wnpp
Severity: wishlist
Owner: Decklin Foster <decklin@red-bean.com>

* Package name    : python-beautifulsoup
  Version         : 1.2+cvs20041017
  Upstream Author : Leonard Richardson <leonardr@segfault.org>
* URL             : http://www.crummy.com/software/BeautifulSoup/
* License         : Python
  Description     : error-tolerant HTML parser for Python

The BeautifulSoup class turns arbitrarily bad HTML into a tree-like
nested list of Tag objects and text snippets. A Tag object
corresponds to an HTML tag. It knows the tag's attributes and
contains a representation of everything between the opening tag and
its closing tag (if any). This makes it easy to extract the Tags
that meet given criteria.
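
To illustrate the idea of a tolerant Tag tree, here is a minimal sketch
built on Python's standard html.parser module. It is not BeautifulSoup's
API or implementation: the names Tag, SoupBuilder, parse and find_all are
hypothetical, chosen only for this example.

```python
from html.parser import HTMLParser


class Tag:
    """Hypothetical stand-in for a Tag object: name, attributes, children."""

    def __init__(self, name, attrs=None):
        self.name = name
        self.attrs = dict(attrs or [])
        self.contents = []  # nested Tag objects and text snippets (str)

    def find_all(self, predicate):
        """Return every descendant Tag for which predicate(tag) is true."""
        found = []
        for child in self.contents:
            if isinstance(child, Tag):
                if predicate(child):
                    found.append(child)
                found.extend(child.find_all(predicate))
        return found


class SoupBuilder(HTMLParser):
    """Builds a Tag tree, tolerating unclosed and stray closing tags."""

    def __init__(self):
        super().__init__()
        self.root = Tag("[document]")
        self.stack = [self.root]  # currently open tags, innermost last

    def handle_starttag(self, name, attrs):
        tag = Tag(name, attrs)
        self.stack[-1].contents.append(tag)
        self.stack.append(tag)

    def handle_data(self, data):
        self.stack[-1].contents.append(data)

    def handle_endtag(self, name):
        # Close the nearest open tag with this name, implicitly closing
        # anything opened inside it; a closer with no match is ignored.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].name == name:
                del self.stack[i:]
                break


def parse(html):
    builder = SoupBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


# Malformed input: <b> is never closed and </i> has no opening tag.
tree = parse('<p>See <a href="http://example.org/">this <b>link</a></i></p>')
links = tree.find_all(lambda t: t.name == "a" and "href" in t.attrs)
```

Here links holds the single <a> Tag, whose contents are the text
snippet "this " followed by the implicitly closed <b> Tag. The sketch
does not special-case void elements such as <br>, which a real
error-tolerant parser would need to handle.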


