[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#612875: ITA: beautifulsoup -- error-tolerant HTML parser for Python



Package: wnpp
Severity: normal

In response to the "general RFA" at [0], I'll adopt this package.

The package description is:
 The BeautifulSoup class turns arbitrarily bad HTML into a tree-like nested
 tag-soup list of Tag objects and text snippets. A Tag object corresponds to
 an HTML tag.  It knows about the HTML tag's attributes, and contains a
 representation of everything contained between the original tag and its
 closing tag (if any). It's easy to extract Tags that meet certain criteria.

David

[0]: http://lists.debian.org/debian-devel/2011/02/msg00217.html

-- 
 . ''`.   Debian developer | http://wiki.debian.org/DavidPaleino
 : :'  : Linuxer #334216 --|-- http://www.hanskalabs.net/
 `. `'`  GPG: 1392B174 ----|---- http://deb.li/dapal
   `-   2BAB C625 4E66 E7B8 450A C3E1 E6AA 9017 1392 B174

Attachment: signature.asc
Description: PGP signature


Reply to: