Bug#616521: ITP: libhtml-html5-sanity-perl -- make HTML5 DOM trees less insane
Package: wnpp
Severity: wishlist
Owner: Jonas Smedegaard <dr@jones.dk>
* Package name : libhtml-html5-sanity-perl
Version : 0.101
Upstream Author : Toby Inkster <tobyink@cpan.org>
* URL : http://search.cpan.org/dist/HTML-HTML5-Sanity/
* License : Artistic or GPL-1+
Programming Lang: Perl
Description : make HTML5 DOM trees less insane
HTML::HTML5::Sanity is a Perl library for tidying the DOM trees
generated by HTML::HTML5::Parser.
.
The Document Object Model (DOM) generated by HTML::HTML5::Parser meets
the requirements of the HTML5 spec, but will probably catch a lot of
people by surprise.
.
The main oddity is that elements and attributes which appear to be
namespaced are not really. For example, the following element:
.
<div xml:lang="fr">...</div>
.
This looks like it should be parsed so that it has an attribute "lang"
in the XML namespace. Not so: it is actually parsed as having the
attribute "xml:lang" in the null namespace.
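
The distinction described above can be sketched at the DOM level. The snippet below is only an illustration under stated assumptions: it uses Python's standard xml.dom.minidom rather than HTML::HTML5::Parser or HTML::HTML5::Sanity, and it builds both attribute variants by hand instead of parsing HTML. The helper make_div and the variable names are hypothetical.

```python
from xml.dom import minidom

# Namespace URI bound to the reserved "xml" prefix.
XML_NS = "http://www.w3.org/XML/1998/namespace"


def make_div():
    """Return a fresh <div> element in a throwaway document (illustrative helper)."""
    doc = minidom.getDOMImplementation().createDocument(None, "div", None)
    return doc.documentElement


# Case 1: the shape the description attributes to the HTML5 parser:
# an attribute literally named "xml:lang" in the null namespace.
as_parsed = make_div()
as_parsed.setAttribute("xml:lang", "fr")

# Case 2: the shape most people expect: "lang" in the XML namespace.
as_expected = make_div()
as_expected.setAttributeNS(XML_NS, "xml:lang", "fr")

for label, div in (("as parsed", as_parsed), ("as expected", as_expected)):
    attr = div.getAttributeNode("xml:lang")
    print(label)
    print("  serialized:          ", div.toxml())
    print("  attribute namespace: ", attr.namespaceURI)
    print("  found in XML ns:     ", div.hasAttributeNS(XML_NS, "lang"))
```

Both elements serialize to the same markup, yet only the second answers a namespace-aware lookup for "lang" in the XML namespace. That gap is the kind of surprise the description refers to.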