Bug#1125002: ITP: tinyhtml5 -- a tiny HTML5 parser
Quoting Emilio Pozuelo Monfort (2026-01-09 12:55:13)
> On 08/01/2026 09:45, Stéphane Glondu wrote:
> > Package: wnpp
> > Severity: wishlist
> > Owner: Stéphane Glondu <glondu@debian.org>
> > X-Debbugs-Cc: debian-devel@lists.debian.org, debian-python@lists.debian.org
> >
> > * Package name : tinyhtml5
> > Version : 2.0.0
> > Upstream Contact: Guillaume Ayoub
> > * URL : https://github.com/CourtBouillon/tinyhtml5
> > * License : Expat
> > Programming Lang: Python
> > Description : a tiny HTML5 parser
> >
> > tinyhtml5 is an HTML5 parser that transforms a possibly malformed HTML
> > document into an ElementTree tree.
> >
> > This is a new dependency of weasyprint, see:
> >
> > https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1122284#20
> >
> > This is a fork of html5lib, which is packaged in Debian. I've started
> > the Debian package of tinyhtml5 based on html5lib's one.
>
> What are the differences with html5lib, and the reason that a fork was needed?
The question seems to be answered here:
https://doc.courtbouillon.org/tinyhtml5/latest/going_further.html

I recommend (and I guess Emilio implied as much) including a short
version of that answer in the package's long description.
- Jonas
--
* Jonas Smedegaard - idealist & Internet-arkitekt
* Tlf.: +45 40843136 Website: http://dr.jones.dk/
* Sponsorship: https://ko-fi.com/drjones
[x] quote me freely [ ] ask before reusing [ ] keep private