Bug#1022944: ITP: python-extruct -- library for extracting embedded metadata from HTML markup
Package: wnpp
Severity: wishlist
Owner: Christian Marillat <marillat@debian.org>
X-Debbugs-Cc: debian-devel@lists.debian.org
* Package name : python-extruct
Version : 0.14.0
Upstream Author : Scrapinghub
* URL : https://github.com/scrapinghub/extruct
* License : BSD-3
Programming Lang: Python
Description : library for extracting embedded metadata from HTML markup
Currently, extruct supports:
W3C's HTML Microdata
embedded JSON-LD
Microformat via mf2py
Facebook's Open Graph
(experimental) RDFa via rdflib
Dublin Core Metadata (DC-HTML-2003)
This package is a dependency for recipe-scrappers python package.
Reply to: