Bug#657278: ITP: python-scrapelib -- library for scraping websites

To: submit@bugs.debian.org
Subject: Bug#657278: ITP: python-scrapelib -- library for scraping websites
From: Alex Chiang <achiang@canonical.com>
Date: Tue, 24 Jan 2012 22:24:17 -0800
Message-id: <20120125062416.GD15260@canonical.com>
Reply-to: Alex Chiang <achiang@canonical.com>, 657278@bugs.debian.org

Package: wnpp
Severity: wishlist
Owner: Alex Chiang <achiang@canonical.com>

* Package name    : python-scrapelib
  Version         : 0.5.6
  Upstream Author : Michael Stephens <mstephens@sunlightfoundation.com>
                    James Turk <jturk@sunlightfoundation.com>
* URL             : https://github.com/sunlightlabs/scrapelib
* License         : BSD-3-clause
  Programming Lang: Python
  Description     : library for scraping websites

It builds those binary packages:

python-scrapelib - library for scraping websites
scrapeshell - ipython shell to examine python-scrapelib results

Long description:
 At its simplest provides a replacement for urllib2’s urlopen functionality
 but can do much more.
 .
 Advantages of using scrapelib over urllib2 or httplib2 include:
 .
   * HTTP, HTTPS, FTP requests via an identical API
   * HTTP caching, compression and cookies
   * redirect following
   * request throttling
   * robots.txt compliance (optional)
   * robust error handling

Reply to:

Prev by Date: Bug#656100: ITA: fsvs -- Full system versioning with metadata support
Next by Date: Processed: tagging as pending bugs that are closed by packages in NEW
Previous by thread: Bug#656100: ITA: fsvs -- Full system versioning with metadata support
Next by thread: Processed: retitle 657020 to ITP: ruby-childprocess -- Ruby gem that aims at being a simple and reliable solution for controlling external programs ...
Index(es):
- Date
- Thread