[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#1053134: ITP: python-cloudscraper -- Python module to bypass Cloudflare's anti-bot page

Package: wnpp
Severity: wishlist
Owner: Carles Pina i Estany <carles@pina.cat>
X-Debbugs-Cc: debian-devel@lists.debian.org

* Package name    : python-cloudscraper
  Version         : 1.2.68
  Upstream Contact: VeNoMouS
* URL             : https://github.com/VeNoMouS/cloudscraper
* License         : MIT
  Programming Lang: Python
  Description     : Python module to bypass Cloudflare's anti-bot page

A simple Python module to bypass Cloudflare's anti-bot page (also known
as "I'm Under Attack Mode", or IUAM), implemented with Requests.
Cloudflare changes their techniques periodically, so I will update this
repo frequently.

This can be useful if you wish to scrape or crawl a website protected
with Cloudflare. Cloudflare's anti-bot page currently just checks if the
client supports Javascript, though they may add additional techniques in
the future.

Due to Cloudflare continually changing and hardening their protection
page, cloudscraper requires a JavaScript Engine/interpreter to solve
Javascript challenges. This allows the script to easily impersonate a
regular web browser without explicitly deobfuscating and parsing
Cloudflare's Javascript.

I ITP simplemonitor (#1016113). One of simplemonitor dependencies is
pyaarlo (#1053132). python-cloudscraper is a dependency of pyaarlo.

Incidentally, I've used cloudscraper (from upstream) previously so I'm
familiar with it.

I plan to package it inside the Debian Python Team. I will need a sponsor.

Reply to: