[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: wget and captcha puzzle !!!



> So when I download the captcha, that very step also reload a new
> captcha.

If the captcha has been properly implemented there's no way around
solving it.  Look at:

Package: slimrat-nox
Source: slimrat
Version: 1.0-1
Installed-Size: 336
Maintainer: Paul McEnery <pmcenery@gmail.com>
Architecture: all
Depends: perl, libwww-perl, libwww-mechanize-perl, libcrypt-ssleay-perl, aview, imagemagick, tesseract-ocr
Description: CLI application for automated downloading from file hosters
 Provides a command-line interface for automatically downloading files
 from hosting providers. Slimrat is also capable of captcha solving using
 tesseract for optical character recognition. Support includes, but is
 not limited to the following file hosters:
 .
    * data.hu
    * www.depositfiles.com
    * www.easy-share.com
    * www.fast-load.net
    * www.fast-share.com
    * www.hotfile.com
    * leteckaposta.cz
    * www.mediafire.com
    * www.megaupload.com
    * odsiebie.najlepsze.net
    * www.rapidshare.com
    * sharebase.to
    * uploaded.to
    * www.youtube.com
 .
 This package provides the command-line user interface
Homepage: http://code.google.com/p/slimrat/

And:

Package: tesseract-ocr
Priority: optional
Section: graphics
Installed-Size: 3184
Maintainer: Jeffrey Ratcliffe <Jeffrey.Ratcliffe@gmail.com>
Architecture: amd64
Source: tesseract
Version: 2.04-2+squeeze1
Replaces: tesseract-ocr-data
Depends: libc6 (>= 2.2.5), libgcc1 (>= 1:4.1.1), libjpeg62 (>= 6b1), libstdc++6 (>= 4.1.1), libtiff4, zlib1g (>= 1:1.1.4), tesseract-ocr-eng | tesseract-ocr-language
Filename: pool/main/t/tesseract/tesseract-ocr_2.04-2+squeeze1_amd64.deb
Size: 1026476
MD5sum: 4391d9a7cc1e1ff13996ec2011bf8f9a
SHA1: bfaa49414b480c30661973d6d0fc7be9794ca0e9
SHA256: ac6738f86e353d7e2cff9ebfcc4f29dbacbdd4b3abf5088bd89d066f6c65468a
Description: Command line OCR tool
 The Tesseract OCR engine was originally developed at HP between 1985 and 1995.
 It was open-sourced by HP and UNLV in 2005 and Google has lead further
 development.
 .
 The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV
 Accuracy test.  Between 1995 and 2006 it had little work done on it, but it
 is probably one of the most accurate open source OCR engines available.  It
 will read a binary, grey or color image and output text.
Homepage: http://code.google.com/p/tesseract-ocr/


-- 
John Hasler


Reply to: