[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#949506: RFP: wayback-machine-downloader -- Download an entire website from the Wayback Machine.



owner 834177 anarcat@debian.org
thanks

On 2020-01-21 16:26:31, Alessandro Barbieri wrote:
> Package: wnpp
> Severity: wishlist
>
> * Package name    : wayback-machine-downloader
>   Version         : 2.2.1
>   Upstream Author : Julian Khaleghy
> * URL             : https://github.com/hartator/wayback-machine-downloader
> * License         : MIT
>   Programming Lang: Ruby
>   Description     : Download an entire website from the Wayback Machine.
>
> It will download the last version of every file present on Wayback Machine to ./websites/example.com/. It will also re-create a directory structure and auto-create index.html pages to work seamlessly with Apache and Nginx. All files downloaded are the original ones and not Wayback Machine rewritten versions. This way, URLs and links structure are the same as before.

Just to let you know that there's also a Python version of this that's
called "wayback" and seems slightly better maintained, although I've
heard this one (wayback-machine-downloader) might be "better", it's
clear the repo mentioned about has been inactive for years and one must
find The Right Fork going forward.

For now I'll be taking a look at the python version, wayback.

A.

-- 
We must shift America from a needs- to a desires-culture. People must
be trained to desire, to want new things, even before the old have
been entirely consumed. Man's desires must overshadow his needs.
                         - Paul Mazur, Lehman Brothers


Reply to: