[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#924040: #924040 ITP: archivebox -- open source self-hosted web archive



Hi Martin,

On Thursday, 12 March 2020 9:14:18 AM AEDT Martin wrote:
> As anarcat, I'm too busy with other things now, but I'm starting
> to test your draft package right now. Observations so far:

Thanks a lot for the feedback.


>  - the link "https://nicksweeting.com/images/archive.png"; in
>    archivebox/templates/link_index.html should be replaced with
>    a local copy

Upstream addressed that already so I've packaging a new upstream snapshot to 
pick up changes.


>  - when starting archivebox, I'm getting a strange message:
>    "fatal: not a git repository (or any of the parent directories): .git"

That was a version detection implying that archivebox executable is running 
from git repository. I've patched away this logic and embedded package 
version.


>  - archivebox likes to write to /usr/share/output/, which
>    probably should be ~/.cache/archivebox/

Indeed it is an inconvenient default even though it is customizable by 
setting "OUTPUT_DIR" environment variable or in "~/.ArchiveBox.conf" as per 
template in 

  /usr/share/doc/archivebox/examples/ArchiveBox.conf.default

I've patched ArchiveBox to use current working directory.
"~/.cache/archivebox/" feels oddly specific...


>  - I'm trying to archive www.debian.org using the command
>    echo https://www.debian.org/ | archivbox
>    but get the error:
>    "! Failed to archive link: KeyError: 'domain'"
>    and the resulting subdirectory
>    /usr/share/output/archive/1583964646 remains empty
 
It has been fixed by recent updates to packaging. I could successfully 
archive some web sites but archived debian.org seems to CSS so the menu is 
not rendered properly...


> I'm desperately in need of something replacing scrapbook. If you
> are working on archivebox, I'ld in turn test the package and
> send bug reports, maybe even with patches.

I need Scrapbook alternative as well but I'll be working slowly towards that 
goal due to pressure from other priorities. I'm yet to learn how to use 
ArchiveBox... Also upstream prepares some serious changes for next release -- 
I hope it won't be too difficult to package. I'd appreciate any help with 
ArchiveBox.

Did you have a look at Archivematica by any chance?

-- 
Regards,
 Dmitry Smirnov.

---

Facebook in particular is the most appalling spying machine that has ever
been invented.
        -- Julian Assange

Attachment: signature.asc
Description: This is a digitally signed message part.


Reply to: