[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Broadinstitute catch - elegant way to disable tests requiring online access?



Hi Nilesh,

Thank you tons for thinking along.

On 27.07.21 17:47, Nilesh Patra wrote:

On Tue, 27 Jul, 2021, 9:02 pm Steffen Möller, <steffen_moeller@gmx.de> wrote:
Hello,

This package is one of the closest that we have on our radar to suit
virologists, to I went for CATCH from the broad institute at
https://github.com/broadinstitute/catch . Conda has it already, does not
surface on bio.tools, though.

Testing takes quite a while (~15mins) and the few that fail seem to
depend on viral sequence data that cannot be downloaded. So, I consider
the package functional, but don't really know what to do with the test
that fail because they have to fail.

Are there suggestions from your sides what to do about them?

On another note, when I just wanted to inject it all to salsa, it failed
to create the pristine-tar:

> [...]


That file is small

$ du -sh catch/datasets/data/achimota_rubulavirus_1.fasta.gz
8.5K    catch/datasets/data/achimota_rubulavirus_1.fasta.gz

so there is no need for git-lsf in the first place, right?

Yes, no need. It looks weird on first look though

Any idea?

If you have the sources in a non-git repository could you do something like:

$ dpkg-source -b .
$ cd ..
$ gbp import-dsc --pristine-tar <your .dsc file>

And see what you get?

> <snip>
> Huge test log
> </snip>

Some of these don't work due to a missing .tsv file, others because they try to access internet w/ urllib, few others due to a missing fasta file.

But it's difficult to debug till we have a repository in salsa to look stuff up and fix, so if you could commit, it'd be easy to move forward

It took me a bit but. Too long. The answer is that git does not lose the sensation of a file being a git lfs reference even when you download a tar.gz. For some reason I had expected that all genomes were truly gzipped fasta files, but no, they were still references. Maybe I had inadvertently transformed a few to what they point to during a first build and that is why I then did not find the issue a bit earlier.

The import of pristine-tar has worked after removing .gitattributes, but then the git lfs references were still in the tarball and the upstream branch. pristine-tar could be pushed, but then the other branches would trigger git lfs errors when pushed to salsa. Only after having all fasta.gz lfs files removed, the upload went smoothly and you all now find this on https://salsa.debian.org/med-team/broad-catch .

Tests seem to do just as fine as before, still.

Best,

Steffen


Reply to: