Re: Datasets to design autopkgtests for our packages

Andrius Merkys <merkys@debian.org> writes:

> It depends whether you need simple protein FASTA sequences or
> alignments. You may find simple sequences in PDB, for example [1], go to
> "Download Files" and FASTA format is just there. AFAIR, PDB data is
> freely distributable.

Likewise, everything in GenBank/GenPept, ENA, and DDBJ is freely
distributable per [2], as is everything in UniProt per [3].

> [1] https://www.rcsb.org/structure/6fti

[2] https://www.insdc.org/policy.html
[3] https://www.uniprot.org/help/license
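
In case it helps when wiring such data into a test, here is a minimal
sketch of a sanity check for a FASTA file, assuming Python is available
in the test environment. The function names and the sample records are
made up for illustration and are not taken from any of the databases
above; the alphabet check only accepts the 20 standard residues plus X,
which may be too strict for some entries.

```python
# Hypothetical helper for a test script: parse FASTA text and check that
# each record looks like a protein sequence.

STANDARD_RESIDUES = set("ACDEFGHIKLMNPQRSTVWYX")  # assumption: 20 standard + X


def parse_fasta(text):
    """Return a list of (header, sequence) tuples from FASTA-formatted text."""
    records = []
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:]
            chunks = []
        elif header is None:
            raise ValueError("sequence data found before the first header")
        else:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def is_protein(seq):
    """True if seq is non-empty and uses only the accepted residue letters."""
    return bool(seq) and set(seq.upper()) <= STANDARD_RESIDUES


# Made-up sample input, not real database content.
SAMPLE = """>example_1 hypothetical protein
MKTAYIAKQR
QISFVKSHFS
>example_2 hypothetical protein
GSHMLEDPVD
"""

if __name__ == "__main__":
    for name, seq in parse_fasta(SAMPLE):
        print(name, len(seq), is_protein(seq))
```

A test could run this over the downloaded file and fail if any record is
empty or contains unexpected characters.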

Aaron M. Ucko, KB1CJC (amu at alum.mit.edu, ucko at debian.org)
http://www.mit.edu/~amu/ | http://stuff.mit.edu/cgi/finger/?amu@monk.mit.edu

