Re: Request to get Permission for Data extraction
Richard Reinick <richard.reinick@stud.hs-merseburg.de> wrote
Wed, 12 Mar 2025 10:21:28 +0100:
> I need the *.debian.tar.gz and *.dsc files from pool/main/* for all
> versions of the respective packages, in other words the metadata files
> of the packages. Additionally, I need information on when a package
> was part of the stable, testing, and unstable branches of a Debian
> version. For this, I require the changelogs and the package lists
> located under /dists/<stable, testing, unstable>/.
>
> I am unsure about the total volume of data I need to download.
>
> If possible, I would gladly request a copy of the database.
The database in the context of the snapshot service is the snapshot
metadata kept in an SQL database, currently about 100G (on disk). The
actual packages are kept in what's called the farm which currently
weighs in at 184T.
Cf. https://salsa.debian.org/snapshot-team/snapshot/-/blob/master/README
The snapshot metadata alone might help you but I suspect that you will
want more. Are you aware of https://metasnap.debian.net/?
Reply to: