Linus Nordberg <linus@glasklarteknik.se> writes: > Richard Reinick <richard.reinick@stud.hs-merseburg.de> wrote > Tue, 11 Mar 2025 10:50:58 +0100: > >> to historical package data. I would like to inquire about the >> possibility of extracting specific data in large quantities from the >> Debian Snapshot Archive for my research purposes. I noticed that there > > Hi Richard, > > Thanks for reaching out. > > What type(s) of data and what volumes are you planning to download? > > As pointed out in another response to your request, it might make sense > for you to ask for (a copy of) the metadata kept in the database. Could the snapshot team make those public? It is harder than it should be to mirror snapshot locally. You have to screenscrape the web interface to get full data. This creates unnecessary load, so it would be nice if at least the list of filenames (essentially SHA1 hashes) could be published. Right now this information is hidden. As far as I understood earlier discussions on this, that hiding is intentional (for reasons I couldn't understand). /Simon
Attachment:
signature.asc
Description: PGP signature