[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Request to get Permission for Data extraction



Linus Nordberg <linus@glasklarteknik.se> writes:

> Richard Reinick <richard.reinick@stud.hs-merseburg.de> wrote
> Tue, 11 Mar 2025 10:50:58 +0100:
>
>> to historical package data. I would like to inquire about the
>> possibility of extracting specific data in large quantities from the
>> Debian Snapshot Archive for my research purposes. I noticed that there
>
> Hi Richard,
>
> Thanks for reaching out.
>
> What type(s) of data and what volumes are you planning to download?
>
> As pointed out in another response to your request, it might make sense
> for you to ask for (a copy of) the metadata kept in the database.

Could the snapshot team make those public?

It is harder than it should be to mirror snapshot locally.  You have to
screenscrape the web interface to get full data.  This creates
unnecessary load, so it would be nice if at least the list of filenames
(essentially SHA1 hashes) could be published.  Right now this
information is hidden.  As far as I understood earlier discussions on
this, that hiding is intentional (for reasons I couldn't understand).

/Simon

Attachment: signature.asc
Description: PGP signature


Reply to: