[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Proposal -- Interpretation of DFSG on Artificial Intelligence (AI) Models



Hi all,

On Fri May 16, 2025 at 8:03 AM CEST, Stefano Zacchiroli wrote:
FWIW, I agree that "where is it hosted?" is a less important question wrt the one of whether the full/pristine training dataset is available, for our users, *somewhere* in the first place. But note that if Debian accepts not to host datasets on its own infrastructure, then a number of practical issues arises, e.g., what do we do with the package in main if/when the data disappears from the external hosting place?

This might have been asked before, but: wouldn't this be the perfect use case for the contrib archive area? The model complies with the DFSG, but requires software outside of the distribution to build.

Also, I had the impression that sometimes in this discussion "DFSG-free model" and "model in the main archive area" have been used as synonymous, while they are not. We can decide that a model is DFSG-free if its training data is provided by their authors, but still keep data outside of our archive and have the model live in contrib for as long as also hosting the training data is inconvenient for us.

Oh, and lastly: not all models are huge! (and we already have some fairly big packages in our archives)

Does it make sense?

Bye!


Reply to: