Re: Proposal -- Interpretation of DFSG on Artificial Intelligence (AI) Models
Hi all,
On Fri May 16, 2025 at 8:03 AM CEST, Stefano Zacchiroli wrote:
FWIW, I agree that "where is it hosted?" is a less important question
wrt the one of whether the full/pristine training dataset is
available, for our users, *somewhere* in the first place. But note
that if Debian accepts not to host datasets on its own infrastructure,
then a number of practical issues arises, e.g., what do we do with the
package in main if/when the data disappears from the external hosting
place?
This might have been asked before, but: wouldn't this be the perfect
use case for the contrib archive area? The model complies with the DFSG,
but requires software outside of the distribution to build.
Also, I had the impression that sometimes in this discussion "DFSG-free
model" and "model in the main archive area" have been used as
synonymous, while they are not. We can decide that a model is DFSG-free
if its training data is provided by their authors, but still keep data
outside of our archive and have the model live in contrib for as long as
also hosting the training data is inconvenient for us.
Oh, and lastly: not all models are huge! (and we already have some
fairly big packages in our archives)
Does it make sense?
Bye!
Reply to: