On Tue, May 06, 2025 at 08:36:50AM -0700, Russ Allbery wrote: > But, more directly to your point, I agree with you, but I don't understand > why this implies that it's necessary to put non-free data in Debian main. > I can exploit all sorts of non-open data from my Debian computer by > obtaining it from any number of other sources. I don't see the need for > Debian to host it. On Wed, May 07, 2025 at 05:23:21PM +0000, Clint Adams wrote: > So what is your preference and what would you want to see happen? > I ask because I see no good options here. I am thinking about > this from the perspective of a user who wants to use the models > unmodified and from the perspective of a user who wants to > modify the models to work better with a face that the models > "consider" an outlier. What I strongly suspect would happen, if proposal A wins (which I also consider quite likely) is that Debian maintainers of free software products that use trained ML models that lack DFSG-free training data, will have to go down the rabbit hole of patching those software to systematically download the models on first use. Or just give up on maintaining those packages, of course. Answering Russ upthread, I understand very well how such a situation will make us Debian people fell well, because we are not hosting it. But I fail to say how this helps in delivering software freedom to our users. First, they will have the models in question anyway, probably automatically so we will really not be "protecting" them from this eveil OSAID-but-not-DFSG-free stuff. (Or are we going to rule that free software that does this cannot be in main too?) Second, it will be more work for our maintainers, and deliver an overall worse experience in terms of security, mirroring, etc. Finally, we will also be making things harder for people that are fine with the limited modifications that are possible without the training data (e.g., fine tuning) as they will not be able to find the full sources (that are enough for their needs) within the Debian archive. Cheers -- Stefano Zacchiroli . zack@upsilon.cc . https://upsilon.cc/zack _. ^ ._ Full professor of Computer Science o o o \/|V|\/ Télécom Paris, Polytechnic Institute of Paris o o o </> <\> Co-founder & CSO Software Heritage o o o o /\|^|/\ Mastodon: https://mastodon.xyz/@zacchiro '" V "'
Attachment:
signature.asc
Description: PGP signature