[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Proposal -- Interpretation of DFSG on Artificial Intelligence (AI) Models



Holger Levsen dijo [Tue, Apr 22, 2025 at 05:16:51PM +0000]:
On Sat, Apr 19, 2025 at 01:56:17PM -0400, M. Zhou wrote:
We acknowledge that releasing useful AI models under permissive licenses like
MIT/Expat and Apache-2.0 is a generous act from the original authors due to
huge costs, and it is a great contribution to the software ecosystem and the
society. We sincerely respect the respective authors' work.

i'm not sure i can subscribe to this. after all, most if not all "AI" models
exist because of stealing other peoples work...

or did i miss consentual "AIs"?

AI models _can_ be prepared with legally distributable content. And that
would be very adequate in this context. If we had an LLM trained _only_,
say, on the Wikipedia and the books of Project Gutenberg... Of course, it
would not be a generally-queriable chatbot, but it could be just-enough for
sustaining some level of dialogue. It could be distributed in our main
section. And if you "post-train" that very little model with the
domain-specific documents/knowledge you want to work in, you can find clear
value off a truly-free LLM.

Of course, I don't know if the training set I'm proposing would be anywhere
near enough. And I cannot say we have buildds with enough GPU power to
perform the training. But it's just an example that can surely be "beat
into correctness".


Reply to: