On Tue, May 06, 2025 at 12:02:08AM +0200, Aigars Mahinovs wrote:
The transformative criteria here is that the resulting work needs to be
transformed in such a way that it adds value. And generating new texts
from a LLM is pretty clearly a value-adding transformation compared to
the original articles. Even more so than the already ruled-on Google
Books case.
OK, let me change it around a bit, because I don't think this discussion
is going in any direction that is relevant for Debian.
The only way in which you can build a model is by taking loads and loads
of data, running some piece of software over it, and storing the result
somewhere.
How can we do this legally, reproducibly, and openly if we do not have
the rights to redistribute the said "loads and loads of data"?
The answer is, we can't.
(...)