Ceppo's message accidentally didn't include debian-wiki, so it is attached here.
--- Begin Message ---
- To: Maytham Alsudany <maytham@debian.org>
- Subject: Re: CC BY-SA and AI training?
- From: Ceppo <ceppo@oziosi.org>
- Date: Sat, 26 Jul 2025 16:50:13 +0000
- Message-id: <ln7obvxoh6mhh7ueysyntbjbm2y4djon5pmjm5ioxhlolxz4sj@cf4q3cwzlzkl>
- In-reply-to: <[🔎] 82a23f926eeb5ac230ff0d60222c653c1c379e15.camel@debian.org>
- References: <[🔎] aINEB9ZJPynS8pLt@kumo.plessy.net> <[🔎] 82a23f926eeb5ac230ff0d60222c653c1c379e15.camel@debian.org>
On Fri, Jul 25, 2025 at 05:49:14PM +0800, Maytham Alsudany wrote:On Fri, 2025-07-25 at 17:44 +0900, Charles Plessy wrote:Imaging that one day a Free software projects wants to train an entirely Free LLM, that among others knows well about Debian, and for which the ouptuts are guaranteed to be free from copyright violations. If that would have a chance to happen, wouldn't it be better that our wiki's contents are under a more permissive license that does not require attribution?- From what I know, the only way this would be achieved is putting the content in the public domain or an equivalent like the CC0. - I don't think contributors want their work to be used without attribution. - Virtually no content from other sources/sites could be copied to the wiki because nothing is compatible with public domain except other public domain content.[...]I don't think work written by people who have dedicated their own time to producing content on the wiki should have their work used without attribution.I agree with all Maytham points. I think that attribution is the minimum ackowledgement authors deserve for a work as demanding as a good wiki, and it wouldn't be fair to force them to give it up.It *is* possible for an AI algorithm to cite sources and comply with licensing. Such technology can and is being developed in the wild. For example, Google Gemini provides links for where it got it's information; it's not perfect, but it's a start.I think that no special technology is necessary to comply with CC BY-SA 4.0. It requiresidentification of the creator(s) of the Licensed Material and any others designated to receive attribution, in any reasonable manner requested by the LicensorI am pretty sure that providing a link to a dedicated file listing the contributors is a "reasonable manner". And it is so cheap that it would be unreasonable for the licensee to consider it an excessive hurdle.Cheers, -- Ceppo https://wiki.debian.org/CeppoPlease, encrypt our messages with the key at the link above and send me yours.Attachment: signature.asc
Description: PGP signature
--- End Message ---