[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Non-LLM example where we do not in practice use original training data



Simon McVittie <smcv@debian.org> writes:

> On Tue, 06 May 2025 at 13:58:57 +0200, Stefano Zacchiroli wrote:
>>FWIW, in terms of free software ethics, I consider non-open data to be
>>"less nasty" than non-free code.
>
> Debian is unusual in the way we interpret our mission statement as
> extending to everything we distribute being Free, not just our
> executable code.

I don't think Debian is perfectly consistent in applying that principle:
for example, the text of the Developer Certificate of Origin (DCO) is
included in Debian packages (in 'main') and has a clearly non-free
license, and IIRC sometimes not even in debian/copyright.  There are
other examples historically of similar content too, e.g., IETF RFCs and
Unicode tables.  So I think there are some implicit acknowledgement from
Debian's (in)actions that non-open data like documentation or firmware
is not as evil as non-free code.  Extending that approach to AI models
isn't unreasonable, although I would prefer if we didn't.

/Simon

Developer Certificate of Origin
Version 1.1

Copyright (C) 2004, 2006 The Linux Foundation and its contributors.

Everyone is permitted to copy and distribute verbatim copies of this
license document, but changing it is not allowed.


Developer's Certificate of Origin 1.1

By making a contribution to this project, I certify that:

(a) The contribution was created in whole or in part by me and I
    have the right to submit it under the open source license
    indicated in the file; or

(b) The contribution is based upon previous work that, to the best
    of my knowledge, is covered under an appropriate open source
    license and I have the right under that license to submit that
    work with modifications, whether created in whole or in part
    by me, under the same open source license (unless I am
    permitted to submit under a different license), as indicated
    in the file; or

(c) The contribution was provided directly to me by some other
    person who certified (a), (b) or (c) and I have not modified
    it.

(d) I understand and agree that this project and the contribution
    are public and that a record of the contribution (including all
    personal information I submit with it, including my sign-off) is
    maintained indefinitely and may be redistributed consistent with
    this project or the open source license(s) involved.

Attachment: signature.asc
Description: PGP signature


Reply to: