On Mon, Oct 28, 2024 at 09:53:31PM +0200, Jonathan Carter wrote:
> The companies [...] want to restrict what you can actually use it
> for, and call it open source? And then OSI makes a definition that
> seems carefully crafted to let these kind of licenses slip through?

The licensing terms for the Meta Llama models are indeed horrific, but I
don't understand your point here. To be OSAID-compliant, Meta will have
to change precisely those licensing terms and make them DFSG-compliant.
That would be a *good* thing for the world and would fix the main thing
you are upset about.

And Meta does not like that idea. Meta is, right now, lobbying EU
regulators to convince them that what should count as "open source AI"
for the purposes of the EU AI Act is their (Meta's) version, rather than
OSAID.

I have personally fought (and lost) during the OSAID definition process
to make access to training data mandatory in the definition. So while
I'm certainly not against criticizing OSAID, we should do so for the
right reasons.

Cheers

PS To make Llama models OSAID-compliant, Meta, in addition to (1)
changing the model license, will also have to: (2) provide "a listing of
all publicly available training data and where to obtain it", and (3)
release under DFSG-compatible terms their entire training pipeline
(currently unreleased). I don't think they will ever get there. But if
they do, these would also be good things for the world. Not *as good* as
having access to the entire training dataset, but good nonetheless.

-- 
Stefano Zacchiroli . zack@upsilon.cc . https://upsilon.cc/zack
Full professor of Computer Science, Télécom Paris, Polytechnic Institute of Paris
Co-founder & CSO, Software Heritage
Mastodon: https://mastodon.xyz/@zacchiro