
Re: Bits from /me: A humble draft policy on "deep learning v.s. freedom"




On 22/05/2019 03:53, Mo Zhou wrote:
Hi Tzafrir,

On 2019-05-21 19:58, Tzafrir Cohen wrote:
Is there a way to prove in some way (reproducible build or something
similar) that the results were obtained from that set using the specific
algorithm?
I wrote a dedicated section about reproducibility:
https://salsa.debian.org/lumin/deeplearning-policy#neural-network-reproducibility

I suppose that the answer is negative, but it would have been nice to
have that.
In simple cases, fixing the seed for the random number generator is enough.

If an upstream has ever claimed that their project aims to be of high
quality, then failure to reproduce is very likely a fatal bug.

Reproducibility is also a headache among the machine learning and
deep learning communities. They are trying to improve the situation.
Everyone likes reproducible bits.
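As a minimal illustration of the seed-fixing point above (a stdlib-only sketch with a toy SGD loop; real frameworks such as PyTorch additionally need their own seeding calls and deterministic-algorithm flags), fixing the RNG seed makes a training run bit-for-bit repeatable:

```python
import random

def train(seed, steps=100):
    """Toy 'training': SGD on the loss (w*x - 0.5*x)^2 with random samples."""
    rng = random.Random(seed)   # isolated RNG seeded up front
    w = 0.0
    for _ in range(steps):
        x = rng.uniform(-1.0, 1.0)           # synthetic data point
        grad = 2.0 * (w * x - 0.5 * x) * x   # gradient of the toy loss
        w -= 0.1 * grad                      # SGD update
    return w

# Same seed, same data stream, same arithmetic -> identical weights.
assert train(seed=42) == train(seed=42)
```

Two runs with the same seed produce the exact same final weight; change the seed and the result differs, which is why a pinned seed is the first prerequisite for verifying a published model against its training data.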

I agree completely.

Your wording "The model /should/ be reproducible with a fixed random seed." feels correct, but I wonder whether guidance notes along the following lines should be added:

    *Unless* we can reproduce the same results from the same training data,
    the model cannot be classified as group 1, "Free Model", because it is
    impossible to verify that training was carried out on a dataset
    explicitly licensed under a free software license. This should be
    treated as a severe bug, and the entire suite should be classified as
    group 2, "ToxicCandy Model", until such time as verification is possible.

Finally,
Thank you for your work on this.

/Andy

