
Re: Bits from /me: A humble draft policy on "deep learning v.s. freedom"



Hi Paul,

On 2019-05-21 23:52, Paul Wise wrote:
> Are there any other case studies we could add?

Anyone is welcome to open an issue and add more
cases to the document; I can dig into them in the
future.

> Has anyone repeated the training of Mozilla DeepSpeech for example?

Generally speaking, training is non-trivial and
requires expensive hardware, which clearly reduces
the probability that someone has tried to
reproduce a given model.

A real example of how hard reproducing a **giant**
model is: BERT, one of the state-of-the-art natural
language representation models, takes about 2 weeks
to pre-train on a TPU at a cost of about $500.

Cite:
https://github.com/google-research/bert#pre-training-tips-and-caveats

> Are deep learning models deterministically and reproducibly trainable?
> If I re-train a model using the exact same input data on different
> (GPU?) hardware will I get the same bits out at the end?

Making the training program reproducible is good practice for everyone
who trains or debugs neural networks. I once wrote a simple deep
learning framework using only the C++ STL and ran into many pitfalls
along the way. Reproducibility is very important for debugging, because
mathematical bugs are much harder to diagnose than code bugs.
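To make the point concrete, here is a minimal sketch (plain Python
stdlib only, with `train_step` as a hypothetical stand-in for a real
training step) of the usual practice: fix every random seed so two runs
on the same host produce identical results. In a real framework you
would additionally seed the framework's own RNGs (e.g. NumPy's and the
GPU library's), not just Python's.

```python
import random

def train_step(rng: random.Random) -> float:
    # Stand-in for one stochastic training step; the rng models the
    # real sources of randomness (weight init, dropout, shuffling).
    return sum(rng.random() for _ in range(10))

# Same seed -> the whole "training run" is repeatable on the same host.
rng = random.Random(42)
run1 = [train_step(rng) for _ in range(3)]

rng = random.Random(42)
run2 = [train_step(rng) for _ in range(3)]

assert run1 == run2  # bitwise-identical results with a fixed seed
```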

I wrote a dedicated section about reproducibility:
https://salsa.debian.org/lumin/deeplearning-policy#neural-network-reproducibility
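As to getting the *same bits* out on different (GPU) hardware: that is
much harder to guarantee, because floating-point addition is not
associative, and parallel hardware may reduce sums in a different
order. A tiny illustration of the underlying issue:

```python
# Floating-point addition is not associative: summing the same numbers
# in a different order (as a different GPU reduction might) can change
# the result.
a = (1e16 + 1.0) + 1.0   # each +1.0 is lost to rounding -> 1e16
b = 1e16 + (1.0 + 1.0)   # 1e16 + 2.0 is exactly representable

assert a != b
```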

