Re: We need a global decision about R data in binary format, and stick to it.

To: Jeremy Stanley <fungi@yuggoth.org>
Cc: debian-devel@lists.debian.org, ftpmaster@ftp-master.debian.org, debian-med-packaging@lists.alioth.debian.org
Subject: Re: We need a global decision about R data in binary format, and stick to it.
From: Ian Jackson <ijackson@chiark.greenend.org.uk>
Date: Mon, 5 Aug 2013 16:41:13 +0100
Message-id: <[🔎] 20991.51097.617273.783851@chiark.greenend.org.uk>
In-reply-to: <[🔎] 20130805151657.GD1472@yuggoth.org>
References: <E1V68nV-0005Z1-4v@franck.debian.org> <[🔎] 20130805005735.GE22595@falafel.plessy.net> <[🔎] 20130805014050.GA14446@leliel> <[🔎] 20991.42219.341036.231158@chiark.greenend.org.uk> <[🔎] 20130805151657.GD1472@yuggoth.org>

Jeremy Stanley writes ("Re: We need a global decision about R data in binary format, and stick to it."):
> No argument on the first, but the second sets a bad precedent if
> interpreted strongly. For example I have a program which relies on a
> fairly large set of correlative data requiring hours of expensive
> computation to generate. In the source package I include the
> original data on which the resulting tables are based and provide a
> means to regenerate it on the fly at package build time, but disable
> it by default so that it doesn't chew up build resources
> unnecessarily.

That makes sense, and is IMO a good reason for not doing the complete
from-scratch build each time.

> Since I need to generate the correlation data for other (non-Debian)
> users of the software anyway, I ship the generated files in the
> source package too and just include them in the binary package
> (along with instructions and tooling for the end user to be able to
> build datasets they can use to override the default ones provided).
> While my example is Python rather than R, I expect it's
> representative of situations for many scientific tools. Perhaps some
> guidance on when this tactic is or is not appropriate would be
> beneficial.

There should IMO be a standard way to request a source package to do
from-scratch rebuilds for this kind of thing, for QA purposes.

Ian.

Reply to:

Follow-Ups:
- Re: We need a global decision about R data in binary format, and stick to it.
  - From: Jeremy Stanley <fungi@yuggoth.org>
- Re: We need a global decision about R data in binary format, and stick to it.
  - From: Thorsten Glaser <tg@mirbsd.de>

References:
- We need a global decision about R data in binary format, and stick to it.
  - From: Charles Plessy <charles-listes-med-packaging@plessy.org>
- Re: We need a global decision about R data in binary format, and stick to it.
  - From: Paul Tagliamonte <paultag@debian.org>
- Re: We need a global decision about R data in binary format, and stick to it.
  - From: Ian Jackson <ijackson@chiark.greenend.org.uk>
- Re: We need a global decision about R data in binary format, and stick to it.
  - From: Jeremy Stanley <fungi@yuggoth.org>

Prev by Date: Re: We need a global decision about R data in binary format, and stick to it.
Next by Date: Re: We need a global decision about R data in binary format, and stick to it.
Previous by thread: Re: We need a global decision about R data in binary format, and stick to it.
Next by thread: Re: We need a global decision about R data in binary format, and stick to it.
Index(es):
- Date
- Thread