Since netcdf 4.4.1 has been in testing/unstable for some time, that's
no longer a blocker for the hdf5 transition.
I'm testing out the last of my changes for co-installable netcdf,
which I hope to have ready by the beginning of next week. It would be
worth thinking about doing both.
I'm aware of the outdated dev-coinstallable branch in the netcdf
repository, and no offence to your effort, but I don't like what I see
there. The symbols version script will be a pain to maintain, as our
experience with gdal has shown. I'm still very much against patching
the netcdf source to make it build with both serial HDF5 and its MPI
variants; that needs to be solved upstream. I don't want to require
changes to reverse dependencies to select a Debian-specific netcdf
variant, as we do for hdf5. The situation we want to create in Debian
should be something supported out of the box by upstream. Has there
been any discussion with NetCDF (and HDF5) upstream about this?
Kind Regards,
Bas
I've been working on the dev-coinstallable branch so that it no
longer requires a transition.
Yes, I've been talking to upstream about this (mostly the HDF5 people,
but also netcdf), and my understanding is that the problem is basically
HDF5/netcdf compression: HDF5 (and hence netcdf) can do either
compression (SZIP, etc.) or parallel read/write, but not both
simultaneously. Fixing this is on the todo list, but has been for many
years without progress. Estimates of 6-12 months of work have been
quoted, as HDF5 is effectively becoming a high-performance filesystem
within a file on HPC systems with deep memory hierarchies, and any such
changes are not trivial. This development can only really be written
and tested on top-end HPC systems such as those at the national labs;
patches written by developers on PCs would probably hurt performance
and not be accepted by upstream. What I'm proposing is a temporary
workaround until this is done, one that is designed to go away later.
(What this means, technically: parallel reads and writes work (on
POSIX) by dividing the file into even-sized chunks handled by e.g.
MPI-IO. With compression we don't know the on-disk size of a given
write until after we've compressed it; given a fixed-size chunk of
memory, we don't know which byte range of a 'serial bunch of bytes'
file representation it will map to. In practice, however, people are
moving to non-POSIX representations of HDF5 files on 'modern
object-based' APIs: no longer treating the file as a serial bunch of
bytes, but as a set of possibly different-sized blocks handled as
objects on e.g. an S3-style object-based filesystem. In this picture
compression can be added. But the high-performance work currently
matters more to the HDF5 developers' funders than compression does,
while on the netcdf side compression is important for those of us
storing and archiving large files long-term.)
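
To make the conflict concrete, here is a minimal sketch in C using the
netcdf-4 parallel API (this assumes a parallel-enabled libnetcdf and
uses the NC_MPIIO flag of the 4.4.x era; treat the exact error
returned as illustrative):

    #include <mpi.h>
    #include <netcdf.h>
    #include <netcdf_par.h>
    #include <stdio.h>

    int main(int argc, char **argv)
    {
        int ncid, dimid, varid, ret;

        MPI_Init(&argc, &argv);

        /* Create a netCDF-4 file for parallel access over MPI-IO. */
        ret = nc_create_par("demo.nc", NC_NETCDF4 | NC_MPIIO,
                            MPI_COMM_WORLD, MPI_INFO_NULL, &ncid);
        if (ret != NC_NOERR) {
            fprintf(stderr, "create: %s\n", nc_strerror(ret));
            MPI_Finalize();
            return 1;
        }

        nc_def_dim(ncid, "x", 1024, &dimid);
        nc_def_var(ncid, "data", NC_FLOAT, 1, &dimid, &varid);

        /* Requesting zlib compression on a parallel file is rejected
         * in current releases: the compressed size of each chunk is
         * unknown before the ranks must agree on byte ranges. */
        ret = nc_def_var_deflate(ncid, varid, 0 /* shuffle */,
                                 1 /* deflate */, 6 /* level */);
        if (ret != NC_NOERR)
            fprintf(stderr, "deflate on parallel file: %s\n",
                    nc_strerror(ret));

        nc_close(ncid);
        MPI_Finalize();
        return 0;
    }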
We need to be able to handle both cases. Eventually, in a 'deep'
software stack like Debian, we will have applications such as VisIt,
ParaView, CDAT etc. that need to either (1) read compressed files or
(2) read in parallel, depending on the workflow. These work using
netcdf, adios and xdmf plugins for IO, and currently cannot provide
parallelism on Debian because of the lack of parallel netcdf. Given
this incompatibility, serial is the right default for Debian, but it
handicaps us on large systems. Where I work, for example, we build
portals on Debian for HPC, but are blocked by this lack of MPI
support.
So, the solution I'm proposing: we retain one 'master' netcdf version,
libnetcdf11 and libnetcdf-dev. Co-installable libnetcdf-mpi-11 and
libnetcdf-pnetcdf-11 exist, but are not used by most libraries or
applications. While parallel libnetcdf Fortran and C++ libraries are
also required, I do not propose or expect that applications above the
netcdf stack provide both serial and parallel versions; there will be
no combinatorial explosion of packages. A handful of libraries and
applications may be linked to the MPI version of netcdf instead of the
serial one, and in particular the two higher-level IO libraries I
maintain, ADIOS and XDMF, will be linked to both (XDMF would provide
both xdmf.py and xdmf_mpi.py modules, and the user selects which;
ADIOS provides an interface where it can decide at runtime whether to
use serial or MPI).
The libraries are as follows: libnetcdf11 is as before, with NETCDF_*
symbols and the library at /usr/lib/$arch/libnetcdf.so.11.3.0; the
include files are in /usr/include/netcdf, etc.

For the MPI version, the library is /usr/lib/$arch/libnetcdf_mpi.so*,
with NETCDF_MPI_* symbols. Note: mpi, not openmpi; the MPI
dependencies are abstracted away by this layer.

The Fortran and C++ netcdf packages would ship both libnetcdff.so and
libnetcdff_mpi.so, etc.
pkg-config files will be of the form netcdf-$flavor.pc, with an
alternatives-managed default netcdf.pc -> netcdf_serial.pc.
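
As a sketch of how a consumer would then pick a flavour (the
netcdf-mpi.pc name follows the netcdf-$flavor.pc pattern above, and
the compile lines are assumptions rather than tested commands):

    /*
     * The same trivial source builds against either flavour; only
     * the pkg-config name changes, e.g. (hypothetical invocations):
     *
     *   cc    demo.c $(pkg-config --cflags --libs netcdf)       serial
     *   mpicc demo.c $(pkg-config --cflags --libs netcdf-mpi)   MPI
     *
     * Packages that never mention a flavour keep linking the serial
     * libnetcdf.so.11 exactly as they do today.
     */
    #include <stdio.h>
    #include <netcdf.h>

    int main(void)
    {
        printf("linked against: %s\n", nc_inq_libvers());
        return 0;
    }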
Similarly for pnetcdf: /usr/lib/$arch/libnetcdf_pnetcdf.so.*.

(pnetcdf is parallel-netcdf: there are two flavours of MPI netcdf,
netcdf4 using HDF5 for its parallelism, and pnetcdf using MPI but
writing the old nc3 format. Some applications currently outside Debian
find the latter more performant.)
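
In API terms the difference between the two MPI flavours is just the
create mode passed to the same call; a sketch (flags as in the 4.4.x
API, file names arbitrary):

    #include <mpi.h>
    #include <netcdf.h>
    #include <netcdf_par.h>

    int main(int argc, char **argv)
    {
        int ncid;

        MPI_Init(&argc, &argv);

        /* HDF5-based parallelism: a netCDF-4 file over MPI-IO. */
        if (nc_create_par("nc4.nc", NC_NETCDF4 | NC_MPIIO,
                          MPI_COMM_WORLD, MPI_INFO_NULL, &ncid) == NC_NOERR)
            nc_close(ncid);

        /* pnetcdf-based parallelism: the classic nc3 format. */
        if (nc_create_par("nc3.nc", NC_PNETCDF,
                          MPI_COMM_WORLD, MPI_INFO_NULL, &ncid) == NC_NOERR)
            nc_close(ncid);

        MPI_Finalize();
        return 0;
    }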
There is a directory structure for symlinks,
/usr/lib/$arch/netcdf/$flavor/{lib,include,cmake,pkgconfig}, as per
HDF5. If you use this as your location directory when building, it all
does the right thing; if you don't, you get the default (currently
serial) version. As only a handful of packages are expected to use the
MPI version, no build changes would be needed for most.
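
Concretely, the mpi flavour's directory would look something like this
(illustrative; I am assuming the symlink targets mirror the HDF5
arrangement):

    /usr/lib/$arch/netcdf/mpi/
        lib/libnetcdf.so        -> ../../../libnetcdf_mpi.so  (assumed)
        include/                   flavour headers
        cmake/                     flavour CMake config
        pkgconfig/netcdf.pc     -> netcdf-mpi.pc              (assumed)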
Eventually, in a new release, if compression+parallelism is
implemented upstream, this can all be transitioned away with a single
rebuild of the "MPI netcdf" packages.
So, in summary: for all but three or four packages this has no effect;
binary compatibility remains intact (symbols, versioning, etc.).
Third-party binaries will link with Debian netcdf libs and vice versa.
When the proper upstream changes are made, these changes will be
transitioned away in Debian.
Alastair