[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#954272: marked as done (slurmd: SLURM not working with OpenMPI)



Your message dated Wed, 02 Dec 2020 00:48:55 +0000
with message-id <E1kkGKF-000FXD-4G@fasolo.debian.org>
and subject line Bug#954272: fixed in slurm-wlm 20.02.6-2
has caused the Debian Bug report #954272,
regarding slurmd: SLURM not working with OpenMPI
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact owner@bugs.debian.org
immediately.)


-- 
954272: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=954272
Debian Bug Tracking System
Contact owner@bugs.debian.org with problems
--- Begin Message ---
Package: slurmd
Version: 19.05.3.2-2+b1
Severity: important

Dear Maintainer,

I am trying to get SLURM working on a single node. I have installed and configured slurmd and slurmctld.

A simple test like `srun hostname` works, even on multiple cores. However, when trying to use MPI, it crashes with the following error message:

*** An error occurred in MPI_Init
*** on a NULL communicator
*** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
***    and potentially your MPI job)

This happens even in the most simple "Hello World" case, as long as the program is MPI-enabled.

I am trying to use OpenMPI (4.0.2) from the Debian repositories. `srun --mpi list` returns:

srun: MPI types are...
srun: openmpi
srun: pmi2
srun: none

I have tried all options, but the result is the same in all cases.

Maybe this is user error, as this is my first time setting up SLURM, but I have not been able to find any possible causes/solutions and I am kind of stuck at this point.

Regards,

Lars

-- System Information:
Debian Release: bullseye/sid
  APT prefers testing
  APT policy: (500, 'testing')
Architecture: amd64 (x86_64)

Kernel: Linux 5.4.0-3-amd64 (SMP w/64 CPU cores)
Kernel taint flags: TAINT_OOT_MODULE, TAINT_UNSIGNED_MODULE
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /usr/bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled

Versions of packages slurmd depends on:
ii  libc6                    2.30-2
ii  libhwloc15               2.1.0+dfsg-4
ii  liblz4-1                 1.9.2-2
ii  libnuma1                 2.0.12-1+b1
ii  libpam0g                 1.3.1-5
ii  lsb-base                 11.1.0
ii  munge                    0.5.13-2+b1
ii  openssl                  1.1.1d-2
ii  slurm-wlm-basic-plugins  19.05.3.2-2+b1
ii  ucf                      3.0038+nmu1
ii  zlib1g                   1:1.2.11.dfsg-2

slurmd recommends no packages.

slurmd suggests no packages.

-- no debconf information

--- End Message ---
--- Begin Message ---
Source: slurm-wlm
Source-Version: 20.02.6-2
Done: Gennaro Oliva <oliva.g@na.icar.cnr.it>

We believe that the bug you reported is fixed in the latest version of
slurm-wlm, which is due to be installed in the Debian FTP archive.

A summary of the changes between this version and the previous one is
attached.

Thank you for reporting the bug, which will now be closed.  If you
have further comments please address them to 954272@bugs.debian.org,
and the maintainer will reopen the bug report if appropriate.

Debian distribution maintenance software
pp.
Gennaro Oliva <oliva.g@na.icar.cnr.it> (supplier of updated slurm-wlm package)

(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing ftpmaster@ftp-master.debian.org)


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA512

Format: 1.8
Date: Wed, 02 Dec 2020 01:13:08 +0100
Source: slurm-wlm
Architecture: source
Version: 20.02.6-2
Distribution: unstable
Urgency: medium
Maintainer: Debian HPC Team <debian-hpc@lists.debian.org>
Changed-By: Gennaro Oliva <oliva.g@na.icar.cnr.it>
Closes: 954272
Changes:
 slurm-wlm (20.02.6-2) unstable; urgency=medium
 .
   * Fix pmix support (Closes: #954272)
   * Fix srun autopkgtest
   * Tweak slurm.conf for autopkgtest
   * Add mpi autopkgtest
   * Bump standard version to 4.5.1 (no changes)
Checksums-Sha1:
 426528e13a8c1190a0d3cbe37dd700c377e4a7f4 3655 slurm-wlm_20.02.6-2.dsc
 bcf30b34d62bb484ab82773c9cc6fc3962dbd258 125964 slurm-wlm_20.02.6-2.debian.tar.xz
 f8242208eee86b00ea843bffb1b5dff0a479eb22 21444 slurm-wlm_20.02.6-2_amd64.buildinfo
Checksums-Sha256:
 e99dc92a5a9c274b510ae2b91a8dcceb7bf98ee3c5119644e9ae5fdd9255a528 3655 slurm-wlm_20.02.6-2.dsc
 1f76d13e8c7aaee10c0d88326df77c0dfbd24ad757d10ccee31169f7583d9e2f 125964 slurm-wlm_20.02.6-2.debian.tar.xz
 9e0feb9c60d60c4d80f7f9b8e136eed9f54b91db7aac1575dddae2bd5aa0aecf 21444 slurm-wlm_20.02.6-2_amd64.buildinfo
Files:
 de2b6aa2bceabb637c0604fbd003a766 3655 admin optional slurm-wlm_20.02.6-2.dsc
 55b37d8daa2e92751632fadfa540e33d 125964 admin optional slurm-wlm_20.02.6-2.debian.tar.xz
 9525ebbd514688d543bc8e24cb522f9a 21444 admin optional slurm-wlm_20.02.6-2_amd64.buildinfo

-----BEGIN PGP SIGNATURE-----

iQJLBAEBCgA1FiEE6zNF9WRBuLgad5h2ffpBrZYZhdcFAl/G4o8XHG9saXZhLmdA
bmEuaWNhci5jbnIuaXQACgkQffpBrZYZhddF7g//ZvqvGznQGJcMqTixvI1v9och
caoTwh7fkQ9tdv1zCOKdGlChdauXUGItpa0qBMDwp8O/QEjEfzx+9bTwHqWhLYY6
Qyvlgacwd8KTRfLToYVpqZ/KUoKkYFZyPWYGVHYg2hC+qkbf74enXFlsadKdUp7p
bfrebMqF8Tj/DbuRzghd8c0rM1BClp5gb0lzyTSuqcbd9PDhztszdXyw+b32nXz4
+UvupNZ73ggZ67oNEjjaqIDqRD+EoUMyYjXNL+BLX/ygb2pdQIWj8SKPceQrA3SE
O3kOSlmftZ8I95Y1p5Z6rDOq99h0QDH+uXmiyzfvYTuQLtEqefdGwzJujGHc2pEd
QUp7NhdxQILM9YbBD+pAGtooAzZ74g/NutqzQ0rgYkZGwyi9+MJHlOE7uOhqV6NL
6tYUDtXq5QcMps7OJtzKnOvvnDyffrIkyLvWWzJmzKpzCSCTpOEfEzOldb5drY7m
qmfBBpsNEDveHXvq4F+Sz8iXVI2FaBtYnGOoYLkP+VgI/ztZYpUoGiqQ0thY72gI
0ijcvfBR7/3w+ybvGHVFs8UFk5vKk3xT3ODUB9cjV8SDotAF51qRbUqLlQ+6mVuG
taoYnhqdjX8vyY5p7BWd6TrM3xrjsCiUzB/oLElxU0zDU5Zxz+P+pZfFciL4RVdW
y7sKIPg/y3rAk1x3zFw=
=kgXG
-----END PGP SIGNATURE-----

--- End Message ---

Reply to: