--- Begin Message ---
- To: Debian Bug Tracking System <submit@bugs.debian.org>
- Subject: slurmd: SLURM not working with OpenMPI
- From: Lars Veldscholte <lars@tuxplace.nl>
- Date: Thu, 19 Mar 2020 15:16:15 +0100
- Message-id: <158462737525.1292459.8195940355201804636.reportbug@utwks159064.tnw.utwente.nl>
Package: slurmd
Version: 19.05.3.2-2+b1
Severity: important
Dear Maintainer,
I am trying to get SLURM working on a single node. I have installed and configured slurmd and slurmctld.
A simple test like `srun hostname` works, even on multiple cores. However, when trying to use MPI, it crashes with the following error message:
*** An error occurred in MPI_Init
*** on a NULL communicator
*** MPI_ERRORS_ARE_FATAL (processes in this communicator will now abort,
*** and potentially your MPI job)
This happens even in the most simple "Hello World" case, as long as the program is MPI-enabled.
I am trying to use OpenMPI (4.0.2) from the Debian repositories. `srun --mpi list` returns:
srun: MPI types are...
srun: openmpi
srun: pmi2
srun: none
I have tried all options, but the result is the same in all cases.
Maybe this is user error, as this is my first time setting up SLURM, but I have not been able to find any possible causes/solutions and I am kind of stuck at this point.
Regards,
Lars
-- System Information:
Debian Release: bullseye/sid
APT prefers testing
APT policy: (500, 'testing')
Architecture: amd64 (x86_64)
Kernel: Linux 5.4.0-3-amd64 (SMP w/64 CPU cores)
Kernel taint flags: TAINT_OOT_MODULE, TAINT_UNSIGNED_MODULE
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8), LANGUAGE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /usr/bin/dash
Init: systemd (via /run/systemd/system)
LSM: AppArmor: enabled
Versions of packages slurmd depends on:
ii libc6 2.30-2
ii libhwloc15 2.1.0+dfsg-4
ii liblz4-1 1.9.2-2
ii libnuma1 2.0.12-1+b1
ii libpam0g 1.3.1-5
ii lsb-base 11.1.0
ii munge 0.5.13-2+b1
ii openssl 1.1.1d-2
ii slurm-wlm-basic-plugins 19.05.3.2-2+b1
ii ucf 3.0038+nmu1
ii zlib1g 1:1.2.11.dfsg-2
slurmd recommends no packages.
slurmd suggests no packages.
-- no debconf information
--- End Message ---
--- Begin Message ---
Source: slurm-wlm
Source-Version: 20.02.6-2
Done: Gennaro Oliva <oliva.g@na.icar.cnr.it>
We believe that the bug you reported is fixed in the latest version of
slurm-wlm, which is due to be installed in the Debian FTP archive.
A summary of the changes between this version and the previous one is
attached.
Thank you for reporting the bug, which will now be closed. If you
have further comments please address them to 954272@bugs.debian.org,
and the maintainer will reopen the bug report if appropriate.
Debian distribution maintenance software
pp.
Gennaro Oliva <oliva.g@na.icar.cnr.it> (supplier of updated slurm-wlm package)
(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing ftpmaster@ftp-master.debian.org)
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA512
Format: 1.8
Date: Wed, 02 Dec 2020 01:13:08 +0100
Source: slurm-wlm
Architecture: source
Version: 20.02.6-2
Distribution: unstable
Urgency: medium
Maintainer: Debian HPC Team <debian-hpc@lists.debian.org>
Changed-By: Gennaro Oliva <oliva.g@na.icar.cnr.it>
Closes: 954272
Changes:
slurm-wlm (20.02.6-2) unstable; urgency=medium
.
* Fix pmix support (Closes: #954272)
* Fix srun autopkgtest
* Tweak slurm.conf for autopkgtest
* Add mpi autopkgtest
* Bump standard version to 4.5.1 (no changes)
Checksums-Sha1:
426528e13a8c1190a0d3cbe37dd700c377e4a7f4 3655 slurm-wlm_20.02.6-2.dsc
bcf30b34d62bb484ab82773c9cc6fc3962dbd258 125964 slurm-wlm_20.02.6-2.debian.tar.xz
f8242208eee86b00ea843bffb1b5dff0a479eb22 21444 slurm-wlm_20.02.6-2_amd64.buildinfo
Checksums-Sha256:
e99dc92a5a9c274b510ae2b91a8dcceb7bf98ee3c5119644e9ae5fdd9255a528 3655 slurm-wlm_20.02.6-2.dsc
1f76d13e8c7aaee10c0d88326df77c0dfbd24ad757d10ccee31169f7583d9e2f 125964 slurm-wlm_20.02.6-2.debian.tar.xz
9e0feb9c60d60c4d80f7f9b8e136eed9f54b91db7aac1575dddae2bd5aa0aecf 21444 slurm-wlm_20.02.6-2_amd64.buildinfo
Files:
de2b6aa2bceabb637c0604fbd003a766 3655 admin optional slurm-wlm_20.02.6-2.dsc
55b37d8daa2e92751632fadfa540e33d 125964 admin optional slurm-wlm_20.02.6-2.debian.tar.xz
9525ebbd514688d543bc8e24cb522f9a 21444 admin optional slurm-wlm_20.02.6-2_amd64.buildinfo
-----BEGIN PGP SIGNATURE-----
iQJLBAEBCgA1FiEE6zNF9WRBuLgad5h2ffpBrZYZhdcFAl/G4o8XHG9saXZhLmdA
bmEuaWNhci5jbnIuaXQACgkQffpBrZYZhddF7g//ZvqvGznQGJcMqTixvI1v9och
caoTwh7fkQ9tdv1zCOKdGlChdauXUGItpa0qBMDwp8O/QEjEfzx+9bTwHqWhLYY6
Qyvlgacwd8KTRfLToYVpqZ/KUoKkYFZyPWYGVHYg2hC+qkbf74enXFlsadKdUp7p
bfrebMqF8Tj/DbuRzghd8c0rM1BClp5gb0lzyTSuqcbd9PDhztszdXyw+b32nXz4
+UvupNZ73ggZ67oNEjjaqIDqRD+EoUMyYjXNL+BLX/ygb2pdQIWj8SKPceQrA3SE
O3kOSlmftZ8I95Y1p5Z6rDOq99h0QDH+uXmiyzfvYTuQLtEqefdGwzJujGHc2pEd
QUp7NhdxQILM9YbBD+pAGtooAzZ74g/NutqzQ0rgYkZGwyi9+MJHlOE7uOhqV6NL
6tYUDtXq5QcMps7OJtzKnOvvnDyffrIkyLvWWzJmzKpzCSCTpOEfEzOldb5drY7m
qmfBBpsNEDveHXvq4F+Sz8iXVI2FaBtYnGOoYLkP+VgI/ztZYpUoGiqQ0thY72gI
0ijcvfBR7/3w+ybvGHVFs8UFk5vKk3xT3ODUB9cjV8SDotAF51qRbUqLlQ+6mVuG
taoYnhqdjX8vyY5p7BWd6TrM3xrjsCiUzB/oLElxU0zDU5Zxz+P+pZfFciL4RVdW
y7sKIPg/y3rAk1x3zFw=
=kgXG
-----END PGP SIGNATURE-----
--- End Message ---