
Bug#982173: mpich breaks bagel autopkgtest: Internal error



Source: mpich, bagel
Control: found -1 mpich/3.4.1-2
Control: found -1 bagel/1.2.2-1
Severity: serious
Tags: sid bullseye
X-Debbugs-CC: debian-ci@lists.debian.org
User: debian-ci@lists.debian.org
Usertags: breaks needs-update

Dear maintainer(s),

With a recent upload of mpich, the autopkgtest of bagel fails in testing
when it is run with the binary packages of mpich from unstable. It passes
when run with only packages from testing. In tabular form:

                       pass            fail
mpich                  from testing    3.4.1-2
bagel                  from testing    1.2.2-1
all others             from testing    from testing

I copied some of the output at the bottom of this report.

Currently this regression is blocking the migration of mpich to testing
[1]. Due to the nature of this issue, I filed this bug report against
both packages. Can you please investigate the situation and reassign the
bug to the right package?

More information about this bug and the reason for filing it can be found at
https://wiki.debian.org/ContinuousIntegration/RegressionEmailInformation

Paul

[1] https://qa.debian.org/excuses.php?package=mpich

https://ci.debian.net/data/autopkgtest/testing/amd64/b/bagel/10269200/log.gz

running test case 'hf_sto3g_fci_dist'... Assertion failed in file ./src/mpid/ch4/netmod/include/../ofi/ofi_impl.h at line 316: MPIDI_OFI_global.max_order_war != 0
/lib/x86_64-linux-gnu/libmpich.so.12(MPL_backtrace_show+0x35) [0x7ff101b7a5c5]
/lib/x86_64-linux-gnu/libmpich.so.12(+0x3d41f4) [0x7ff101af11f4]
/lib/x86_64-linux-gnu/libmpich.so.12(+0x2df929) [0x7ff1019fc929]
/lib/x86_64-linux-gnu/libmpich.so.12(MPI_Raccumulate+0xaf3) [0x7ff1019fdb43]
BAGEL(+0x1175449) [0x55bce38aa449]
BAGEL(+0x117556a) [0x55bce38aa56a]
BAGEL(+0x2dad12e) [0x55bce54e212e]
BAGEL(+0x2da77e7) [0x55bce54dc7e7]
BAGEL(+0x1287971) [0x55bce39bc971]
BAGEL(+0x128cb48) [0x55bce39c1b48]
BAGEL(+0x7984da) [0x55bce2ecd4da]
BAGEL(+0x6d00cb) [0x55bce2e050cb]
BAGEL(+0x6d0600) [0x55bce2e05600]
BAGEL(+0x630a79) [0x55bce2d65a79]
/lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xea) [0x7ff101251d0a]
BAGEL(+0x6cd3ca) [0x55bce2e023ca]
Abort(1) on node 0: Internal error
Abort(806445583) on node 0 (rank 0 in comm 0): Fatal error in PMPI_Finalize: Other MPI error, error stack:
PMPI_Finalize(189)..............: MPI_Finalize failed
PMPI_Finalize(149)..............:
MPID_Finalize(702)..............:
MPIDI_OFI_mpi_finalize_hook(827):
destroy_vni_context(1079).......: OFI domain close failed (ofi_init.c:1079:destroy_vni_context:Device or resource busy)
FAILED.
