Processor 6 Exiting: Caught Signal ------------ Signal: segmentation violation
Hi:
I wonder whether the error signal I got when trying to launch a
computational procedure (molecular dynamics with software NAMD):
===========
TCL: Minimizing for 1000 steps
------------- Processor 6 Exiting: Caught Signal ------------
Signal: segmentation violation
Suggestion: Try running with '++debug', or linking with '-memory paranoid'.
[6] Stack Traceback:
[0] /lib/libc.so.6 [0x7f89f2837f60]
[1] _Z24sortEntries_mergeSort_v2RP12__sort_entryS1_i+0xba [0x5e5f92]
[2] _ZN20ComputeNonbondedUtil32calc_pair_energy_merge_fullelectEP9nonbonded+0x3551
[0x5a995b]
[3] _ZN20ComputeNonbondedPair7doForceEPP8CompAtomPP11CompAtomExtPP7Results+0xaca
[0x5918f4]
[4] _ZN16ComputePatchPair6doWorkEv+0xa7 [0x73ab63]
[5] _ZN11WorkDistrib12enqueueWorkAEP12LocalWorkMsg+0x16 [0x9a437e]
[6] _ZN19CkIndex_WorkDistrib31_call_enqueueWorkA_LocalWorkMsgEPvP11WorkDistrib+0xf
[0x9a4365]
[7] CkDeliverMessageFree+0x21 [0xa343cb]
[8] _Z15_processHandlerPvP11CkCoreState+0x530 [0xa339bc]
[9] CsdScheduleForever+0xa5 [0xabd4b9]
[10] CsdScheduler+0x1c [0xabd0ba]
[11] _Z11master_initiPPc+0x2d6 [0x512386]
[12] _ZN7BackEnd4initEiPPc+0x31 [0x5120a9]
[13] main+0x2f [0x50d99f]
[14] __libc_start_main+0xe6 [0x7f89f28241a6]
[15] _ZNSt8ios_base4InitD1Ev+0x52 [0x508d5a]
Fatal error on PE 6> segmentation violation
===============
has to do with the hardware. This error came with an otherwise
efficien parallel UMA-type computer with four double-opterons. The
same procedure run normally on a similar computer with two
double-opterns. In both cases amd64 lenny. I did not know how to
implement bthe suggestions "debug" '-memory paranoid'
Thanks
francesco pietra
Reply to: