AMBER Archive (2006)Subject: AMBER: job crashes
From: Xiaowei (David) Li (xl3a_at_virginia.edu)
Date: Tue Sep 26 2006 - 22:09:04 CDT
Dear all:
I have met intensive (almost every job I submitted) MD job crashes
during recent simulation work. The job crashes always happend upon the
completion points of simulations (for example, the crash happens around
950 ps for a 1 ns simulation). All of the errors messages have the
"semop lock failed" information as following.
Job is running on node(s):
------------------------
compute-2-5 compute-2-6 compute-2-7 compute-2-9
------------------------
p4_error: latest msg from perror: Invalid argument
p0_9469: p4_error: OOPS: semop lock failed: -1
forrtl: error (69): process interrupted (SIGINT)
forrtl: error (69): process interrupted (SIGINT)
forrtl: error (69): process interrupted (SIGINT)
forrtl: error (69): process interrupted (SIGINT)
p3_28418: (207846.623829) net_send: could not write to fd=5, errno = 32
forrtl: error (69): process interrupted (SIGINT)
p0_9469: (207848.918605) net_send: could not write to fd=4, errno = 32:
I was running the parallel simulation with MPI on a linux cluster with
Athlon Opteron 244 processors. The input file is:
&cntrl
imin = 0,
irest = 1,
ntx = 5,
ntb = 2,
pres0 = 1.0,
ntp = 1,
tautp=5
taup =5,
cut = 10,
ntr = 0,
ntc = 2,
ntf = 2,
tempi = 300.0,
temp0 = 300.0,
ntt = 1,
nstlim =500000,
dt = 0.002,
ntpr = 100,
ntwx = 100,
ntwr = 1000,
nscm=1,
&end
Any help or suggestion will be deeply appreciated. Thanks.
Best,
Xiaowei Li
University of Virginia
-----------------------------------------------------------------------
The AMBER Mail Reflector
To post, send mail to amber_at_scripps.edu
To unsubscribe, send "unsubscribe amber" to majordomo_at_scripps.edu
|