AMBER Archive (2009)
Subject: [AMBER] reg.error in parallel run
From: balaji nagarajan (balaji_sethu_at_hotmail.com)
Date: Fri Jan 23 2009 - 04:05:21 CST
Dear Amber users,
I am trying to run in parallel using LAM.
I have installed it on two nodes with different IP addresses,
and they have 2 cores each.
I have created a .rhosts file for remote login without a password.
When I run rsh with the IP address, it works.
When I run lamboot, like
lamboot -v nodes
it gives the error:
--------------------------------------------------------------------------------
[balaji_at_ACTINIDE ~/mddna]$ lamboot -v nodes
LAM 7.1.3/MPI 2 C++/ROMIO - Indiana University
n-1<12069> ssi:boot:base:linear: booting n0 (192.1.1.53)
n-1<12069> ssi:boot:base:linear: booting n1 (192.1.1.54)
ERROR: LAM/MPI unexpectedly received the following on stderr:
connect to address 192.1.1.54 port 544: Connection refused
connect to address 192.1.1.54 port 544: Connection refused
trying normal rsh (/usr/bin/rsh)
-----------------------------------------------------------------------------
LAM attempted to execute a process on the remote node "192.1.1.54",
but received some output on the standard error. This heuristic
assumes that any output on the standard error indicates a fatal error,
and therefore aborts. You can disable this behavior (i.e., have LAM
ignore output on standard error) in the rsh boot module by setting the
SSI parameter boot_rsh_ignore_stderr to 1.
---------------------------------------------------------------------------------------
II) When I run
lamboot -v -ssi boot_rsh_ignore_stderr nodes
it works:
lamboot -v -ssi boot_rsh_ignore_stderr nodes
LAM 7.1.3/MPI 2 C++/ROMIO - Indiana University
n-1<12113> ssi:boot:base:linear: booting n0 (localhost)
n-1<12113> ssi:boot:base:linear: finished
-------------------------------------------------------------------------------------------
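The LAM message quoted earlier says to set boot_rsh_ignore_stderr "to 1", and the boot output above lists only n0 (localhost). One possible reading is that "nodes" was taken as the value of the SSI parameter, so no hostfile was read. A minimal sketch of the command with an explicit value follows; the "cpu=2" entries in the hostfile are an assumption based on the two dual-core machines described here, not the actual contents of the "nodes" file used above.

```shell
# Sketch only: hostfile contents are assumed from the description in this
# message (two machines, 2 cores each); adjust to the real "nodes" file.
cat > nodes <<'EOF'
192.1.1.53 cpu=2
192.1.1.54 cpu=2
EOF

if command -v lamboot >/dev/null 2>&1; then
    # -ssi takes a key followed by a value, so the 1 comes before the hostfile
    lamboot -v -ssi boot_rsh_ignore_stderr 1 nodes
    # List the nodes LAM actually booted, to check whether both appear
    lamnodes
else
    echo "lamboot not found; hostfile written to ./nodes"
fi
```

If lamnodes then reports both addresses, the second machine is at least part of the LAM universe before mpirun is started.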
But when I start the run from the directory
with
mpirun -np 2 $AMBERHOME/amber9/exe/sander -O -i polyAT_wat_min1.in -o polyAT_wat_min1.out -p polyAT_wat.prmtop -c polyAT_wat.inpcrd -r polyAT_wat_min1.rst -ref polyAT_wat.inpcrd
it does not pick up the second machine,
but on the first machine it uses both processors.
-----------------------------------------------------------------
Could someone help me solve the problem?
Thanks in advance,
balaji
UOM
_______________________________________________
AMBER mailing list
AMBER_at_ambermd.org
http://lists.ambermd.org/mailman/listinfo/amber