AMBER Archive (2002)
Subject: Re: parallel jobs die with no error message from sander
From: Joffre Heredia (joffre_at_yogi.uab.es)
Date: Tue Jul 30 2002 - 12:46:37 CDT
Our compute nodes are already isolated from the rest of the network:
they are on a separate network of their own. I guess it could be a
problem with MPICH. What do you think?
-------------------------------------------------------------
Joffre Heredia Rodrigo Tel: (34)-93-5813812
Laboratory of Computational Medicine Fax: (34)-93-5812344
Biostatistic Dept.
UAB School of Medicine. Bellaterra Joffre.Heredia_at_uab.es
08193-Barcelona (SPAIN)
-------------------------------------------------------------
On Mon, 29 Jul 2002, jim caldwell wrote:
>
> That looks like a communication problem to me. Are your machines
> on a busy network? Can you isolate the compute nodes behind a
> router/switch?
>
> jim
>