Red Hat Bugzilla – Bug 195426
ypserv complains "o_ypcall: Timed out" in my XeonLV
Last modified: 2015-01-07 19:12:58 EST
Description of problem:
I am running Red Hat Enterprise Linux AS4U2 on my XeonLV cluster. In order to
run my parallel jobs, I installed MPICH-220.127.116.11. Of course, I have also
installed ypserv, ypbind, nfs, rsh, etc.
The problem is that when I start a parallel job such as
`mpirun -np 16 -machinefile myhosts mpihelloworld', ypserv complains:
[compute1] /home/kartwall/testcode > /usr/local/mpich/bin/mpirun -np 16 -machinefile hostfile hello
o_ypcall: clnt_call: RPC: Timed out
p0_18235: p4_error: Child process exited while making connection to remote
process on compute2: 0
I know that in ypserv-2.13-9 version, this is a ypserv bug and it can be fixed
if we downgrade ypserv to ypserv-2.13-5. However, my ypserv is already ypserv-2.13-5.
Another interesting thing is that the error message above is "o_ypcall" while
I have tried to upgrade my ypserv to `ypserv-2.13-11', but it doesn't work.
I think that maybe ypserv doesn't support XeonLV yet. I have found that a lot
of software on my XeonLV platform doesn't work correctly. And on my other
cluster, which is based on Xeon, ypserv-2.13-5 works well.
Version-Release number of selected component (if applicable):
Steps to Reproduce:
> I know that in ypserv-2.13-9 version, this is a ypserv bug and it can be fixed
Which bug are you talking about?
As this has been in needinfo for a long time with no movement, the decision is to
close WONTFIX. Please re-open with the requested information if the problem persists.