Displaying 3 results from an estimated 3 matches for "pbs_nodefile".
2008 Oct 22
1
torque/psb & snow library
Hello all;
I'm trying to execute parallel jobs through the snow library on a cluster
managed by Torque/PBS. I'm successfully obtaining the cluster with:
>system("cat $PBS_NODEFILE > cluster.txt")
>mycluster <- scan(file="cluster.txt",what="character")
>cl <- makeSOCKcluster(mycluster)
The only problem, at the moment, is that if I use processors on nodes
other than the one on which I'm running R, the communication is extremely
slow...
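A minimal shell sketch for inspecting the node file before handing it to snow. It assumes the usual Torque/PBS layout of one hostname per line, repeated once per allocated slot; the function name `slots_per_host` is made up for illustration. Seeing how many slots sit on each host shows how many workers end up on remote nodes.

```shell
#!/bin/sh
# Hypothetical helper: print "<count> <host>" for each host in a node file.
# Assumes one hostname per line, repeated once per allocated slot.
slots_per_host() {
    sort "$1" | uniq -c | awk '{print $1, $2}'
}

# PBS_NODEFILE is only set inside a running PBS job, so guard the call.
if [ -n "${PBS_NODEFILE:-}" ] && [ -r "$PBS_NODEFILE" ]; then
    slots_per_host "$PBS_NODEFILE"
fi
```

Outside a job the script prints nothing; inside a job it lists each host once with its slot count.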
2009 Sep 22
0
snowfall: missing MPI node
Hello,
I don't know if the question pertains to Rmpi, snow or snowfall.
I run my job by:
mpirun -np N -hostfile $PBS_NODEFILE RMPISNOW -f my-script.r --slave
In the snowfall sfInit call I have to specify one CPU fewer than in
the mpirun call:
sfInit(parallel=TRUE, cpus=N-1, type="MPI")
otherwise I receive an error similar to: "cluster size N-1 already
running" (sorry I don't remember the...
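The N versus N-1 relationship described above can be sketched in shell. This assumes the node file has one line per allocated slot and that one MPI process acts as the master, which is consistent with the poster's observation but not confirmed by the truncated message. The helper names `count_slots` and `worker_count` are hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: count non-empty lines (slots) in a node file.
count_slots() {
    grep -c . "$1"
}

# Hypothetical helper: workers left over if one process is the master.
worker_count() {
    echo $(( $(count_slots "$1") - 1 ))
}

# PBS_NODEFILE is only set inside a running PBS job, so guard the call.
if [ -n "${PBS_NODEFILE:-}" ] && [ -r "$PBS_NODEFILE" ]; then
    echo "mpirun -np value:   $(count_slots "$PBS_NODEFILE")"
    echo "sfInit cpus value:  $(worker_count "$PBS_NODEFILE")"
fi
```

The script only prints the two numbers; it does not launch mpirun or R itself.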
2013 Jul 20
7
Failure to Launch (was override -q option)
Attached is the very verbose ssh output. Just to be perverse, this time two
nodes lost connectivity. The only thing I see is lines saying that the two
connections are lost, although to be honest I have no idea what everything
else means. For reference, 8 ssh connections were being made at the same
time for an 8x8 MPI task.
N.B., since the OS I am using does not have rsh, I am currently using the