This is driving me crazy. The PBS script below works fine except that for the cd
command. If the line cd $PBS_O_WORKDIR
is uncommented, the process is running forever on the cluster.
#PBS -lnodes=1:ppn=8
#PBS -lwalltime=48:00:00
#PBS -S /bin/bash
echo $PBS_O_WORKDIR
#cd $PBS_O_WORKDIR
cat $PBS_NODEFILE
export THIS_HOST=$(hostname)
echo Hello World from host $THIS_HOST
Note: I submit the job with qsub test.bash
Returned output (if cd $PBS_O_WORKDIR
is commented):
/scratch/users/angela/mpi_test
au01.cluster
au01.cluster
au01.cluster
au01.cluster
au01.cluster
au01.cluster
au01.cluster
au01.cluster
Hello World from host au01
Edited code with mpiexec
line added:
#PBS -lnodes=1:ppn=8
#PBS -lwalltime=48:00:00
#PBS -S /bin/bash
echo $PBS_O_WORKDIR
#cd $PBS_O_WORKDIR
cat $PBS_NODEFILE
export THIS_HOST=$(hostname)
echo Hello World from host $THIS_HOST
NPROC=2
mpiexec -n $NPROC -hostfile $PBS_NODEFILE -mca plm_tm_verbose 1 hostname
In this case, an error message is returned:
[au01:47000] mca: base: component_find: unable to open /soft/openmpi/1.6.4/intel-13.1.1/lib/openmpi/mca_plm_tm: libtorque.so.2: cannot open shared object file: No such file or directory (ignored)