Questions tagged [torque]

TORQUE Resource Manager provides control over batch jobs and distributed computing resources. It is an open-source product based on the original PBS project.

TORQUE Resource Manager provides control over batch jobs and distributed computing resources. It is an open-source product based on the original PBS project.

While TORQUE has a built-in scheduler, pbs_sched, it is typically used solely as a resource manager with a scheduler making requests to it. Resources managers provide the low-level functionality to start, hold, cancel, and monitor jobs. Without these capabilities, a scheduler alone cannot control jobs.

251 questions
25
votes
2 answers

How to submit a job to a specific node in PBS

How do I send a job to a specific node in PBS/TORQUE? I think you must specify the node name after nodes. #PBS -l nodes=abc However, this doesn't seem to work and I'm not sure why. This question was asked here on PBS and specify nodes to use Here…
Ashwin
  • 577
  • 3
  • 6
  • 15
23
votes
5 answers

how do i find a complete list of available torque pbs queues?

Q: How do I find the available PBS queues on the "typical" Torque MPI system? (asking our admin takes 24+ hours, and the system changes with constant migration) (for example, "Std8" is one possible queue) #PBS -q Std8
russian_spy
  • 6,465
  • 4
  • 30
  • 26
18
votes
3 answers

Does qsub pass command line arguments to my script?

When I submit a job using qsub script.sh is $@ setted to some value inside script.sh? That is, are there any command line arguments passed to script.sh?
a06e
  • 18,594
  • 33
  • 93
  • 169
17
votes
4 answers

How to return to bash prompt after printing output from backgrounded function?

How can I return to my bash prompt automatically after printing output from a function that was put in the background? For example, when I run the following script in a bash shell: fn(){ sleep 10 echo "Done" exit } fn…
ff524
  • 387
  • 4
  • 19
14
votes
4 answers

Submit jobs to a slave node from within an R script?

To get myscript.R to run on a cluster slave node using a job scheduler (specifically, PBS) Currently, I submit an R script to a slave node using the following command qsub -S /bin/bash -p -1 -cwd -pe mpich 1 -j y -o output.log ./myscript.R Are…
David LeBauer
  • 31,011
  • 31
  • 115
  • 189
14
votes
4 answers

PBS, refresh stdout

I have a long running Torque/PBS job and I'd like to monitor output. But log file only gets copied after the job is finished. Is there a way to convince PBS to refresh it?
Anycorn
  • 50,217
  • 42
  • 167
  • 261
11
votes
1 answer

How to use the qsub -v command in PBS torque?

I would like to pass variables to a csh script by using "qsub -v" command. I understand we can list the parameters-value pairs as below, qsub -v par1=value1 par2=value2 myScript.csh Does anyone know if the values of these parameters can be a…
Cassie
  • 1,179
  • 6
  • 18
  • 30
8
votes
1 answer

Exclude certain nodes when submitting jobs with qsub / torque?

When submitting batch jobs with qsub, is there a way to exclude a certain node (by hostname)? Something like # this is just a pseudo command: qsub myscript.sh --exclude computer01
jdm
  • 9,470
  • 12
  • 58
  • 110
8
votes
0 answers

Torque cannot communicate with host

I have been attempting to setup the torque scheduler for a small cluster. I followed the steps to setup the scheduler from http://docs.adaptivecomputing.com/torque/archive/3-0-2/1.2configuring_torque_on_server.php However when i attempt qterm -t…
j-money
  • 509
  • 2
  • 9
  • 32
6
votes
1 answer

Torque job with dependencies on job array

I'm trying to submit a torque job that is dependent on an array completing. FIRST=`qsub -q hep -t 1-5 foo.sh` qsub -q hep -W depend=afterok:$FIRST bar.sh The FIRST job array submits and completes just fine but the second job bar.sh just holds…
J. Vasquez
  • 161
  • 8
6
votes
5 answers

How get information of completed PBS or Torque jobs?

I have IDs of completed jobs. How do I check its detailed information, such as execution time, allocated nodes, etc? I remember SGE has a command for it (qacct?). But I could not find it for PBS or Torque. Thanks.
user3282611
  • 870
  • 1
  • 9
  • 9
6
votes
3 answers

How to use qdel all to remove only idle jobs

I am having an issue where if I have 2000 jobs queued and try to delete them with qdel all, it'll keep trying to delete the running jobs first. This means I have to wait a long time for the jobs to get deleted because removing from the Running list…
user2763361
  • 3,789
  • 11
  • 45
  • 81
6
votes
2 answers

Loading shared library in open-mpi/ mpi-run

I'm trying to run my program using torque scheduler using mpi run. Though in my pbs file I load all the library by export LD_LIBRARY_PATH=/path/to/library yet it gives error i.e. error while loading shared libraries: libarmadillo.so.3: cannot…
arbitUser1401
  • 575
  • 2
  • 8
  • 25
5
votes
2 answers

Does a PBS batch system move multiple serial jobs across nodes?

If I need to run many serial programs "in parallel" (because the problem is simple but time consuming - I need to read in many different data sets for the same program), the solution is simple if I only use one node. All I do is keep submitting…
Lt Dax
  • 143
  • 1
  • 5
5
votes
1 answer

what are the main differences between TORQUE, HTCondor and Apache Mesos

http://www.adaptivecomputing.com/products/open-source/torque/ https://research.cs.wisc.edu/htcondor/ I am looking for a program to perform distributed computing (no parallel computing needed though) which has: a scheduler a queue management…
RockScience
  • 17,932
  • 26
  • 89
  • 125
1
2 3
16 17