Questions tagged [lsf]

LSF (Load Sharing Facility) is software that executes batch jobs on networked Unix and Windows systems across many different architectures. It is commonly used on HPC clusters at universities and research centers around the world.

The Load Sharing Facility, or LSF, is a set of tools for distributing jobs across a set of networked systems. It was initially developed by Platform Computing, which was acquired by IBM in 2012. It is now called IBM Spectrum LSF.

229 questions
0
votes
1 answer

What is the equivalent syntax of LSF for this PBS command?

In a PBS script, this allocates 4 CPUs on each of 4 nodes: #PBS -l select=4:ncpus=4 How do I translate this into an LSF command? I think the following will do: #BSUB -n 16 -R "span[ptile=4]" Am I correct?
nasica88
  • 1,185
  • 10
  • 10
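For the PBS-to-LSF question above, the arithmetic behind the proposed translation can be sketched in shell. This is a minimal sketch, assuming that `-n` takes the total number of slots and that `span[ptile=N]` limits the job to N slots per host; `bsub_directives` is a hypothetical helper name, not part of LSF.

```shell
#!/bin/bash
# Hypothetical helper: print #BSUB lines for a job that wants
# CPUS_PER_NODE slots on each of NODES hosts.
bsub_directives() {
    local nodes=$1
    local cpus_per_node=$2
    # Assumption: -n is the total slot count, so multiply.
    local total=$(( nodes * cpus_per_node ))
    printf '#BSUB -n %d\n' "$total"
    # Assumption: ptile caps the slots placed on any one host.
    printf '#BSUB -R "span[ptile=%d]"\n' "$cpus_per_node"
}

# 4 nodes with 4 CPUs each, as in the PBS line quoted in the question
bsub_directives 4 4
```

With 4 nodes and 4 CPUs per node this prints the same two directives the asker proposed, which suggests the translation is at least internally consistent.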
0
votes
0 answers

Launch jobs in LSF one batch after the other

I have, say, 10,000 orders. Each order has multiple (10 to 5,000) order items. I have a pipeline in Spring Batch to do some processing on that data. I have one job (tasklet style) written to process orders and another job written to process individual…
sureshhewabi
  • 91
  • 11
0
votes
2 answers

How to get job ID from job array within for loop?

I am running a greedy feature selection algorithm, and I am attempting to use job arrays to explore parallelization. The idea is that we have three steps that depend on the previous step: Step 1: Setup for iteration i Step 2: Fit models at…
nespereira
  • 67
  • 6
0
votes
1 answer

How to delete a lot of directories using LSF

I have access to an LSF cluster. Is it possible to delete (rm -rf) a whole bunch of directories and take advantage of LSF? Thanks in advance.
justrajdeep
  • 855
  • 3
  • 12
  • 29
0
votes
1 answer

How to extend time for program to finish after USR2 signal

I have a program that will run for a very long time on my university's LSF cluster. I don't know if it will finish before it exceeds its job's time limit. If a job exceeds the time limit, the LSF system will send increasingly unfriendly…
Iridium
  • 113
  • 1
  • 8
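For the USR2 question above, one common pattern is to catch the signal inside the job script and use the remaining time to save state rather than to keep computing. The following is a minimal sketch, assuming the scheduler delivers SIGUSR2 some time before it kills the job, as the question describes; `do_step`, `save_state`, and `run_loop` are hypothetical placeholders for the real work.

```shell
#!/bin/bash
# Sketch: note that SIGUSR2 arrived, stop cleanly, and save state.
stop_requested=0
on_usr2() { stop_requested=1; }
trap on_usr2 USR2

steps_done=0
# Hypothetical unit of work; replace with one iteration of the real program.
do_step() { steps_done=$(( steps_done + 1 )); }
# Hypothetical checkpoint; replace with code that writes restart data.
save_state() { echo "checkpoint after step $steps_done"; }

run_loop() {
    max_steps=$1
    # Check the flag between steps so each step either completes or never starts.
    while [ "$steps_done" -lt "$max_steps" ] && [ "$stop_requested" -eq 0 ]; do
        do_step
    done
    save_state
}
```

Calling `run_loop` with the number of steps to perform runs until either the work is done or the flag is set. The sketch does not extend the time limit itself; it only makes sure the job leaves a checkpoint behind.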
0
votes
1 answer

How to limit job submission depending on location or partition

I am wondering if there is a way to limit job submission depending on the location from which the submission was made on the HPC system. The thing is, storage for a scratch disk was recently added, so now I have two partitions: a home directory and a scratch directory…
Seong
  • 556
  • 4
  • 18
0
votes
0 answers

Passing calculation commands to cluster job

TL;DR Trying to pass a computation of the form $(($LSB_JOBINDEX-1)) to a cluster call, but getting an error $((2-1)): syntax error: operand expected (error token is "$((2-1))") How do I escape correctly or what alternative command to use so that…
FlyingTeller
  • 17,638
  • 3
  • 38
  • 53
0
votes
0 answers

LSF ERROR:Project must be 'acc_*'

I need to run a Python script on a supercomputer by submitting a job with LSF. I have been trying to become acquainted with the syntax using a simple example script: #!/bin/bash #BSUB –q alloc #BSUB –n 1 #BSUB –o t.out echo “Salve Munde!” I saved…
John Smith
  • 393
  • 1
  • 6
  • 17
0
votes
0 answers

Running multiple make instances at a time results in errors

I have an LSF system where commands can be sent to different machines to execute at one time, with a shared filesystem. I want to make different modules of my C program at one time. Like 'make reportmoduletestbench' and 'make…
苹果user3822749
  • 154
  • 1
  • 1
  • 6
0
votes
1 answer

Submitting LSF job array using different arguments for each element of the array

I'm trying to avoid submitting separate jobs. So far I have this at the start of my script: #!/bin/bash #BSUB -P account #BSUB -q queue #BSUB -W 48:00 #BSUB -n 2 #BSUB -R rusage[mem=40000] #BSUB -J jobname[1-22] #BSUB -a 000-176:1 #BSUB -eo…
joeblow
  • 13
  • 6
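For the job array question above, a frequently used approach is to keep one argument per line in a text file and let each array element pick its own line by index. The following is a minimal sketch, assuming LSF sets `LSB_JOBINDEX` for each element of the array; `args.txt` and `arg_for_index` are hypothetical names chosen for illustration.

```shell
#!/bin/bash
# Sketch: look up the argument for one array element.
# FILE holds one argument per line; INDEX is 1-based, like LSB_JOBINDEX.
arg_for_index() {
    local file=$1
    local index=$2
    sed -n "${index}p" "$file"
}

# Default to 1 so the sketch also runs outside LSF.
index=${LSB_JOBINDEX:-1}

# args.txt is a hypothetical file; skip quietly if it does not exist.
if [ -f args.txt ]; then
    arg=$(arg_for_index args.txt "$index")
    echo "element $index would run with argument: $arg"
fi
```

With a 22-line `args.txt`, an array declared as `jobname[1-22]` would give each element a different line, so a single submission covers all arguments.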
0
votes
1 answer

Can LSF be configured to restrict access to a queue based on executable or by passing a token

I'm a casual and mostly inexperienced LSF user, so please bear with me... I develop software in a corporate setting that submits jobs to LSF for processing. We have a set of machines that we want to use for a specific application but not open up to…
daveg
  • 1,051
  • 11
  • 24
0
votes
1 answer

ERROR: No OpenCL platforms found, check OpenCL installation

I tried to run a Matlab program on a GPU (CentOS 7.3). The program uses Caffe. When I run it from the command line with: matlab -nodisplay -r "demo, quit" it runs okay. When I run it with the LSF command: bsub -q gpu -R "select[ngpus>0]…
user1980099
  • 573
  • 1
  • 8
  • 30
0
votes
1 answer

How to prevent multiple executables from running at the same time on cluster

I have submitted a job to a multicore cluster with LSF platform. It looks like the code at the end. The two executables, exec1 and exec2, start at the same time. In my intention they are separated by a column comma and the second should start after…
simona
  • 2,009
  • 6
  • 29
  • 41
0
votes
1 answer

Make cshell script wait until LSF job finishes

I have a shell script that submits a task to an LSF server, and certain commands should be run on the output files only after the job has finished. But since I put the task into bsub, the task gets executed on another server, if I put the…
王云龙
  • 71
  • 8
0
votes
0 answers

SAS EG is unable to connect to SAS APP Server context; also unable to validate SAS logical workspace server and SAS grid server in management console

SAS environment details: SAS 9.4_M3, 2 metadata servers, 19 grid nodes, 1 grid master, 2 web servers. Issue in grid: SAS EG is unable to connect to the sasAPP/sasAPPgrid server context. Error message: "Unable to establish connection with SAS APP". Also, when…
DGaynar
  • 1
  • 1
  • 2