Questions tagged [azure-batch]

Use the azure-batch tag for questions about any aspect of the Azure Batch service, including job scheduling, compute node provisioning, and pool orchestration. Questions about the wider ecosystem are also welcome, such as Azure Batch accelerator offerings, the Azure Portal Batch blades, and Batch Explorer.

Azure Batch is a job scheduling and compute management solution for the Microsoft Azure cloud. The service is available in all Azure regions and can deploy nearly all available Azure VM sizes, including those with specialized hardware such as GPUs and InfiniBand/RDMA interfaces.

Azure Batch is a managed job scheduling solution where the underlying compute resources are allocated as pools. A pool consists of Linux or Windows VMs, can hold any number of dedicated and low-priority nodes, and can be scaled automatically through autoscale formulas. An Azure Batch job can be thought of as a queue for tasks. Each job can be assigned to a pool. Any number of tasks can be assigned to a job, including multi-instance (i.e., multi-node) tasks, which support MPI workloads. Azure Batch also supports containerized execution.
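The pool/job/task relationship described above can be sketched with a small toy model. This is not the Azure Batch SDK: the `Pool` and `Job` classes, their fields, and the `run_round` method are hypothetical names invented for illustration, and the strict first-in-first-out order and one-task-per-node rule are simplifying assumptions rather than a description of how the real scheduler behaves.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class Pool:
    """Toy stand-in for a Batch pool: a named group of compute nodes."""
    pool_id: str
    dedicated_nodes: int = 0
    low_priority_nodes: int = 0

    @property
    def total_nodes(self) -> int:
        # Both node kinds can run tasks in this simplified model.
        return self.dedicated_nodes + self.low_priority_nodes


@dataclass
class Job:
    """Toy stand-in for a Batch job: a queue of tasks bound to one pool."""
    job_id: str
    pool: Pool
    tasks: deque = field(default_factory=deque)

    def add_task(self, command_line: str) -> None:
        # A task is represented only by its command line here.
        self.tasks.append(command_line)

    def run_round(self) -> list:
        """Dequeue at most one task per node (assumed FIFO order)."""
        count = min(self.pool.total_nodes, len(self.tasks))
        return [self.tasks.popleft() for _ in range(count)]


# Hypothetical pool with three nodes and a job holding five tasks.
pool = Pool("demo-pool", dedicated_nodes=2, low_priority_nodes=1)
job = Job("demo-job", pool)
for i in range(5):
    job.add_task(f"echo task-{i}")

first = job.run_round()   # three tasks, one per node
second = job.run_round()  # the remaining two tasks
print(first)
print(second)
```

With three nodes and five queued tasks, the first round drains three tasks and the second drains the remaining two, which is the sense in which a job behaves like a queue feeding a pool.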

For a full introductory treatment of Azure Batch, see the Azure Batch Technical Overview. The service team also provides a GitHub repository for issue tracking, feature requests, and change logs.

416 questions
0
votes
1 answer

Azure Batch pool of VMs and on-premises Active Directory

Can VMs from the pool of Azure Batch service be joined to on-premises Active Directory while executing the jobs so that they can access on-premises resources and data stores?
YuGagarin
  • 341
  • 7
  • 20
0
votes
1 answer

MaxInternalRetryCountReached - The specified task has reached maximum internal retry count

Would you please explain when this task scheduling error will happen? MaxInternalRetryCountReached - The specified task has reached maximum internal retry count MSDN link here. The background is that I see this error sometimes, for jobs with tasks…
rprasadk
  • 5
  • 2
0
votes
1 answer

Azure Process Orchestration

I'm looking for advice on what the best practice is with regards to process orchestration. To give some context I have the following tasks to orchestrate: Scale up Azure Batch Pool to provide adequate nodes Execute custom .Net code which calls a…
DavidReid
  • 449
  • 1
  • 5
  • 21
0
votes
2 answers

Cannot load R packages on Azure Batch nodes

I am having difficulty loading packages into R on my compute pool nodes using the Azure Batch Python API. The code that I am using is similar to what is provided in the Azure Batch Python SDK Tutorial, except the task is more complicated -- I want…
ncemami
  • 439
  • 2
  • 10
0
votes
1 answer

How to silently get access token to user subscription Azure Batch?

I am working on a project where we have a service that runs computations on Azure Batch in user subscription mode (because we are using a custom image). My code is now fully working, but every launch requires providing user credentials to log into…
Daniel
  • 149
  • 1
  • 1
  • 8
0
votes
1 answer

Azure Batch: Frequency based scheduling

How can I configure a frequency-based schedule on the Azure Batch service (e.g., an hourly/daily/weekly job)? I suppose the Azure Batch service has job scheduling features, but I couldn't find a description of time-based scheduling, although I found this page that…
Naoto Usuyama
  • 845
  • 1
  • 7
  • 13
0
votes
1 answer

Azure Batch - Setting custom user identity for tasks

I am using Azure Batch C# Client API 6.1. I am trying to have all my runs using the same user identity. I am setting a custom user identity as below, as per MSDN documentation. var task = new CloudTask("{guid}", "command string") { DisplayName…
rprasadk
  • 5
  • 2
0
votes
1 answer

Failed to delete Azure Batch Pool (Stays in Allocation State "Stopping")

I have created an Azure Batch Pool inside a Batch Account. After playing with the scaling formula for a while, the pool now reports that it is in allocation state "stopping". It has shown this allocation state for about two weeks now. Neither…
S. Pauka
  • 1
  • 2
0
votes
0 answers

Azure Batch - No files on node after task completion

I'm using the Azure Batch .NET API. After task completion, I try to retrieve the output files (or any files) from the tasks via CloudTask.GetNodeFile() but this throws a "not found" exception. Same for stdout…
tomi.lee.jones
  • 1,563
  • 1
  • 15
  • 22
0
votes
1 answer

Azure batch job start tasks failed

I'm using the Azure Batch Python API. When I create a new job, I see exit code 128 (image attached). How can I find out the reason for that? I'm creating the new job using this code: def wrap_commands_in_shell(commands): return "/bin/bash -c…
liorko
  • 1,435
  • 2
  • 24
  • 41
0
votes
2 answers

Unexpected token error creating a .ps1 startup task in azure batch

I'm trying to run a powershell script in batch StartTask, but met an error. The following is the code where I went wrong when creating a pool startTask: def create_pool(batch_service_client, pool_id, resource_files, node_os_family): …
user123
  • 231
  • 2
  • 12
0
votes
1 answer

AddTaskAsync BatchErrorException: Operation returned an invalid status code 'BadRequest'

I have implemented an Azure Batch example using a template based on the following sample: https://github.com/Azure/azure-batch-samples/tree/master/CSharp/TextSearch My solution creates a Job Manager task, which in turn creates multiple child tasks…
Terry Mandin
  • 140
  • 1
  • 3
  • 11
0
votes
2 answers

Azure Batch Instance Sizes

I'm running massive parallel jobs on Azure Batch now, works ok but to my regret I can't seem to use A0 instances. The jobs I run are very low in memory or disk usage, so I only want 'bare' cores ideally. A1 instances are fine also, but the cost ~ 5…
0
votes
1 answer

Azure batch job error:Could not load file or assembly 'Microsoft.Data.OData

I read https://azure.microsoft.com/en-us/documentation/articles/batch-dotnet-get-started/#step-2-upload-task-application-and-data-files and downloaded the sample file. I could run the sample without any issues. However, when I created a new…
esther fang
  • 149
  • 2
  • 12
0
votes
1 answer

Existing code on github, principal functions never called

I am trying to work on an add-on developed by Microsoft Azure for its old cloud service. The aim is to render Blender scenes using the Azure environment. Here it is: https://github.com/Azure/azure-batch-apps-blender As Microsoft doesn't support this…
Romain
  • 799
  • 1
  • 9
  • 29