I am using a rather large snakemake file to call mutations for 30 patients. The first step of the workflow is the alignment. The problem that I am running into is, that the snakemake scheduler seems to perform the first step (i.e. the alignment) first for all 30 patients. This in turn requires a huge amount of (temporarily) disk space (>>10TB). This is quite ineffective because when the workflow finishes, one patient only takes up less than 1 GB (only vcf files).
So my question is if there is a way to "force" snakemake to finish processing one patient before starting the alignment (the first step) for a new patient, while still parallizing everything.
I tried the --prioritize
option to prioritize the last rule of the workflow, but that did not seem to do the trick.
Any help is much appreciated!
Cheers!