in my mpi program each process has or works with a block of data:
char *datablock;
the blocks are of similar but not identical size.
What is the best way (which functions to use and how) to distribute those blocks from each process to each other process? In the end i want each process to have (maybe) an array of all blocks:
char **blockarray;
so that
*blockarray[i] // for i in [0... number_of_processes-1]
is the former block of the ith process. BUT it is not the order that matters, "i" does not have to be the process id and the order can differ on each process (if this is faster)! I just want the fastest way to get each block on each thread.