Is it possible to use python's integration of dmtcp to checkpoint a child process in parallel execution?
My situation is as follows: I have a multiprocessing.Pool with several workers receiving async jobs (using apply_async). Certain big jobs require all the resources (cpu cores & memory). When one of these jobs is accepted, I'd like to checkpoint all pending processes, kick them out execution, launch the big job and finally resume the checkpointed processes.