I am using PFactory (panorama factory) to stitch images in batch mode. Currently it uses multiple threads to do the same on CPU. I want to run it on my nvidia GPU (which has 144 cores). I wrote a kernel program which calls - system(pfactory) - to execute one instance of PFactory. But when I compile it using nvcc, it throws up an error - can't execute a host function in device..."
Is there a way by which I could run multiple instances of this program on GPU cores?