I'm trying to get some values(which I get using extract function) from urls which are stored in data.file and there are about 3000000 url links in the file. here is my code snippet,
from multiprocessing import Pool
p = Pool(10)
revenuelist = p.map(extract, data.file )
But the problem is, due to internet connection, this is code runs again if there connection problem. How do I add fault tolerance to my code(i:e store intermediate result, to avoid repetition of doing same task).