I'm running Rcrawler on a very large website, so it takes a very long time (3+ days with default page depth). Is there a way to not download all the HTMLs to make the process faster?
I only need the URLs that are stored in the INDEX. Or can anyone recommend another way to make Rcrawler run faster?
I have tried running it with a smaller page depth (5), but it is still taking forever.