Let's say there is a website abc.com and we crawl abc.com for 100 pages as below.
Day 1: create a crawl job in heritrix by specifying maxDocumentsToDownload as 100 Day 2: clone the above job in heritrix and run.
If website doesn't change over two days of time, will I be getting same 100 pages or different set of 100 pages?
In case any more information is required please let me know
Thanks, Hareesh