I am trying to scrape all sites from THIS website.
I will use www.site.com instead of the real domain just to simplify my problem.
Basically, there is a list of around 300,000 sites, and each page has 30 results, so there should be around 10,000 pages.
This is an example:
www.site.com/1 -> sites 1-30
www.site.com/2 -> sites 31-60
www.site.com/3 -> sites 61-90
www.site.com/4 -> sites 91-120
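To make the paging arithmetic concrete, here is a minimal sketch. The function names `page_range` and `page_count` are made up for illustration, and the only assumption is the 30 results per page mentioned above.

```python
from typing import Tuple

PAGE_SIZE = 30  # results per page, as described above


def page_range(page: int, page_size: int = PAGE_SIZE) -> Tuple[int, int]:
    """Return the 1-based (first, last) result positions listed on a page."""
    first = (page - 1) * page_size + 1
    last = page * page_size
    return first, last


def page_count(total: int, page_size: int = PAGE_SIZE) -> int:
    """Return how many pages are needed to list `total` results."""
    return -(-total // page_size)  # ceiling division without floats


if __name__ == "__main__":
    print(page_range(1))        # first page
    print(page_range(167))      # the page that contains result 5000
    print(page_count(300_000))  # pages needed for the full list
    print(page_count(5_000))    # pages needed for a 5000-result cap
```

With these numbers, a cap of 5000 results needs exactly 167 pages, which matches the page where the listing stops.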
The problem is that once I reach page 167, no further results are shown, so I can only see the first 5,000 sites.
When I write this:
I get this error: PHP Warning – yii\base\ErrorException
Click HERE to see full error.
I was able to create a script in Python that scrapes the first 5,000 sites, but I have no idea how to access the full list.
For example, the page also lets you search for certain keywords, but again, if a search has more than 5,000 results, only the first 5,000 sites are shown.
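A minimal sketch of that kind of page-walking loop, not the actual script: `fetch_page` is a hypothetical callable that downloads and parses one page and returns its entries, and `fake_fetch` only simulates a server that stops listing results after 5000, so nothing here touches the real site.

```python
from typing import Callable, List


def scrape_until_empty(fetch_page: Callable[[int], List[str]],
                       max_pages: int = 10_000) -> List[str]:
    """Walk pages 1, 2, 3, ... and stop at the first page with no results."""
    results: List[str] = []
    for page in range(1, max_pages + 1):
        items = fetch_page(page)
        if not items:
            break  # the server returned nothing more for this page
        results.extend(items)
    return results


def fake_fetch(page: int, cap: int = 5_000, page_size: int = 30) -> List[str]:
    """Stand-in for a real HTTP request: pretends the listing ends at `cap`."""
    first = (page - 1) * page_size + 1
    last = min(page * page_size, cap)
    return [f"site-{i}" for i in range(first, last + 1)]


if __name__ == "__main__":
    sites = scrape_until_empty(fake_fetch)
    print(len(sites))  # number of entries collected before the cap
```

In a real run, `fetch_page` would request www.site.com/<page> and extract the entries from the HTML; how to do that depends on the page markup, which is not shown here.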
Any ideas on how to solve this problem?