I am trying to download all of the Google Doodles (about 2000, excluding the .gif and interactive ones) from their archive, but I have had no luck so far.
I have tried many scraping commands in the terminal and a few packages from GitHub, such as ImageScraper. Each approach I have tried fails in one of these ways:
- the process stops after about 100 images (using this Chrome extension)
- only some sample images are downloaded (most command-line tools did this)
- only a couple of images are downloaded and the process does not keep going
As far as I can see, the archive itself is not indexed, and each image has its own file name, which makes the images harder to enumerate. The page also loads more images only as you scroll further down, which is probably why most tools download just the first few. I am also worried that the connection is being cut off after a number of requests, probably to prevent overload, so the images cannot all be downloaded at once. I am not sure how to work around this.
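To make the goal concrete, here is a minimal sketch of the download step I have in mind. It assumes the list of image URLs has already been collected somehow, which is the part I have not solved. The function names and the `delay` and `retries` parameters are my own placeholders, not part of any existing tool. It skips .gif files, pauses between requests, retries failed downloads, and skips files that already exist so an interrupted run can be resumed.

```python
import os
import time
import urllib.request
from urllib.parse import urlparse


def is_static_image(url):
    """Keep still images only; skip animated .gif doodles."""
    path = urlparse(url).path.lower()
    return path.endswith((".png", ".jpg", ".jpeg"))


def fetch_bytes(url, timeout=30):
    """Download one URL and return its raw bytes."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def download_all(urls, out_dir, fetch=fetch_bytes, delay=1.0, retries=3):
    """Download each static image, pausing between requests.

    Files that already exist are skipped, so a stopped run can be resumed.
    Returns the URLs that still failed after all retries.
    """
    os.makedirs(out_dir, exist_ok=True)
    failed = []
    for url in urls:
        if not is_static_image(url):
            continue
        target = os.path.join(out_dir, os.path.basename(urlparse(url).path))
        if os.path.exists(target):
            continue  # already downloaded in an earlier run
        for attempt in range(retries):
            try:
                data = fetch(url)
            except OSError:  # urllib's URLError and HTTPError are OSError subclasses
                time.sleep(delay * (attempt + 1))  # wait longer before each retry
                continue
            with open(target, "wb") as handle:
                handle.write(data)
            break
        else:
            failed.append(url)  # every attempt raised an error
        time.sleep(delay)  # throttle so the server is not hit in a burst
    return failed
```

With a hypothetical `urls.txt` holding one image URL per line, I would call it as `download_all(open("urls.txt").read().split(), "doodles")` and rerun it on whatever it returns.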
I would very much appreciate help from anyone who has experience retrieving or scraping images like these, given the problems mentioned above.