I am trying to find the main image link(Usually, the Infobox one) from a page within cirrussearch wikipedia dump. I am able to get it by using wikipedia API but its too much overhead for Wikipedia server to get it for all wikipedia pages. As a matter of fact, I'd like to get it from an offline dump.
There is an interesting stackoverflow post to generate the link from the image name(md5 on the name), and append the result to the domain https://upload.wikimedia.org/wikipedia/commons/ Unfortunately, it does not work for all images. Example for Bouygues Telecom where the image path is not on wikipedia/commons but on wikipedia/fr
I also tried to get it from http://dumps.wikimedia.org/enwiki/latest/enwiki-latest-imagelinks.sql.gz but it does not help me.
Do you know if there is any possibility to get the main image link of a page from a dump?