I am parsing through wikipedia dump in java. In my module I want to know the page id of the internal pages of wiki those are referred by the current page. Getting the internal links and thus the url from it is easy. But how to get Page ID from url.
Do I have to use some mediaWiki for this? If yes how Any other alternative?
for eg: http://en.wikipedia.org/wiki/United_States I want to get its Page-Id i.e 3434750