I have an issue in trying to write a scraper for RightMove.
In a generic search like this:
http://www.rightmove.co.uk/property-for-sale/find.html?locationIdentifier=OUTCODE^1891&insId=3&minBedrooms=2&maxBedrooms=2&displayPropertyType=houses&oldDisplayPropertyType=houses&googleAnalyticsChannel=buying
We have an OUTCODE parameter. So an outcode is the begining of a UK postcode, for example OX17, but I cant tell where the number 1891 comes from. There is the possibility that rightmove have some propriatary system in place in order to stop scrapers, but there may be some database.
I have a CSV I downloaded from some 'official' website, but I dont remember where (sorry about that). The outcodes have a number but they dont corrolate to what I was expecting.
I also found this by mistake on google (I think it is the full version of the CSV I had) which is verry similar to what I was playing with.
Thanks,
Joe
======================================================
So, turns out the nuber iterates from 1 to something (not sure yet).
1891 is OX17, 1890 is OX16 and so on.
I wrote a simple scraper which goes from location Ident 1 until there are no locations left and adds the to a Mongo Database... Will put everything on a GitHub Repo when it is done. I will also export the db as JSON for anyone who wants it.
Tah!