I am trying to compute the internal PageRank of Wikipedia using MapReduce. I implemented my PageRank algorithm on a small subset of Wikipedia pages. There are 6349 pages. I used this formula to calculate the PageRank (d = 0.85).
I wanted to verify whether the sum of all the PageRank values is equal to the total number of pages (6349).
What I found so far:
1. The total PageRank of all 6349 pages is 1001.26044.
2. According to Wikipedia, if I use the above formula, then each PageRank is multiplied by N and the sum becomes N. I multiplied each PageRank by N (6349) and calculated the sum; I got 6356789.5.
Is there a reason why the sum of the PageRank values is not equal to the total number of pages? Should I use the second formula to verify this?
Note: I ran my MapReduce code for 10 iterations to get a good approximation.
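
To make the check concrete, here is a minimal single-machine sketch of the sum test on a toy graph. It is not the MapReduce code from this post. It assumes the first formula is the unnormalized form PR(p) = (1 - d) + d * sum(PR(q) / L(q)) and the second is the normalized form with (1 - d) / N as the constant term. The function name `pagerank`, the `normalized` flag, and the graph `toy_links` are hypothetical, chosen only for illustration. The sketch also assumes that rank held by pages with no outgoing links is spread evenly over all pages; if that rank is dropped instead, the total is not conserved from one iteration to the next.

```python
def pagerank(links, d=0.85, iterations=10, normalized=False):
    """Iterate PageRank on a small in-memory graph.

    links: dict mapping each page to the list of pages it links to.
           Every page must appear as a key, even if its list is empty.
    normalized: False uses the constant term (1 - d), expected sum N;
                True uses (1 - d) / N, expected sum 1.
    """
    n = len(links)
    base = (1 - d) / n if normalized else (1 - d)
    start = 1.0 / n if normalized else 1.0
    ranks = {page: start for page in links}

    for _ in range(iterations):
        new_ranks = {page: base for page in links}
        for page, outgoing in links.items():
            if outgoing:
                # Split this page's damped rank among the pages it links to.
                share = d * ranks[page] / len(outgoing)
                for target in outgoing:
                    new_ranks[target] += share
            else:
                # Assumption: a page with no outgoing links spreads its
                # damped rank evenly over all pages, so no rank is lost.
                share = d * ranks[page] / n
                for target in links:
                    new_ranks[target] += share
        ranks = new_ranks
    return ranks


# Hypothetical toy graph: D has no outgoing links.
toy_links = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": []}

unnormalized = pagerank(toy_links, normalized=False)
normalized = pagerank(toy_links, normalized=True)

print("sum with (1 - d):    ", sum(unnormalized.values()))
print("sum with (1 - d) / N:", sum(normalized.values()))
```

Under these assumptions the first sum stays at N (4 for the toy graph) and the second stays at 1, up to floating-point rounding, regardless of the number of iterations.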