I'll start with describing my problem.
I have n pages each with its own popularity factor. Popularity factor is on a scale of 10. Now, I have total page hits for each of the pages with me and I want to use those total page hits for calculating the popularity factor again on a scale of 10.
The total page hits is an absolute number and I have these values for only 1,70,000 pages. The total pages which I have with me is 41,00,000.
Now, my problem is I don't know how to normalize these total page hits for all of the total pages.
I tried doing this:
Popularity factor for each page = Total page hits for all the pages/total no. of pages.
I'll assume that the pages with no data will be having at least 1 total page hits. But that way my denominator becomes really big number and in the process of scaling on a scale of 10, I'm lost.
Can anyone please help with how can I approach it ?