4

I'm doing this only for learning purposes. I have no intention of reverse-engineering the methods of IMDB.

I asked myself: if I owned IMDB or a similar website, how would I compute a movie's rating?

All I can think of is a weighted average (which here is nothing but the arithmetic mean of all the votes).

For the movie data provided below, the computation would be:

(38591*10 + 27994*9 + 32732*8 + 17864*7 + 7361*6 + 2965*5 + 1562*4 + 1073*3 + 891*2 + 3401*1) / 134434 = 8.17055953
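
The same arithmetic as a small Python check. This is only a sketch; the variable names are illustrative, and the vote counts are the ones from the formula above.

```python
# Vote histogram from the formula above: rating value -> number of votes.
votes = {10: 38591, 9: 27994, 8: 32732, 7: 17864, 6: 7361,
         5: 2965, 4: 1562, 3: 1073, 2: 891, 1: 3401}

total_votes = sum(votes.values())
# Each rating value is weighted by how many users chose it.
weighted_sum = sum(rating * count for rating, count in votes.items())
mean_rating = weighted_sum / total_votes

print(total_votes)            # 134434
print(round(mean_rating, 4))  # 8.1706
```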

[image: vote breakdown for the movie]


My rating of 8.17055953 doesn't match IMDB's rating (the weighted average). So my conclusion is that either I'm missing something here or my score is not an ideal score. I might be missing a lot of things.

  1. What's wrong with my score? Why is it not ideal?
  2. If you had to compute it, how would you have done it?
Josh Lee
claws
  • 1
    Everyone: claws is *not* asking about the ranking mechanism for the Top/Bottom listing. This question is about how IMDb turns the arithmetic mean into the weighted mean. Take "Manos: The Hands of Fate" — Unweighted mean 2.1, weighted mean 1.5. (Or "The Godfather", another example of ballot stuffing.) – Josh Lee Sep 15 '10 at 12:55

2 Answers

7

Not exactly on topic, but useful for you:

How the ranking is calculated.

I hope that helps.

P.S. This is confirmed on this page:

IMDB Chart
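
For reference, the "true Bayesian estimate" used for the chart is usually quoted as WR = (v / (v + m)) × R + (m / (v + m)) × C, where R is the title's mean rating, v its number of votes, m the minimum number of votes needed to be listed, and C the mean rating across all titles. A minimal sketch, assuming that form; the values of `m` and `C` below are made-up placeholders, not IMDB's actual parameters:

```python
def bayesian_rating(R, v, m, C):
    """Weighted rating WR = (v / (v + m)) * R + (m / (v + m)) * C.

    R: mean rating of the title
    v: number of votes for the title
    m: minimum votes required to be listed (assumed value below)
    C: mean rating across all titles (assumed value below)
    """
    return (v / (v + m)) * R + (m / (v + m)) * C

# Hypothetical parameters, chosen only to illustrate the behaviour.
m = 25000
C = 7.0

few_votes = bayesian_rating(R=9.5, v=100, m=m, C=C)
many_votes = bayesian_rating(R=9.5, v=1_000_000, m=m, C=C)
print(few_votes, many_votes)
```

With few votes the result stays close to C; as v grows it approaches the title's own mean R.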

Sam Chats
Dave
  • I found the Wikipedia page a bit hard to digest (not an actuary), but at the bottom of the IMDB Top 250 page they say "true Bayesian Estimate" and provide the same formula as Wikipedia. I found this page a bit easier to read: http://wiki.answers.com/Q/What_does_true_Bayesian_estimate_mean_in_connection_with_the_IMDb_Top_250_ratings – M. Tibbits Sep 30 '10 at 13:12
  • That is a less wordy and much more comprehensible article, thanks for the link :) – Dave Oct 04 '10 at 08:30
4

First up: your calculation (which is an unweighted average, or simply the mean) has a typo: you have 19562 '4' ratings, but you should have 1562. Correcting this gives ~8.17, which rounds to the same value as IMDB's 'Arithmetic mean'.

Next, if you do want a weighted average, you need something to weight with! So if, for example, users have a 'reputation' or similar (even just the length of time they have been registered), you could use that as a multiplier for the weight of their rating. At the end, divide by the sum of the weights rather than the total number of votes, and you have a weighted average.
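
A minimal sketch of that idea, assuming each vote comes with its voter's reputation. The function name and the sample reputations are hypothetical:

```python
def weighted_average(ratings, weights):
    """Sum of rating * weight, divided by the sum of the weights."""
    if len(ratings) != len(weights):
        raise ValueError("ratings and weights must have the same length")
    total_weight = sum(weights)
    if total_weight == 0:
        raise ValueError("the weights must not sum to zero")
    return sum(r * w for r, w in zip(ratings, weights)) / total_weight

ratings = [10, 8, 3]
reputations = [1, 50, 2000]  # hypothetical per-user reputations

print(weighted_average(ratings, [1, 1, 1]))    # equal weights: the plain mean
print(weighted_average(ratings, reputations))  # pulled towards the high-rep vote
```

With equal weights this reduces to the ordinary mean; with the raw reputations, the single high-reputation vote dominates the result.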

A scaling factor of some kind for the weight would probably be needed. For example, if SO did something like this, it might be a little unfair for Jon Skeet's opinion to carry two hundred thousand times the weight of a newly-registered user's opinion :)
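
One possible scaling, purely as an assumption for illustration, is to take a logarithm of the reputation, so that a large reputation still counts for more, but not proportionally more. The function names here are hypothetical:

```python
import math

def damped_weight(reputation):
    """Compress reputation logarithmically; anything below 1 is treated as 1."""
    return 1.0 + math.log10(max(reputation, 1))

def damped_weighted_average(ratings, reputations):
    """Weighted average of ratings using log-damped reputations as weights."""
    weights = [damped_weight(rep) for rep in reputations]
    return sum(r * w for r, w in zip(ratings, weights)) / sum(weights)

new_user = damped_weight(1)        # 1.0
veteran = damped_weight(200_000)   # roughly 6.3
print(veteran / new_user)
```

Under this choice, a user with a reputation of 200,000 carries roughly six times the weight of a new user instead of two hundred thousand times.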

AakashM