Donald Knuth algorithm for Mastermind - can we do better?

Question

I implemented Donald Knuth 1977 algorithm for Mastermind https://www.cs.uni.edu/~wallingf/teaching/cs3530/resources/knuth-mastermind.pdf

I was able to reproduce his results - 5 guess to win in the worst case and 4.476 on average.

And then I tried something different. I ran Knuth's algorithm repeatedly and shuffled the entire list of combinations randomly each time before starting. I was able to land on a strategy with 5 guesses to win in the worst case (like Knuth) but with 4.451 guesses to win on average. Better than Knuth.

Are there any previous work trying to outperform Knuth algorithm on average , while maintaining the worst case ? I could not find any indication of it on the web so far.

Thanks!

Alon

The only question mark in your question is in “Are there any previous work trying to outperform Knuth algorithm on average , while maintaining the worst case?” so that's what I answered. :-) — ShreevatsaR, Mar 01 '19 at 00:48

score 2 · Answer 1 · answered Feb 28 '19 at 02:50

In the paper, Knuth describes how the strategy was chosen:

Table 1 was found by choosing at every stage a test pattern that minimizes the maximum number of remaining possibilities, over all conceivable responses by the codemaker. If this minimum can be achieved by a “valid” pattern (a pattern that makes “four black hits” possible), a valid one should be used. Subject to this condition, the first such test pattern in numeric order was selected. Fortunately this procedure turns out to guarantee a win in five moves.

So it is to some extent a greedy strategy (trying to make the most progress at each step, rather than overall), and moreover there's an ad-hoc tie-breaking strategy. This means that it need not be optimal in expected value, and indeed Knuth says exactly that:

The strategy in Table 1 isn’t optimal from the “expected number of moves” standpoint, but it is probably very close. One line that can be improved [...]

So already at the time the paper was published, Knuth was aware that it's not optimal and even had an explicit example.

When this paper was republished in his collection Selected Papers on Fun and Games (2010), he adds a 5-page addendum to the 6-page paper. In this addendum, he starts by mentioning randomization in the very first paragraph, and discusses the question of minimizing the expected number of moves. Analyzing it as the sum of all moves made over all 1296 possible codewords, he mentions a few papers:

His original algorithm gave 5801 (average of 5801/1296 ≈ 4.47608), and the minor improvement gives 5800 (≈ 4.4753).
Robert W. Irving, “Towards an optimum Mastermind strategy,” Journal of Recreational Mathematics 11 (1978), 81-87 [while staying within the “at most 5” achieves 5664 ⇒ ≈4.37]
E. Neuwirth, “Some strategies for Mastermind,” Zeitschrift fur Operations Research 26 (1982), B257-B278 [achieves 5658 ⇒ ≈4.3657]
Kenji Koyama and Tony W. Lai, “An optimal Mastermind strategy,” Journal of Recreational Mathematics 25 (1993), 251-256 [achieves 5626 ⇒ ≈4.34104938]

The last of these is the best possible, as it was found with an exhaustive depth-first search. (Note that all of these papers can do slightly better in the expected number of moves, if you allow them to take 6 moves sometimes... I gave the numbers with the “at most 5” constraint because that's what the question here asks for.)

You can make this more general (harder) by assuming the codemaker is adversarial and does not choose uniformly at random among the 1296 possible codewords, but according to whatever distribution will make it hardest for the codebreaker. Finally he mentions a lot of work done by Tom Nestor, which conclusively settles many such questions.

You might have fun trying to follow up or reproduce these results (e.g. write the exhaustive search program). Enjoy!

score 0 · Accepted Answer · answered Feb 27 '19 at 22:09

0

As far as I know, up till now there is no published work about this effect yet. I have made this observation some time ago, one can get better results by not always choosing the (canonically) first trial out of the "one-step-lookahead-set". I observed the different results by not starting with 1122 but with e.g. with 5544. One can also try to choose randomly and not use the canonically first. Yes, I agree with you, that is an interesting point - but a very, very special one.

answered Feb 27 '19 at 22:09

Excalibur

286
3
11

Thanks for resurrecting this question; it got me to finally look it up. Not only does there exist more published work, some of it is even referenced from the Wikipedia article on this game. :-) – ShreevatsaR Feb 28 '19 at 02:51
@ShreevatsaR: You are right, of course there exist better algorithm since many years, and since 1993 an optimal algo was published by Koyama. My answer was just focusing on the observation of Alon, that with a different start code a slightly better result than the one published by Knuth can be found (and using the Knuth algorithm). – Excalibur Feb 28 '19 at 18:12
@GilShafriri It's unlikely that when a better algorithm is found, someone will publish details about tweaks to a worse algorithm, so there may indeed be no published work on this particular tweak to Knuth's selection procedure. On the other hand, the very first sentence of Knuth's addendum is “Of course, I wouldn't stick precisely to the strategy of Table 1 if I were the codebreaker in a real game. I'd randomize it,…” so some sort of randomization is already suggested IMO. – ShreevatsaR Mar 01 '19 at 00:48

Donald Knuth algorithm for Mastermind - can we do better?

2 Answers2