From what I understood from this article, the blue circles are the level curves and the blue dot is the optimal solution that minimizes the cost function. The yellow circle is the L2-norm constraint.
The solution that we need is the one that minimizes the cost function as much as possible and also, at the same time, is within the circle. Meaning, the solution is the tangent point between the yellow circle and the level curve.
But, my question is how this can be the solution if the W values at the tangent point don't completely minimize the cost function? Only the blue dot is the one that minimizes the cost function.