On the page http://mxnet.io/tutorials/python/mnist.html# the weight matrix dimension is incorrect

Question

Under the Multilayer Perceptron subsection, the documentation says that the weight matrix has a dimension of m x k. However, I think it should be k x m, because the output layer Y has a dimension of n x k.

Is my understanding correct?

score 0 · Answer 1 · answered Dec 11 '17 at 20:10

The correct matrix calculation formula for a FullyConnected layer is:

output = input x weight_transpose + bias

So in this formula, your input is n x m, your weight is k x m before being transposed, and your output is n x k. See the FullyConnected API doc here.
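To check the shapes, here is a minimal plain-Python sketch of the formula above. It does not call MXNet itself; the helper name `fully_connected` and the sizes n, m, k are illustrative assumptions only.

```python
# Plain-Python sketch of: output = input x weight_transpose + bias
# (illustrative only; this is not MXNet's implementation).

def fully_connected(x, weight, bias):
    """x is n x m, weight is k x m (one row per output unit), bias has length k."""
    return [
        [sum(x_j * w_j for x_j, w_j in zip(row, w_row)) + b
         for w_row, b in zip(weight, bias)]
        for row in x
    ]

n, m, k = 2, 3, 4                        # arbitrary example sizes
x = [[1.0] * m for _ in range(n)]        # input, n x m
weight = [[0.5] * m for _ in range(k)]   # weight, k x m (before transposing)
bias = [0.1] * k                         # one bias value per output unit

out = fully_connected(x, weight, bias)
shape = (len(out), len(out[0]))
print(shape)  # (2, 4), i.e. n x k
```

Each output row pairs one input row with every row of the k x m weight, which is the same as multiplying by the transposed (m x k) weight.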

If you're just getting started, it may make sense to focus on the new Gluon API, which is far easier to learn and debug.
