Weights becoming "NaN" in implementation of Neural Networks

Question

I am trying to implement Neural Networks for classifcation having 5 hidden layers, and with softmax cross entropy in the output layer. The implementation is in JAVA.

For optimization, I have used MiniBatch gradient descent(Batch size=100, learning rate = 0.01)

However, after a couple of iterations, the weights become "NaN" and the predicted values turn out to be the same for every testcase.

Unable to debug the source of this error. Here is the github link to the code(with the test/training file.) https://github.com/ahana204/NeuralNetworks

score 1 · Answer 1 · answered Feb 10 '19 at 21:48

1

In my case, i forgot to normalize the training data (by subtracting mean). This was causing the denominator of my softmax equation to be 0. Hope this helps.

answered Feb 10 '19 at 21:48

Mohit Chawla

181
7

score 0 · Answer 2 · answered Apr 04 '18 at 12:56

0

Assuming the code you implemented is correct, one reason would be large learning rate. If learning rate is large, weights may not converge and may become very small or very large which could be shown NaN. Try to lower learning rate to see if anything changes.

answered Apr 04 '18 at 12:56

Seljuk Gulcan

1,826
13
24

Not working. I have added the code for review. Thanks! – 204 Apr 05 '18 at 13:38

Weights becoming "NaN" in implementation of Neural Networks

2 Answers2