-2

I am trying to determine the exact mathematics used to train feed forward networks used for classification in deeplearning4j with stochastic gradient descent. I have tried stepping through the code but am getting lost in the forest.

Is this documented anywhere?

Brian Tompsett - 汤莱恩
  • 5,753
  • 72
  • 57
  • 129

1 Answers1

0

You can find it in the preOutput and backpropGradient methods in the various layers implementations:

Here's the basic implementation used in DenseLayers

And the more complex convolution one used in Convolutions

The implementation might be swapped out when using CuDNN or MKL however.

wm_eddie
  • 3,938
  • 22
  • 22