Correct me if I was wrong; the input image's dimension is 227x227x3
so after the first convolution layer the output dimension will be 55x55x(3x96)=55x55x288 not 55x55x96
.
See image bellow:
Correct me if I was wrong; the input image's dimension is 227x227x3
so after the first convolution layer the output dimension will be 55x55x(3x96)=55x55x288 not 55x55x96
.
See image bellow:
A convolutional layer is made of a number of filters with kernel size (n x m). Each filter has dimension (n x m x c), where c is the number of channels in the previous layer. In your example, the input is 227x227x3, and the convolutional layer has 96 filters with 11x11x3 parameters (kernel size is 11x11). Each filter creates a new output channel that is 55x55x1. The result is of dimension 55x55x96.