
I'm new to this and I'm trying to replace the sigmoid activation function in the following simple NN with ReLU. Can I do that? I've tried replacing the sigmoid function, but it's not working. The output should be an AND gate (e.g. input (0, 0) -> output 0).

import numpy as np

# sigmoid function; with deriv=True it returns the derivative,
# assuming x is already the sigmoid output
def nonlin(x, deriv=False):
    if deriv:
        return x * (1 - x)
    return 1 / (1 + np.exp(-x))

# input dataset
X = np.array([[0, 0],
              [0, 1],
              [1, 0],
              [1, 1]])

# output dataset (AND of the two inputs)
y = np.array([[0, 0, 0, 1]]).T

# seed random numbers to make calculation
# deterministic (just a good practice)
np.random.seed(1)

# initialize weights randomly with mean 0
syn0 = 2*np.random.random((2, 1)) - 1

for _ in range(10000):

    # forward propagation
    l0 = X
    l1 = nonlin(np.dot(l0, syn0))

    # how much did we miss?
    l1_error = y - l1

    # multiply how much we missed by the slope of the sigmoid at l1
    l1_delta = l1_error * nonlin(l1, True)

    # update weights
    syn0 += np.dot(l0.T, l1_delta)
– Ali AzG
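
For concreteness, a direct ReLU drop-in for nonlin might look roughly like the sketch below (illustrative only, not code from the post). Calling it on the activated output l1, as the training loop does, still yields the right gradient mask, since relu(x) > 0 exactly when x > 0:

def relu(x, deriv=False):
    if deriv:
        # ReLU gradient: 1 where the activation is positive, 0 elsewhere
        return (x > 0).astype(float)
    return np.maximum(0, x)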
  • When the outputs are in the range [0, 1], it is better to keep the sigmoid function for the output layer. ReLU is best used in hidden layers (or when the output is not constrained to the range [0, 1]). – Jérémy Blain Nov 12 '18 at 10:27
  • @JérémyBlain is correct: ReLU's outputs are in the range [0, ∞), which isn't suitable for the final layer here. Also, consider posting questions like this on [Data Science Stack Exchange](https://datascience.stackexchange.com). – gavin Nov 12 '18 at 10:33
  • @JérémyBlain thank you! – andree17914 Nov 12 '18 at 11:43

0 Answers
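
Since there are no answers yet, here is a minimal sketch of the layout the comments suggest: ReLU in a hidden layer and sigmoid on the output so the predictions stay in [0, 1]. The hidden-layer size (4), the learning rate (0.1), and the variable names are illustrative choices, not something taken from the original post:

import numpy as np

def relu(x, deriv=False):
    if deriv:
        return (x > 0).astype(float)     # ReLU gradient: 1 where the activation is positive
    return np.maximum(0, x)

def sigmoid(x, deriv=False):
    if deriv:
        return x * (1 - x)               # assumes x is already the sigmoid output
    return 1 / (1 + np.exp(-x))

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([[0, 0, 0, 1]]).T           # AND gate

np.random.seed(1)
syn0 = 2 * np.random.random((2, 4)) - 1  # input  -> hidden (4 hidden units)
syn1 = 2 * np.random.random((4, 1)) - 1  # hidden -> output

lr = 0.1
for _ in range(10000):
    l0 = X
    l1 = relu(np.dot(l0, syn0))          # ReLU in the hidden layer
    l2 = sigmoid(np.dot(l1, syn1))       # sigmoid keeps the output in [0, 1]

    # backpropagate the error through both layers
    l2_error = y - l2
    l2_delta = l2_error * sigmoid(l2, True)
    l1_error = l2_delta.dot(syn1.T)
    l1_delta = l1_error * relu(l1, True)

    # update the weights
    syn1 += lr * l1.T.dot(l2_delta)
    syn0 += lr * l0.T.dot(l1_delta)

print(l2.round(3))                       # should approach [0, 0, 0, 1]

With this split, the hidden layer gets the benefits of ReLU while the sigmoid output still matches the 0/1 targets.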