Why can't PyBrain Learn Binary

Question

I am attempting to get a network (PyBrain) to learn binary. This my code and it keeps return values around 8, but it should be return 9 when I activate with this target.

from pybrain.tools.shortcuts import buildNetwork
from pybrain.structure import *
from pybrain.datasets import *
from pybrain.supervised.trainers import BackpropTrainer
from matplotlib.pyplot import *


trains = 3000
hiddenLayers = 4
dim = 4
target = (1, 0, 0, 1)

ds = SupervisedDataSet(dim, 1)

ds.addSample((0, 0, 0, 0), (0,))
ds.addSample((0, 0, 0, 1), (1,))
ds.addSample((0, 0, 1, 0), (2,))
ds.addSample((0, 0, 1, 1), (3,))
ds.addSample((0, 1, 0, 0), (4,))
ds.addSample((0, 1, 0, 1), (5,))
ds.addSample((0, 1, 1, 0), (6,))
ds.addSample((0, 1, 1, 1), (7,))
ds.addSample((1, 0, 0, 0), (8,))


net = buildNetwork(dim, hiddenLayers, 1, bias=True, hiddenclass=SigmoidLayer)
trainer = BackpropTrainer(net, ds)

tests = []

for i in range(trains):
    trainer.train()
    tests.append(net.activate(target))


plot(range(len(tests)), tests)


print net.activate(target)
show()

I have tried adjusting the number hidden Layers, the hiddenclass from TanhLayer to SigmoidLayer and varied the number of trains, but it always converges around 500 times (training the network to the dataset). Should I be using a different trainer than back propagation and if so why?

What kind of output transfer function are you using? tansig? linear? — DrFalk3n, Sep 06 '16 at 08:18

score 2 · Accepted Answer · answered Sep 06 '16 at 05:51

You've built a network with 4 input nodes, 4 hidden nodes, and 1 output node, and 2 biases.

Considering each letter as the activation for that node, we can say each hidden node computes its activation as sigmoid(w₀*1 + w₁*A + w₂*B + w₃*C + w₄*D), and the output node computes its activation as (w₀*1 + w₁*E + w₂*F + w₃*G + w₄*H) (with no sigmoid). The number of lines in the diagram is the number of the weight parameters in the model that are tweaked during learning.

With so many parameters but only 9 samples to train on, there are many locally optimal, not-quite-right solutions that the network can converge to.

One way to fix this is to increase your number of training samples. You could generalize past 1s and 0s and offer samples such as ((0, 0, 1.0, 0.5), (2.5,)) and ((0, 1.2, 0.0, 1.0), (5.8,)).

Another option is to simplify your model. All you need for a perfect solution is 4 inputs hooked directly to the output with no biases or sigmoids. That model would only have 4 weights which training would set to 1, 2, 4, and 8. The final computation would be 1*A + 2*B + 4*C + 8*D.

score 1 · Answer 2 · answered Sep 06 '16 at 04:53

I would suggest you make the target something in the middle instead of on the fringe.

I tried expanding the training data upwards with 10 and 11, then it produced better results on predicting 9, even with 9 left out of the training data. Also you get a pretty good result if you try to predict 4, even if you do not have 4 in the training data.

From my experience I would not expect a neural net to readily guess numbers that are beyond the borders of the test data.

Why can't PyBrain Learn Binary

2 Answers2