I am undertaking a course in Neural Networks and the Professor introduced us to the XOR problem. I understand the XOR problem is not linearly separable and why we need to employ Neural Network for this problem.
However, he mentioned XOR works better with a bipolar representation(-1, +1) which I do not understand. I have looked into resources in different websites yet I can't really understand the reason.
I am wondering why a bipolar representation would be better than a binary representation? What's the rationale for that argument?