If I want to build a model that can predict a class 1 binary outcome that has very low incidence, for example 0.1% of the total in the test set. Would the training set also ideally need to have the same proportions, 1 class 1 to 999 class 0? Or could I train on a class balance of 50% class 1 and 50% class 0, but test on 0.1% class 1 and 99.9% class 0? Thanks in advance.
Asked
Active
Viewed 15 times